![]() | ![]() |
Formats:
|
||||||||||||
Copyright © 2009 by the Genetics Society of America Identification of a New Rice Blast Resistance Gene, Pid3, by Genomewide Comparison of Paired Nucleotide-Binding Site–Leucine-Rich Repeat Genes and Their Pseudogene Alleles Between the Two Sequenced Rice Genomes *State Key Laboratory of Plant Genomics and National Plant Gene Research Centre (Beijing), Institute of Genetics and Developmental Biology, Chinese Academy of Sciences, Beijing 100101, China, ‡Institute of Crop Research, Chinese Academy of Agricultural Sciences, Beijing 100081, China, †Graduate School of Chinese Academy of Sciences, Beijing 100101, China and §Beijing Institute of Genomics, Chinese Academy of Sciences, Beijing 101300, China 1These authors contributed equally to this work. 2Corresponding author: Institute of Genetics and Developmental Biology, Chinese Academy of Sciences, Beijing 100101, China. E-mail: lhzhu/at/genetics.ac.cn Communicating editor: A. H. Paterson Received March 16, 2009; Accepted May 28, 2009. Abstract Rice blast, caused by Magnaporthe oryzae, is one of the most devastating diseases. The two major subspecies of Asian cultivated rice (Oryza sativa L.), indica and japonica, have shown obvious differences in rice blast resistance, but the genomic basis that underlies the difference is not clear. We performed a genomewide comparison of the major class of resistant gene family, the nucleotide-binding site–leucine-rich repeat (NBS–LRR) gene family, between 93-11 (indica) and Nipponbare (japonica) with a focus on their pseudogene members. We found great differences in either constitution or distribution of pseudogenes between the two genomes. According to this comparison, we designed the PCR-based molecular markers specific to the Nipponbare NBS–LRR pseudogene alleles and used them as cosegregation markers for blast susceptibility in a segregation population from a cross between a rice blast-resistant indica variety and a susceptible japonica variety. Through this approach, we identified a new blast resistance gene, Pid3, in the indica variety, Digu. The allelic Pid3 loci in most of the tested japonica varieties were identified as pseudogenes due to a nonsense mutation at the nucleotide position 2208 starting from the translation initiation site. However, this mutation was not found in any of the tested indica varieties, African cultivated rice varieties, or AA genome-containing wild rice species. These results suggest that the pseudogenization of Pid3 in japonica occurred after the divergence of indica and japonica. THE two elite rice varieties with sequenced genomes, Nipponbare from Japan and 93-11 from China, represent the two major Asian cultivated rice subspecies, Oryza sativa L. japonica and indica, respectively. Sequencing these two genomes (Goff et al. 2002; Yu et al. 2002, 2005; IRGSP 2005) has opened an avenue for exploring the genomic basis of agricultural trait differences between these two subspecies (Garris et al. 2005). Rice blast caused by Magnaporthe oryzae is one of the most devastating diseases in both indica and japonica varieties. Many rice breeders have observed the difference in blast resistance between the two subspecies. Genetic characterization of blast resistance has suggested that indica varieties have more complicated blast resistance patterns than japonica varieties (Mackill et al. 1985; Yu et al. 1987; Mackill and Bonman 1992; Pan et al. 1996; Imbe et al. 1997). In contrast to other Asian countries, China has a long history of growing both indica and japonica varieties (Normile 1997; Zong et al. 2007). When Chinese rice varieties were tested with different Japanese M. oryzae strains, most of the japonica varieties could be classified into groups according to their respective disease-response patterns to the M. oryzae strains. However, when Chinese indica varieties were tested with the same stains, it was difficult to analyze their disease-response patterns because some were resistant to all the strains (Kiyosawa 1984; Ling 1984). This suggests that the indica and japonica varieties have diverged significantly in resistance to M. oryzae strains. Plants face frequent threats of many potential pathogens, including viruses, bacteria, and fungi, in their environments. During the long coevolution with pathogens, plants have evolved sophisticated mechanisms to perceive and defend against a pathogen's attack (Dangl and Jones 2001). Innate immunity, the most important of the plant defense responses, can be classified into “basal” and “R-gene-mediated” defenses (Abramovitch et al. 2006). The R-gene-mediated defense was originally proposed by H. H. Flor (Flor 1956). This type of resistance is highly effective and easily manipulated in breeding programs to improve crop resistance. The majority of R proteins contain a nucleotide-binding site and a carboxy-terminal leucine-rich repeat domain (Martin et al. 2003). Notably, almost all the functionally identified nucleotide-binding site–leucine-rich repeat (NBS–LRR) genes are R genes with only one exception (Faigon-Soverna et al. 2006) and thus this class of genes is regarded as the core component of the plant immune system (Nimchuk et al. 2003). Thus far, 10 rice blast R genes [Pib (Wang et al. 1999), Pita (Bryan et al. 2000), Pi9 (Qu et al. 2006), Pid2 (Chen et al. 2006), Pi2 (Zhou et al. 2006), Piz-t (Zhou et al. 2006), Pi36 (Liu et al. 2007), Pi37 (Lin et al. 2007), Pikm (Ashikawa et al. 2008), and Pi5 (Lee et al. 2009)] have been identified via map-based cloning methods. All the cloned blast resistance genes encode NBS–LRR proteins except for Pid2, which encodes a receptor-like kinase (Chen et al. 2006). The large number of published blast R gene isolations reveals the importance of the NBS–LRR gene family in rice blast disease resistance. Recently, a genomewide investigation of the NBS–LRR genes of the two sequenced rice genomes was reported (Yang et al. 2006). However, that study focused only on genetic variations among common R gene sequences in the two genomes, but did not associate the genetic variations with the differential disease resistance between indica and japonica rice. In view of the difference in blast resistance between indica and japonica varieties described above, we performed a genomewide comparison of NBS–LRR gene sequences between 93-11 and Nipponbare. Our investigation focused on the systemic changes in the NBS–LRR gene sequences between the two genomes. In the survey, we found many potential NBS–LRR pseudogenes in both 93-11 and Nipponbare genomes due to premature stop codons or frameshift mutations in the ORFs. Notably we found great differences in constitution and distribution of pseudogenes between the two genomes. The rice genome has been estimated to carry ~500 NBS–LRR genes (Bai et al. 2002; Monosi et al. 2004; Zhou et al. 2004). Although most of the annotated rice NBS–LRR genes have not been functionally identified, we can expect that if a complete NBS–LRR gene functions as a blast R gene in one genome, its allelic pseudogene in the other genome might lose the function of blast resistance. Thus, we believe that the NBS–LRR pseudogene alleles can be used as cosegregation markers for blast susceptibility in an appropriate rice segregation population. In this study, we demonstrated the feasibility of this approach. We applied this novel strategy to the indica variety, Digu, and successfully identified a new blast R gene, Pid3. MATERIALS AND METHODS Plant materials: The indica and japonica varieties used in this study are listed in supporting information, Table S1. The accessions and geographic origins of the 36 lines of wild rice and 4 varieties of African cultivated rice are listed in Table S2. Fungal inoculation: The M. oryzae strains used in this study were collected and isolated from indica- and japonica-cultivated regions in China or Japan. The M. oryzae stain Zhong-10-8-14 was isolated from a japonica-cultivated region in northern China. We expected to use this strain to screen useful R genes from the highly resistant indica variety, Digu. The isolated R genes might be used to improve the blast resistance of Chinese japonica varieties. Rice seedlings at the four-leaf stage were inoculated by spraying a spore suspension (5 × 104 spores/ml) of the M. oryzae strain onto the leaves. Subsequently, the inoculated plants were placed into a chamber maintained at 25° and 100% humidity in the dark for 24 hr and then transferred to a greenhouse. The disease reaction was examined 1 week after inoculation with the susceptible variety, TP309, as a control. Identification of NBS–LRR seed sequences of rice: By searching with the keyword “NBS,” we retrieved 68 protein sequences from the Swiss-Prot database (http://www.ebi.ac.uk/swissprot/). A set of 32,127 rice full-length cDNA sequences was downloaded from the KOME database (http://cdna01.dna.affrc.go.jp/cDNA/). The cDNA sequences were translated in six-frame and the longest ORF was reported as the putative protein sequence. A set of 153 sequences with the NBS motif was selected from the complete set of predicted full-length (fl)-cDNA supported rice proteins, using a hidden Markov model (HMM) (Eddy 1998) for the NBS domain from the Pfam database (PF0931; http://pfam.wustl.edu) (Bateman et al. 2002). The 153 protein sequences were aligned only on the basis of the NBS domain, using hmmalign. This alignment was then used to develop a rice-specific HMM model to identify related sequences. The refined HMM was compared again with the complete set of fl-cDNA sequences, using estwisedb in the wise2 software package (ftp://ftp.ebi.ac.uk/pub/software/unix/wise2/) (Birney et al. 2004). All sequences that matched the model with a score of ≥25 were extracted (http://www.ebi.ac.uk/Tools/Wise2/doc_wise2.html#SECTION00055000000000000000). In total, we obtained 166 nonredundancy cDNA sequences that encoded the NBS domain. We identified a subset of them as seed sequences according to three criteria: (1) The cDNA sequences were aligned against the closest matching protein among 68 known NBS proteins, using estwise without obvious ORF-disrupted mutations (such as premature stop codons or frameshifts); (2) the noncoding regions of cDNA were initially searched against 68 known NBS protein sequences, using tblastn with an expectation score of >0.1; and (3) all sequences that met the previous criteria were verified by cross-validation, using estwise without obvious disablements. We found a total of 102 seed proteins, including 99 fl-cDNA-supported proteins and the 3 proteins encoded by Pib (Wang et al. 1999), Pita (Bryan et al. 2000), and Pi9 (Qu et al. 2006). Alignment of protein amino acid sequences with nucleotide sequences was performed with the wise2 package. Identification of NBS–LRR genes and pseudogenes in rice: Nipponbare and 93-11 genome sequences were downloaded from the TIGR Rice Genome Annotation Database (version 4.0, http://rice.plantbiology.msu.edu/) and RISE Database (http://rise.genomics.org.cn), respectively. To identify the regions that contain the NBS domain, the genomes were scanned with a rice-specific NBS HMM model, using Genewisedb with a score of ≥25. A set of 653 regions was detected from the database of Nipponbare, and a set of 553 regions was detected from that of 93-11. Using the BGI FGF pipeline (http://fgf.genomics.org.cn/), we compared the genome regions containing the NBS domain with the closest matching seed protein, using Genewise. If there was a continuous span of sequence that was ≥70% of the length of the closest matching seed protein, we designated that sequence as an NBS–LRR gene; otherwise it was designated as an NBS–LRR gene segment. If an NBS gene contained a premature stop codon or frameshift mutation when compared with the seed protein, it was designated as a pseudogene. Identification of Nipponbare and 93-11 NBS–LRR allelic pairs: We generated the Nipponbare and 93-11 NBS–LRR allelic pairs by comparing the gene sequences from the two genomes. We conducted a multiple sequence alignment of the 1206 protein sequences that contained an NBS domain from the Nipponbare and 93-11 genomes using ClustalW (Chenna et al. 2003), including 267 NBS–LRR protein segments. Then the least divergent NBS–LRR gene pairs between the Nipponbare and 93-11 genomes were extracted from the phylogenetic tree, which was generated by using PAUP 4.0 (Swofford 2000) based on the neighbor-joining (NJ) method. If they met all the following criteria, the least divergent pairs were assigned as NBS–LRR allelic pairs: (1) neither gene in the pair is an NBS–LRR segment, (2) both genes are located at the corresponding locus of the homologous chromosomes, and (3) the nucleotide sequence identity of the pair is ≥82%. Expression analysis of Pid3: The four-leaf stage plants of the resistant indica variety Digu and the susceptible japonica variety TP309 were inoculated with M. oryzae strain Zhong-10-8-14. Total RNAs were isolated from the leaf tissue at 0, 3, 6, 12, 24, 48, and 72 hr, respectively, after inoculation. An equal amount of total RNA was pretreated with RNase-free DNase I (Promega, Madison, WI) and subjected to reverse transcription, using the Reverse Transcription System (Promega). The semiquantitative RT–PCR with the Pid3-specific primers 5′-TACTACTCATGGAAGCTAGTTCTC-3′ and 5′-ACGTCACAAATCATTCGCTC-3′ was performed for 30 cycles. The RT–PCR with Actin-specific primers 5′-CCTCGTCTCGACCTTGCTGGG-3′ and 5′-GAGAACAAGCAGGAGGACGGC-3′ was performed for 25 cycles as a control. Sequence analysis of Pid3: The cDNA sequence of Pid3 was obtained through RACE–PCR and RT–PCR analysis. RACE–PCR was carried out using the SMART RACE cDNA Amplification kit (Clontech) according to the manufacturer's instructions. The Pid3-specific primers 5′-CCATCTTGCACATCCTCTTGAGTG-3′ and 5′-ACATCTTATTGGAGAAGCAC-3′ were used in 5′-RACE and 3′-RACE, separately. The primers 5′-GAAGCTAGCAAGCTATGGCGGAG-3′ and 5′-ACGTCACAAATCATTCGCTC-3′ were used in RT–PCR. The 5′- and 3′-RACE ends and RT–PCR products were cloned into the pGEM-T-easy vector (Promega) prior to sequencing. The coding regions of Pid3 alleles in Digu, 93-11, Zaiyeqing8, LTH, Nipponbare, TP309, and the wild rice A4 were amplified from the respective genomic DNAs by using the primers 5′-AGTAACACCCAAGGATAGGATAG-3′ and 5′-GAACGACAAGTGCGACATGATTG-3′. The PCR products were cloned into the pGEM-T-easy vector (Promega) and confirmed by DNA sequencing. Analysis of the pseudogenization of the Pid3 alleles: To examine the Pid3 nonsense mutation locus, we designed two derived cleaved amplified polymorphic sequence (dCAPS) markers. With dCAPS marker Pid3-dCAPS-1, a 178-bp fragment was amplified using the primers 5′-TACTACTCATGGAAGCTAGTTCTC-3′ and 5′-GCAGCACTTCTTGACTACTGTCTGT-3′ and then digested with XbaI. With dCAPS marker Pid3-dCAPS-2, a 178-bp fragment was amplified using the primers 5′-TACTACTCATGGAAGCTAGTTCTC-3′ and 5′-AGCACTTCTTGACTACTGTCTGCCT-3′ and then digested with BamHI. Vector construction and rice transformation: The 6235-bp genomic sequence of Pid3 containing the promoter region and the full coding region was amplified from genomic DNA of Digu, using the forward primer 5′-ACCGAATTCCACACATTGTACACCTACGACCAC-3′ (EcoRI recognition site is underlined) and the reverse primer 5′-ACCGTCGACGAACGACAAGTGCGACATGATTG-3′ (SalI recognition site is underlined), and then cloned into the binary vector pCAMBIA1300 through the EcoRI and SalI cloning sites. After sequence verification the final construct was introduced into Agrobacterium tumefaciens LBA4404. The callus of susceptible japonica variety TP309 was transformed according to published methods (Hiei et al. 1994). The resistance of the primary transgenic lines (T0) and of the self progenies (T1) of the T0 lines was challenged by inoculation with the M. oryzae strain Zhong-10-8-14. Segregation of the transgene in T1 plants was confirmed by dCAPS marker Pid3-dCAPS-1; double bands containing 178- and 153-bp fragments can be checked on the agarose gel if the T1 plants contained the transgenic fragment. RESULTS Genomewide comparison of the NBS–LRR genes between 93-11 and Nipponbare: To understand the genomic difference in blast resistance between indica and japonica varieties, we compared the NBS–LRR gene sequences between 93-11 and Nipponbare with a focus on the intact genes in one genome and their allelic pseudogenes in the other. We surveyed both 93-11 and Nipponbare genomes and reannotated all the NBS–LRR genes and their pseudogene counterparts, using a modified homology-based method for human pseudogene annotation (Zhang and Gerstein 2003a; Zhang et al. 2006). First, using an optimized HMM for the rice NBS domain, we identified 653 and 553 gene loci with the NBS domain from the 93-11 and Nipponbare genomes, respectively. Second, we defined a set of 102 seed protein sequences as intact NBS–LRR proteins (see details in materials and methods). The seed sequences were derived from 99 fl-cDNA (Kikuchi et al. 2003) supported genes and 3 cloned NBS–LRR genes (Wang et al. 1999; Bryan et al. 2000; Qu et al. 2006). Finally, we selected the other NBS–LRR genes from the NBS domain loci in either the 93-11 or the Nipponbare genome if they had a continuous homology span on the closest matching seed protein sequence in at least 70% of its length. The homology search in the 93-11 genome resulted in 471 NBS–LRR genes among the 653 NBS domain loci. However, 184 of them were further identified as pseudogenes due to frameshift mutation or nonsense mutation or both. Meanwhile, in the Nipponbare genome, we identified 468 NBS–LRR genes from 553 gene loci, of which 150 were considered to be pseudogenes. The detailed information on this analysis is available in Table S3. All of the annotated NBS–LRR genes and pseudogenes have been submitted to GenBank as third party annotation (accession nos.: annotated Nipponbare NBS–LRR intact genes, FJ771051–FJ771368; annotated Nipponbare NBS–LRR pseudogenes, FJ771369–FJ771518; annotated 93-11 NBS–LRR intact genes, FJ771519–FJ771802; and annotated 93-11 NBS–LRR pseudogenes, FJ771803–FJ771989). Next, we identified the NBS–LRR gene pairs by the most similarity criterion between the two genomes. Each NBS–LRR gene in one genome was paired only once to the closest homolog in the other. However, because of chromosome DNA rearrangements in some regions where some of the NBS–LRR clustered gene family locates, not every NBS–LRR gene of either Nipponbare or 93-11 has its closest match at the same loci in the other genome. Of the total 337 pairs from the two rice genomes, 301 were considered allelic, while the other 36 were nonallelic due to their different chromosome locations (Figure 1
We compared our annotated 150 NBS–LRR pseudogenes in Nipponbare with the corresponding genes annotated by TIGR (Yuan et al. 2005). In the corresponding TIGR models, the fragments that contain frameshift or nonsense mutation sites in the 90 pseudogenes we annotated and those in the other 34 pseudogenes were taken as introns and untranslated regions (UTRs), respectively. The pseudogenization ratio of the KOME fl-cDNA NBS–LRR genes (27%, 41/152) is similar to that of the genomewide annotated NBS–LRR genes (32%, 150/468) in Nipponbare. These results also suggest that most NBS–LRR pseudogenes are expressed as intact genes. Identifying an unknown blast R gene locus using an NBS–LRR pseudogene as a molecular marker: To analyze the genetic relationship between the pseudogenization of NBS–LRR genes and rice blast resistance, an F2 population was developed from a cross between the blast-resistant indica variety, Digu, and the susceptible japonica variety, TP309. A total of 473 individuals of the F2 population were inoculated with the M. oryzae strain Zhong-10-8-14. This M. oryzae strain was isolated from a japonica-cultivated area in northern China, and it is compatible to the previously cloned blast resistance gene Pid2 from the variety, Digu. In the F2 population, most plants were resistant, but we were able to clearly distinguish 24 susceptible individuals. Thus, the segregation ratio of resistance to susceptibility is 15:1 (resistant:susceptible, 449:24; χ215:1 = 0.04), indicating that Digu may contain two genes that confer resistance to the strain Zhong-10-8-14. We designed the PCR-based molecular markers specific to the 35 Nipponbare pseudogenes in the above-described 35 allelic NBS–LRR gene pairs in the third group. (See the structures of the allelic gene pairs in Figure S2 and see the PCR primers and restriction enzymes used in these markers in Table S4.) We used these primers to survey the 24 susceptible individuals and the parent varieties Digu and TP309. We found a candidate pseudogene on chromosome 6 that could be detected in all 24 susceptible individuals and TP309 but not in Digu. This suggests that the intact NBS–LRR allele in Digu corresponding to this pseudogene is most likely one of the two functional blast resistance genes against Zhong-10-8-14. We named this blast R gene Pid3, as we found it has a different chromosomal location from the two previously identified Pid(t)1 and Pid2 from Digu (Chen et al. 2004, 2006). Cloning and functional analysis of Pid3: We amplified and sequenced the candidate NBS–LRR gene Pid3 and its alleles from the genomes of the rice varieties Digu, 93-11, TP309, and Nipponbare. Both the ORFs of the Digu Pid3 candidate and the allele from 93-11 consist of the same 2772-nucleotide (nt) sequence, encoding a protein with 924 amino acids. However, the alleles in japonica varieties TP309 and Nipponbare contain the same premature stop codon due to a C to T nonsense mutation. This mutation leads to the production of a truncated 737-amino-acid polypeptide with a disrupted LRR region (Figure 2A
Using the DNA or protein sequences of the candidate Pid3 as queries, BLAST searches in GenBank (http://www.ncbi.nih.gov/blast) and KOME rice fl-cDNA databases (http://cdna01.dna.affrc.go.jp/cDNA/) were performed, respectively. No corresponding mRNA or fl-cDNA could be found. A 521-nt EST sequence (GenBank accession no. CK083629) was found to match the 3′ region of the candidate Pid3, indicating that the candidate Pid3 is transcribed. We performed semiquantitative RT–PCR analysis of Pid3 in Digu and TP309 infected with M. oryzae Zhong-10-8-14. The expression analysis indicated that Pid3 was constitutively expressed in both the resistant indica variety Digu and the susceptible japonica variety TP309 (see Figure S3). Through RT–PCR and RACE–PCR we obtained the mRNA sequence of the candidate Pid3, including a 2772-nt coding sequence, a 63-nt 5′-UTR, and a 135-nt 3′-UTR, with the TIGR-predicted intron around the nonsense mutation also being included in the mature mRNA. The candidate Pid3 encodes a 924-amino-acid polypeptide that contains a conserved NBS domain in positions 158–466 from the translation initiation site. The NBS domain includes four sequence motifs, GMGGIGKTA (positions 202–210), KRYVLVLDDVW (positions 280–290), IGRIILTSRNYDV (positions 307–319), and GLPIAI (positions 373–378), corresponding to kinase 1a (p-loop), kinase 2, kinase 3a, and GLPL motifs, respectively. At the C terminus is the LRR region that comprises 13 imperfect LRR repeats. The MHD motif, MHDILRV (positions 502–508), and the NBS–LRR linker motif, EQNFCIVVNHS (positions 516–526), are present between the NBS domain and the LRR region. At the N terminus, there is a conserved motif, RSLALSIEDVVD (positions 78–89), but no TIR or coiled-coil motif is found (Figure 2B To confirm that Pid3 confers blast resistance to the strain Zhong-8-10-14, we performed a genetic complementation test. Using Digu's genomic DNA as template, a 6235-bp DNA fragment, including the entire 2772-bp coding region, a 3010-bp upstream region, and a 451-bp downstream region, was amplified by high-fidelity PCR. After sequencing verification, this fragment was subcloned into the binary vector pCAMBIA1300, which was then introduced into susceptible variety TP309 by Agrobacterium-mediated transformation. We obtained 21 independent primary transgenic lines (T0), all with resistance to the M. oryzae strain Zhong-10-8-14. Cosegregation of the transgene and blast resistance were confirmed in selfed progenies (T1) of the T0 lines (Figure 3
Comparison of the Pid3 nonsense mutation among indica, japonica, and wild rice: To test if the ORF-disrupted mutation of Pid3 occurs in other rice varieties, we designed specific dCAPS markers that can distinguish the mutation alleles (see the primers and restriction enzymes used in the dCAPS markers in Table S4). In the japonica collection, 29 of 32 varieties contain the same C to T mutation, whereas none of the varieties in the indica collection contains this mutation. This result indicates that Pid3 is replaced by its pseudogene in most tested japonica varieties (Figure S4). The genus Oryza includes 2 cultivated species and 21 wild species (Khush 1997). O. sativa, Asian cultivated rice, is cultivated all over the world while O. glaberrima, African cultivated rice, is grown on a small scale in western Africa (Khush 1997). On the basis of gene transfer, the 2 cultivated species, O. sativa and O. glaberrima, and 6 wild species, O. rufipogon, O. nivara, O. glumaepatula, O. meridionalis, O. breviligulata, and O. longistaminata, have been grouped into a primary gene pool (Khush 1997). These 8 Oryza species all share the AA genome and the gene transfer among them can be accomplished through conventional hybridization and selection procedures (Khush 1997). Using the above-mentioned dCAPS markers, we examined whether the nonsense mutation in Pid3 occurs in four African rice varieties and a collection of 36 wild rice lines belonging to the 6 AA genome wild species. The accessions and geographic origins of the wild rice and African cultivated rice are listed in Table S2. Notably, we found that none of the tested African cultivated varieties or the tested wild rice lines contains the nonsense mutation. These results suggest that the mutation creating the nonfunctional allele of Pid3 occurred in O. sativa subspecies japonica after its split from subspecies indica. Next we sequenced the respective allelic Pid3 coding regions of the O. rufipogon wild rice line A4, the indica variety Zhaiyeqing 8 (ZYQ8) that had shown resistance to Zhong-10-8-14 (Zhu et al. 1993), and the japonica variety LTH that had been regarded as an universal blast-susceptible variety lacking all known major blast R genes (Ling et al. 2000; Tsunematsu et al. 2000). Protein sequence comparison revealed a few amino acid changes among the Pid3 proteins of A4, Digu, ZYQ8, 93-11, Nipponbare, TP309, and LTH (see the protein sequence alignment in Figure S5). The sequences of the Pid3 alleles were submitted to GenBank [accession nos. FJ745365 (Pid3-A4), FJ745366 (Pid3-ZYQ8), FJ745367 (Pid3-TP309), FJ745368 (Pid3-LTH), FJ773285 (Pid3-9311), and FJ773286 (Pid3-Nip)]. DISCUSSION As a major R gene family in plants, NBS–LRR genes have been studied extensively. These investigations, however, have been focused mainly on the functional genes with a complete NBS–LRR coding sequence. Genomewide analyses of NBS–LRR genes in Arabidopsis (Columbia) and rice (Nipponbare) have revealed that a large number of NBS–LRR genes encoded in these two genomes are pseudogenes due to frameshift mutations or premature stop codons (Meyers et al. 2003; Monosi et al. 2004). At least 12 of 149 (~8%) NBS–LRR genes in Arabidopsis (Columbia) (Meyers et al. 2003) and >100 of the ~500 (20%) NBS–LRR genes in rice (Nipponbare) (Monosi et al. 2004) were predicted as pseudogenes. In these previous studies, manual reannotation was used to verify the results obtained from automated annotation programs. However, pseudogenes are often misincorporated into gene collections due to their high level of sequence homology to their real gene counterparts (Nelson 2004). It has been demonstrated that either a frameshift mutation or a nonsense mutation in an internal exon may force gene-prediction programs to adjust their splicing patterns and thus transform the pseudogenes into variable gene models (Harrison et al. 2002; Zhang and Gerstein 2004; Zhang et al. 2004, 2006; Zhu et al. 2006). In our study, we used a homology-based method derived from a pipeline that has been successfully used in human pseudogene annotation (Zhang and Gerstein 2003b; Zhang et al. 2006) to reannotate the complete sets of NBS–LRR genes and pseudogenes in 93-11 and Nipponbare genomes. According to our annotation, >30% of the NBS–LRR genes are pseudogenes in either the 93-11 or the Nipponbare genomes, greater than the 20% that was reported previously (Monosi et al. 2004). The number of annotated NBS–LRR genes in 93-11 (471) is similar to that in Nipponbare (468). However, the number of NBS-domain-containing loci in 93-11 (653) is more than that in Nipponbare (553). The higher number of NBS-domain-containing loci in 93-11 could be artificiality inflated from the “whole-genome shotgun” strategy applied to the 93-11 genome sequencing. Sequence assembly errors in this method might put fragments from a single locus into multiple loci. Although much effort has been made to obtain a high-quality genome assembly during sequencing of 93-11 and Nipponbare, sporadic mistakes in the genome databases are inevitable and would result in misannotated genes and overestimation of the number of pseudogenes during our analysis using these databases. Thus far, >70 blast R-gene loci have been identified and mapped on rice chromosomes (Ballini et al. 2008). Some of the genes have been successfully used in blast resistance breeding. When a map-based cloning strategy is used in the identification of R genes, resistance gene analog (RGA) markers that are designed according to the conserved NBS domain could be used to identify the corresponding candidate NBS–LRR resistance gene (Mago et al. 1999; Chauhan et al. 2002; Zhuang et al. 2002; Lee et al. 2003; Sallaud et al. 2003; Chen et al. 2004). In practice, however, successful examples of isolation of an NBS–LRR resistance gene with RGA markers have rarely been reported. The ineffectiveness of this method is possibly due to the complex nature of NBS–LRR genes in the genome. In contrast to the use of RGA markers, we designed molecular markers on the basis of the pseudogenization of NBS–LRR genes. We analyzed the relationship between the NBS–LRR pseudogenes and blast susceptibility in an appropriate population derived from resistant and susceptible varieties. In our method, when cosegregation of a tested pseudogene with the susceptible phenotype is observed, the intact allele can be regarded as the candidate blast R gene. This strategy has two advantages. First, only a small number of susceptible individual plants need to be analyzed with the landing markers. Second, scoring susceptible plants is more precise than scoring resistant plants. In this study, we demonstrated the effectiveness of this strategy by identifying a previously unknown R gene, Pid3, from the blast-resistant variety, Digu, in contrast to our labor-intensive cloning of the Pid2 gene from the same variety (Chen et al. 2006). In fact, we also tried to identify the blast R gene other than Pid3 revealed in the genetic analysis of the F2 population by using the same strategy, but we could not find any other pseudogene marker associated with the susceptible individuals. This implies that not all of the susceptible alleles of blast R genes are pseudogenes. For example, one amino acid change resulted in the loss of resistance of Pita (Bryan et al. 2000) and Pi36 (Liu et al. 2007). Acknowledgments We thank Li Heng and Zheng Hongkun (Beijing Institute of Genomics) for their helpful suggestions and discussions. We thank Laura Bartley and Dante F. Placido for their careful revision of the manuscript. This research was supported by grants from the National High Technology Research and Development Program of China (863 Program) (2006AA10A101, 2006AA10Z192, and 20060110Z1142), the National Key Basic Research Science Foundation of China (2006CB101904), the Knowledge Innovation Program of the Chinese Academy of Sciences (KSCX2-YW-N-005), and the National Science Foundation of China (30640095 and 30550005). Notes Supporting information is available online at http://www.genetics.org/cgi/content/full/genetics.109.102871/DC1. Sequence data from this article have been deposited with the GenBank Data Library under accession no. FJ745364. References
|
PubMed related articles
Your browsing activity is empty. Activity recording is turned off. |
|||||||||||
Science. 2002 Apr 5; 296(5565):92-100.
[Science. 2002]Science. 2002 Apr 5; 296(5565):79-92.
[Science. 2002]PLoS Biol. 2005 Feb; 3(2):e38.
[PLoS Biol. 2005]Nature. 2005 Aug 11; 436(7052):793-800.
[Nature. 2005]Genetics. 2005 Mar; 169(3):1631-8.
[Genetics. 2005]Nature. 2001 Jun 14; 411(6839):826-33.
[Nature. 2001]Nat Rev Mol Cell Biol. 2006 Aug; 7(8):601-11.
[Nat Rev Mol Cell Biol. 2006]Annu Rev Plant Biol. 2003; 54():23-61.
[Annu Rev Plant Biol. 2003]Plant Cell. 2006 Nov; 18(11):2919-28.
[Plant Cell. 2006]Annu Rev Genet. 2003; 37():579-609.
[Annu Rev Genet. 2003]Plant J. 1999 Jul; 19(1):55-64.
[Plant J. 1999]Plant Cell. 2000 Nov; 12(11):2033-46.
[Plant Cell. 2000]Genetics. 2006 Mar; 172(3):1901-14.
[Genetics. 2006]Plant J. 2006 Jun; 46(5):794-804.
[Plant J. 2006]Mol Plant Microbe Interact. 2006 Nov; 19(11):1216-28.
[Mol Plant Microbe Interact. 2006]Plant Mol Biol. 2006 Sep; 62(1-2):181-93.
[Plant Mol Biol. 2006]Genome Res. 2002 Dec; 12(12):1871-84.
[Genome Res. 2002]Theor Appl Genet. 2004 Nov; 109(7):1434-47.
[Theor Appl Genet. 2004]Mol Genet Genomics. 2004 May; 271(4):402-15.
[Mol Genet Genomics. 2004]Bioinformatics. 1998; 14(9):755-63.
[Bioinformatics. 1998]Nucleic Acids Res. 2002 Jan 1; 30(1):276-80.
[Nucleic Acids Res. 2002]Genome Res. 2004 May; 14(5):988-95.
[Genome Res. 2004]Plant J. 1999 Jul; 19(1):55-64.
[Plant J. 1999]Plant Cell. 2000 Nov; 12(11):2033-46.
[Plant Cell. 2000]Nucleic Acids Res. 2003 Jul 1; 31(13):3497-500.
[Nucleic Acids Res. 2003]Plant J. 1994 Aug; 6(2):271-82.
[Plant J. 1994]Gene. 2003 Jul 17; 312():61-72.
[Gene. 2003]Bioinformatics. 2006 Jun 15; 22(12):1437-9.
[Bioinformatics. 2006]Science. 2003 Jul 18; 301(5631):376-9.
[Science. 2003]Plant J. 1999 Jul; 19(1):55-64.
[Plant J. 1999]Plant Cell. 2000 Nov; 12(11):2033-46.
[Plant Cell. 2000]Plant Physiol. 2005 May; 138(1):18-26.
[Plant Physiol. 2005]Plant J. 2006 Jun; 46(5):794-804.
[Plant J. 2006]Plant Mol Biol. 1997 Sep; 35(1-2):25-34.
[Plant Mol Biol. 1997]Plant Cell. 2003 Apr; 15(4):809-34.
[Plant Cell. 2003]Theor Appl Genet. 2004 Nov; 109(7):1434-47.
[Theor Appl Genet. 2004]Hum Genomics. 2004 May; 1(4):310-6.
[Hum Genomics. 2004]Genome Res. 2002 Feb; 12(2):272-80.
[Genome Res. 2002]Curr Opin Genet Dev. 2004 Aug; 14(4):328-35.
[Curr Opin Genet Dev. 2004]Mol Plant Microbe Interact. 2008 Jul; 21(7):859-68.
[Mol Plant Microbe Interact. 2008]Mol Genet Genomics. 2002 Jul; 267(5):603-12.
[Mol Genet Genomics. 2002]Mol Genet Genomics. 2003 Apr; 269(1):101-8.
[Mol Genet Genomics. 2003]Theor Appl Genet. 2003 Mar; 106(5):794-803.
[Theor Appl Genet. 2003]Plant J. 2006 Jun; 46(5):794-804.
[Plant J. 2006]