![]() | ![]() |
Formats:
|
||||||||||||||||||||||||
Copyright © 2008, American Society for Microbiology Diversity, Activity, and Evolution of CRISPR Loci in Streptococcus thermophilus †Danisco France SAS, BP10, F-86220 Dangé-Saint-Romain, France,1 Danisco USA, Inc., 3329 Agriculture Drive, Madison, Wisconsin 53716,2 Département de Biochimie et de Microbiologie, Faculté des Sciences et de Génie, Groupe de Recherche en Ecologie Buccale, Faculté de Médecine Dentaire, Félix d'Hérelle Reference Center for Bacterial Viruses, Université Laval, G1K 7P4 Québec, Canada3 *Corresponding author. Mailing address: Danisco USA, Inc., 3329 Agriculture Drive, Madison, WI 53716. Phone: (608) 395-2648. Fax: (608) 395-2721. E-mail: rodolphe.barrangou/at/danisco.com Received August 31, 2007; Accepted November 21, 2007. This article has been cited by other articles in PMC.Abstract Clustered regularly interspaced short palindromic repeats (CRISPR) are hypervariable loci widely distributed in prokaryotes that provide acquired immunity against foreign genetic elements. Here, we characterize a novel Streptococcus thermophilus locus, CRISPR3, and experimentally demonstrate its ability to integrate novel spacers in response to bacteriophage. Also, we analyze CRISPR diversity and activity across three distinct CRISPR loci in several S. thermophilus strains. We show that both CRISPR repeats and cas genes are locus specific and functionally coupled. A total of 124 strains were studied, and 109 unique spacer arrangements were observed across the three CRISPR loci. Overall, 3,626 spacers were analyzed, including 2,829 for CRISPR1 (782 unique), 173 for CRISPR2 (16 unique), and 624 for CRISPR3 (154 unique). Sequence analysis of the spacers revealed homology and identity to phage sequences (77%), plasmid sequences (16%), and S. thermophilus chromosomal sequences (7%). Polymorphisms were observed for the CRISPR repeats, CRISPR spacers, cas genes, CRISPR motif, locus architecture, and specific sequence content. Interestingly, CRISPR loci evolved both via polarized addition of novel spacers after exposure to foreign genetic elements and via internal deletion of spacers. We hypothesize that the level of diversity is correlated with relative CRISPR activity and propose that the activity is highest for CRISPR1, followed by CRISPR3, while CRISPR2 may be degenerate. Globally, the dynamic nature of CRISPR loci might prove valuable for typing and comparative analyses of strains and microbial populations. Also, CRISPRs provide critical insights into the relationships between prokaryotes and their environments, notably the coevolution of host and viral genomes. The dairy industry relies heavily on the use of microbial starter culture systems. Among domesticated bacteria widely used in industrial applications, Streptococcus thermophilus is a key species involved in the acidification of milk and the development of texture in various fermented dairy products (3). Recent advances in genomics have provided novel insights into the many critical physiological functions carried out by S. thermophilus in fermentation processes (12). Specifically, unraveling the full genome sequences of three different strains has led to a better understanding of genes involved in acidification, texture development, and flavor enhancement (3, 12, 20). Also, comparative genomic analyses have identified specific loci involved in particular traits attributed to selected strains and pointed out differential content, notably with regard to exopolysaccharide synthesis and phage resistance (3, 12). Recent studies have also established a correlation between distinctive genomic content such as clustered regularly interspaced short palindromic repeats (CRISPR) and resistance to phages (2, 4). CRISPRs are a peculiar family of DNA repeats widely distributed in Bacteria and Archaea (8, 9, 11, 13, 18). CRISPR loci usually consist of short and highly conserved DNA repeats, typically 21 to 48 bp, repeated up to 250 times (9). The repeated sequences, typically specific to a given CRISPR locus, are interspaced by variable sequences of constant and similar length, called spacers, usually 20 to 58 bp depending on the species or the CRISPR locus. Several distinct CRISPR loci can be located on a particular prokaryotic genome (18); for example, in Methanocaldococcus jannaschii, 18 distinct CRISPR loci have been identified on the chromosome, totaling almost 1% of the genome (5). In addition, cas (CRISPR-associated) genes are often present in the direct vicinity of CRISPR loci (8, 11, 14). Based on similarities between CRISPR spacers and phage or plasmid sequences (4, 21, 23), it was proposed in the literature that CRISPR and cas genes might be involved in conferring immunity to the host cell against foreign DNA (19, 21). In the S. thermophilus chromosome, two distinct CRISPR loci have been identified, namely, CRISPR1 and CRISPR2 (3, 4). Comparative analysis of CRISPR1 sequence between various S. thermophilus strains has revealed polymorphisms (4). In addition, it was recently reported that CRISPR provides acquired resistance against viruses in prokaryotes, notably in S. thermophilus (2). This is consistent with the putative CRISPR-cas immunity system based on RNA interference (RNAi) proposed by Makarova et al. (19), although the mechanism of action remains uncharacterized. The correlation between CRISPR spacer content and phage susceptibility suggests that spacer content might provide a historical perspective of phage exposure. Since the CRISPR system is reactive to the environment, it might play a critical role in the adaptation of the host to its surroundings and explain the persistence of particular bacterial strains in ecosystems where phages are present. CRISPRs may also provide insight into the codirected genomic evolution of the phage and its host. Here, we provide a comparative analysis of CRISPR activity and diversity across a collection of S. thermophilus strains. We specifically analyzed the relationship between spacer hypervariability and CRISPR locus activity across three distinct CRISPR loci. In addition, we provide insight into the functional coupling between a given CRISPR repeat and the accompanying cas gene set. Finally, we investigated the various origins of CRISPR spacers and discuss the evolutionary modifications of CRISPR loci in direct response to genetic elements present in the environment, notably bacteriophages and plasmids. MATERIALS AND METHODS Bacterial strains, bacteriophages, media, and growth conditions. S. thermophilus strains and their derivatives were grown in M17 broth (Difco) supplemented with 0.5% (wt/vol) lactose and 0.5% (wt/vol) saccharose, followed by incubation at 42°C. For bacteriophage exposure and phage-resistant mutant selection, experiments were carried out according to protocols previously outlined (2, 7). DNA sequencing of S. thermophilus CRISPR loci. S. thermophilus genomic DNA was extracted and purified by using a DNeasy Blood & Tissue kit (Qiagen). Prior to sequencing, each CRISPR locus was amplified by PCR with LA-Taq DNA polymerase (Takara/Lonza). PCR amplification of the CRISPR1 locus was performed with the primers yc70 (5′-TGCTGAGACAACCTAGTCTCTC-3′) (4) and CR1-rev (5′-TAAACAGAGCCTCCCTATCC-3′). The CRISPR2 locus was amplified by using the primers CR2-fwd (5′-TTAGCCCCTACCATAGTGCTG-3′) and CR2-rev (5′-TTAGTCTAACACTTTCTGGAAGC-3′), whereas primers CR3-fwd (5′-CTGAGATTAATAGTGCGATTACG-3′) and CR3-rev (5′-GCTGGATATTCGTATAACATGTC-3′) were used for the amplification of CRISPR3. Each PCR product was then subjected to DNA sequencing from each end with the same primers on a CEQ8000 system (Beckman). For long PCR fragments, sequence walking was achieved using internal primers designed in spacer sequences. Computational analyses. Microbial genome sequences were obtained from NCBI (http://www.ncbi.nlm.nih.gov). Also, previously published S. thermophilus CRISPR sequences were retrieved from the National Center for Biotechnology Information, including entries from Bolotin et al. (4) and Barrangou et al. (2). In addition, the CRISPR database CRISPRdb (9) and the CRISPR identification application CRISPRFinder (10) were used to retrieve and find CRISPR repeats and spacer sequences, respectively. CRISPR spacers were visualized as color combinations, as previously described (2). For sequence similarity analyses, comparisons to public sequences were carried out using BLASTn (1) at the National Center for Biotechnology Information. Matches to sequences found within S. thermophilus CRISPR loci were ignored, notably for published CRISPR sequences (DQ072985 to DQ073008 and EF434458 to EF434504) and S. thermophilus chromosomal CRISPR loci (CP000023, CP000024, and CP000419). All matches with a bit score above 40.0 were retained, corresponding to 100% identity over at least 20 bp, and only the top hit annotation was considered for classification. Secondary structure predictions were obtained by using the Mfold 3.2 program (26). Multiple sequence alignments and phylogenetic analyses were carried out by using CLUSTAL X (15). Sequence consensus motifs were visualized by using WebLogo (6). RESULTS Identification of CRISPR3 in S. thermophilus. Two CRISPR loci, CRISPR1 and CRISPR2, have previously been described in S. thermophilus strains CNRZ1066 and LMG 18311 (3, 4). The genome sequence of strain LMD-9 was recently determined (20). We identified a new CRISPR locus, CRISPR3, located at base pair positions 1377794 to 1377229 on the negative strand of the LMD-9 genome, between open reading frames (ORFs) STER_1474 and STER_1473 (Fig. (Fig.1A).1A
Surprisingly, the CRISPR3 locus is not present in the genomes of strains CNRZ1066 and LMG 18311, although the overall genome content is highly conserved between the three strains (3, 20). Nevertheless, one imperfect CRISPR3 repeat can be identified between ORFs str1518/stu1518 (hypothetical gene) and str1519/stu1519 (serB gene) in the genomes of CNRZ1066 and LMG 18311 (Fig. (Fig.1B).1B Overall, we investigated the occurrence of CRISPR3 in 66 strains and found this locus in 53 of them (80%). Specifically, among the four different structures documented (Fig. (Fig.1),1 Activity of CRISPR loci. We propose that CRISPR activity be defined as the ability of a CRISPR locus to integrate novel repeat-spacer units in response to exposure to foreign genetic elements, such as bacteriophage, to provide resistance. CRISPR1 activity in S. thermophilus has been demonstrated previously (2). Here, we provide additional experimental data showing novel spacer acquisition after exposure to bacteriophage in four different strains, as shown on Fig. Fig.2.2
We also investigated the ability of CRISPR3 to integrate novel spacers after phage challenge in phage-resistant mutants. We isolated bacteriophage-insensitive mutants (BIMs), derived from either DGCC7710 or LMD-9, after challenge with phage Φ858, Φ3821, or Φ4241 (Fig. (Fig.2).2 Overall, a total of 41 distinct CRISPR BIMs were isolated, including 37 for CRISPR1, none for CRISPR2, and 4 for CRISPR3. Accordingly, it appears that CRISPR1 has the highest ability to integrate novel spacers in response to phage exposure, followed by CRISPR3, whereas we have no evidence that CRISPR2 is active. CRISPR repeats are locus specific. Three distinct CRISPR loci have now been identified in S. thermophilus and may be simultaneously present on the chromosome, as is the case in LMD-9 (20). Usually, each CRISPR locus is defined by the sequence of the repeat. We define the typical repeat as the most frequent repeat within a CRISPR locus. The typical repeat sequences of the three S. thermophilus CRISPR loci are different (Table 1), although they all are 36 bp long. Sequence comparison between the three typical repeats showed that CRISPR1 and CRISPR2 share 30.6% identity, while CRISPR1 and CRISPR3 are 52.8% identical, and CRISPR2 and CRISPR3 share 47.2% nucleotides.
Sequence comparison within each CRISPR locus showed that, although the repeat sequence is usually highly conserved throughout the locus, polymorphisms can be observed, notably for the terminal repeat (Table 1). Specifically, sequence degeneracy is observed at the 3′ end of the terminal repeat. This observation is particularly important for the correct annotation and orientation of CRISPR loci, since the last spacer/repeat unit, including the terminal repeat, which is usually degenerate, is often missed, or the repeats are frequently annotated on the opposite DNA strand. The repeat orientation appears to be consistent with the orientation of adjacent cas genes. In addition, variations within the repeat sequence can be observed throughout a CRISPR locus. The ratios of atypical repeats were 0.3% for CRISPR1, 25% for CRISPR2, and 0.2% for CRISPR3 (Table 1), indicating that repeat sequence degeneracy was much higher for CRISPR2, while CRISPR1 and CRISPR3 seemed to possess highly conserved repeat sequences. Interestingly, for CRISPR3, no polymorphism was observed in the terminal repeat. CRISPR repeats may form secondary structures. Previous studies have suggested that CRISPR repeats may form stable hairpin-like secondary structures due to their partially palindromic nature (13, 16). Moreover, CRISPR loci appear to be transcribed as a single-stranded RNA molecule starting from the CRISPR leader (18, 22, 25). Accordingly, as transcription is progressing, two consecutive single-stranded CRISPR repeats may conceivably be able to interact and form secondary structures by pairing head to foot, giving rise to an even more stable structured RNA. We determined the putative secondary structures of the three S. thermophilus CRISPR repeats by using the Mfold program (26), as shown in Fig. Fig.3A.3A
Interestingly, assuming transcription of the whole CRISPR locus as a single-stranded RNA molecule, and if such a pairing between consecutive repeats would occur, every-other spacer would be constrained as a loop, whereas the remaining spacers would remain unconstrained, alternatively (Fig. (Fig.3B3B cas genes are locus specific. The three S. thermophilus CRISPR loci are associated with cas genes. For CRISPR1 and CRISPR3, the architecture is seemingly conserved, with four cas genes located upstream of the repeat-spacer region (Fig. (Fig.4).4
Although gene organization is similar between CRISPR1 and CRISPR3, sequence similarity is low, even at the protein level, as shown on Fig. Fig.4.4 Coupling CRISPR repeats/cas genes. Since we observed a correlation between the relatedness of cas genes and CRISPR repeats within S. thermophilus, we investigated whether there was a relationship between CRISPR repeats and cas genes across bacterial species. Across a variety of species, the clustering of the typical CRISPR repeats was similar to that of the Cas proteins (Fig. (Fig.5),5
Comparative analysis of the trees obtained revealed similar clustering patterns (Fig. (Fig.5),5 Spacer diversity in S. thermophilus CRISPR loci. We investigated CRISPR spacer diversity across a variety of S. thermophilus strains (Fig. (Fig.2).2
The identification of arrays of common consecutive spacers allowed the grouping of different S. thermophilus strains into clusters (Fig. (Fig.2).2 Within a particular CRISPR locus, beside the diversity derived from the presence of additional spacers, we also observed probable deletions of select spacers. For spacer additions, it was previously reported that integration of novel spacers primarily occurs at the leader end of CRISPR1 (2, 23), which is consistent with the diversity observed at the leader end of the CRISPR1 locus in several clusters (Fig. (Fig.2),2 A total of 3,626 spacers (2,829 for CRISPR1, 173 for CRISPR2, and 624 for CRISPR3) were analyzed (Fig. (Fig.22 In addition, spacer polymorphisms were also observed with regard to spacer size. Specifically, analysis of the spacer size distribution indicated that variability was lower for CRISPR1 and CRISPR3 than for CRISPR2 (Fig. (Fig.6).6
Analysis of CRISPR spacer sequences. We analyzed the CRISPR spacer sequences and investigated similarity and identity to phage, plasmid, and bacterial sequences (see Table S2 in the supplemental material). A total of 952 unique spacer sequences were analyzed, including 782, 16, and 154 spacers for CRISPR1, CRISPR2, and CRISPR3, respectively. Overall, among the 500 spacers with matches above the selected cutoff (see Materials and Methods), 384 (77%) showed similarity to viral sequences, whereas 80 (16%) showed similarity to plasmid sequences, and 36 (7%) showed similarity to chromosomal sequences (Table 3). Among CRISPR spacers homologous to viral sequences, the large majority (97% [374 of 384]) showed similarity to S. thermophilus phage sequences, including 147 spacers (39%) showing 100% identity. For CRISPR spacers showing similarities with plasmid sequences, nearly all of them (96% [77 of 80]) showed similarity to S. thermophilus plasmid sequences, including 41 spacers (51%) showing 100% identity. In contrast, only four CRISPR spacers (11% [4 of 36]) showed identity to S. thermophilus chromosomal sequences. Spacers identical to known sequences are particularly insightful, since it was previously shown that 100% identity between spacer and proto-spacer sequences is required to provide immunity (2, 7). Nevertheless, other similarities provide insight into the probable origin of the spacers.
Several matches were found across particular S. thermophilus phage genomes (see Table S2 in the supplemental material), notably phages 2972, DT1, 7201, Sfi11, Sfi19, and Sfi21 (17). As previously reported, no particular phage genome module seemed to be targeted, and similarities were observed across the genome, on both strands, with a bias toward the leading strand (2, 7, 17). Also, we investigated the occurrence of the spacer sequences in bacterial sequences, outside of CRISPR loci. Interestingly, the spacers showed identity to sequences found in 27 different S. thermophilus plasmids, notably several matches in pSt08, pt38, pSt106, pSt1, and pND103. In some cases, spacers showed identity to genes involved in plasmid replication, notably repA and repS, as is the case for spacer 059_1_01 (see Table S2 in the supplemental material), which showed identity to sequences found in pND103, pSt1, pt38, pER36, pER35, pER16, and pJ34. Interestingly, most of these plasmids, which are found in a variety of S. thermophilus strains, belong to the same family, namely, the pC19/pUB1104 rolling-circle family (24). Further, three CRISPR1 spacers and one CRISPR3 spacer showed 100% identity to S. thermophilus chromosomal sequences, notably in dtpT (encoding a di- or tripeptide proton symporter), rexA (encoding an ATP-dependent exonuclease), and str_0775 (phage-associated DNA primase), as well as a match in an intergenic region between STER_0810 (hypothetical protein) and STER_0811 (transposase). Interestingly, a number of similarities were observed with sequences found in closely related genera, including Lactococcus, Lactobacillus, and Staphylococcus (see Table S2 in the supplemental material). For phage sequences, similarities were found in Staphylococcus aureus phage Twort and Lactobacillus plantarum phage LP65. For plasmid sequences, similarity was found in Streptococcus pneumoniae pSpnP1, and identity was actually observed in the Lactococcus lactis plasmid pCIS3, within hsdS, which encodes a type I restriction-modification system. For chromosomal sequences, homologies were identified with Lactococcus lactis subsp. cremoris MG1363 and SK11, Lactobacillus reuteri F275, and Streptococcus pyogenes MGAS10394. Interestingly, in L. lactis subsp. cremoris, homology was found in hsdR, a gene also encoding a type I restriction-modification system. DISCUSSION In addition to the two CRISPR loci previously described in S. thermophilus (3), we report here the identification of CRISPR3 in the LMD-9 genome (20). Interestingly, this particular CRISPR locus is not ubiquitous in S. thermophilus genomes. Between the three CRISPR loci present in S. thermophilus genomes, diversity is observed at many levels, including (i) the typical CRISPR repeat sequence; (ii) the cas gene content, organization, and sequence; (iii) locus architecture and content; and (iv) spacer content, arrangement, and sequence. Diversity was observed across the three CRISPR loci between 124 different S. thermophilus strains. Specifically, CRISPR1 was ubiquitous, whereas CRISPR2 was present in 59 of 65 strains, and CRISPR3 was present in 53 of 66 strains. A total of 49 strains (39.5%) carried all three loci. Comparative genome analysis of CRISPR content in streptococci and various bacterial genera and species indicates that the three S. thermophilus CRISPR loci are distributed differently. Notably, CRISPR1 is present in only a few streptococci, whereas CRISPR3 can be found in most Streptococcus species. The distribution of these three CRISPR loci suggests that CRISPR1 may have recently become more specific to a few streptococcal species, whereas CRISPR3 is more widespread across streptococci, and CRISPR2 may be a vestige of a gram-positive ancestor. This is consistent with the absence of CRISPR2 and/or CRISPR3 in various S. thermophilus strains. In fact, detailed sequence analysis of distinct CRISPR3 locus architectures in various S. thermophilus strains suggests that deletions may have occurred via homologous recombination events involving CRISPR3 repeats, likely including the degenerate repeat in the vicinity of serB (Fig. (Fig.11 When equivalent CRISPR loci between strains are compared, a high degree of polymorphism is observed for spacer content and sequences. Specifically, 105 of 124, 7 of 59, and 20 of 53 unique spacer arrangements were observed for CRISPR1, CRISPR2, and CRISPR3, respectively. This indicates that the overall CRISPR content was unique in most strains. Perhaps the polymorphisms observed in the spacer contents of the three CRISPR loci across different S. thermophilus strains are an indicator of the activity of the locus, whereby spacer hypervariability is directly correlated with historical phage exposure. Arguably, the degree of spacer polymorphism, in terms of both total number of unique spacers and total number of unique spacer arrangements, for a given CRISPR locus, could be directly correlated with its activity. Consequently, we propose that in S. thermophilus CRISPR1 is the most active locus, followed by CRISPR3. This is supported by several observations: (i) repeat degeneracy seems to correlate with relative activity, whereby the most degenerate repeats are found in the least active locus, namely, CRISPR2; (ii) spacer size is more highly conserved in the most active loci, namely, CRISPR1 and CRISPR3, and least conserved in the least active locus, namely, CRISPR2; (iii) the average and maximum numbers of spacers are highest for CRISPR1 and lowest for CRISPR2; and (iv) the number of CRISPR BIMs obtained is higher for CRISPR1 than CRISPR3. Previous data have suggested that the enzymatic machinery of a specific locus cannot be effective in conjunction with the CRISPR genetic content of another (2). Specifically, when cas genes are inactivated in a particular CRISPR locus, the ability of this locus to provide resistance and integrate novel spacers is lost, despite the concurrent presence of other CRISPR loci and cas genes elsewhere in the chromosome (2). Here, we provide data indicating that each CAS system may be directly linked to a particular CRISPR repeat sequence, which is consistent with the observed comparable clustering of CRISPR repeats and Cas sequences (Fig. (Fig.5),5
We have shown that two distinct CRISPR loci, namely, CRISPR1 and CRISPR3 have the ability to evolve directly in response to phages by the polarized addition of new spacers derived from viral genomic sequences. Accordingly, CRISPR spacers provide a historical perspective of phage exposure, whereby spacers present in the vicinity of the leader were relatively recently added, whereas distal spacers likely originated from previous events. In addition to CRISPR variability due to the acquisition of novel spacers in response to phages, primarily at the leader end, we noticed that modifications can occur throughout the CRISPR locus, as seen in DGCC7710Φ2972+S15 (7), where a deletion occurred concomitantly with the insertion of a new spacer at the leader end (Fig. (Fig.2).2 Similarities between CRISPR spacers and phage or plasmid sequences have been documented previously (2, 4, 21, 23). Although the majority of CRISPR spacers shows homology to phage (77%) and plasmid (16%) sequences, we identified four CRISPR spacers that are 100% identical to S. thermophilus chromosomal gene sequences, including dtpT and rexA. This might indicate that the CRISPR/Cas system, in addition to providing resistance against foreign genetic elements such as plasmids and phages, may also serve as a microbial regulatory system involved in the control of mRNA transcripts levels for genes encoded on the chromosome, perhaps using a system based on RNAi, as previously suggested (19). Overall, the dynamic nature of CRISPR loci is potentially valuable for typing and comparative analyses of strains and microbial populations. Given that some loci are relatively active while others bear lower levels of polymorphism, the potential of a given CRISPR locus for typing and epidemiological studies has to be assessed on a case-by-case basis. Since CRISPRs are widely distributed in Bacteria and Archaea and actively involved in an adaptive immune system against foreign genetic elements, as well as intrinsic chromosomal elements, they provide critical insights into the relationships between prokaryotes and their environments, notably the coevolution of host and viral genomes. [Supplemental material]
Acknowledgments This study was supported by funding from Danisco A/S. S.M. acknowledges support from the Natural Sciences and Engineering Research Council of Canada through its Discovery Program. We thank Cécile Vos of Danisco Innovation for technical assistance. Footnotes Published ahead of print on 7 December 2007.†Supplemental material for this article may be found at http://jb.asm.org/. REFERENCES 1. Altschul, S. F., T. L. Madden, A. A. Schäffer, J. Zhang, Z. Zhang, W. Miller, and D. J. Lipman. 1997. Gapped BLAST and PSI-BLAST: a new generation of protein database search programs. Nucleic Acids Res. 253389-3402. [PubMed] 2. Barrangou, R., C. Fremaux, P. Boyaval, M. Richards, H. Deveau, S. Moineau, D. A. Romero, and P. Horvath. 2007. CRISPR provides acquired resistance against viruses in prokaryotes. Science 3151709-1712. [PubMed] 3. Bolotin, A., B. Quinquis, P. Renault, A. Sorokin, S. D. Ehrlich, S. Kulakauskas, A. Lapidus, E. Goltsman, M. Mazur, G. D. Pusch, M. Fonstein, R. Overbeek, N. Kyrpides, B. Purnelle, D. Prozzi, K. Ngui, D. Masuy, F. Hancy, S. Burteau, M. Boutry, J. Delcour, A. Goffeau, and P. Hols. 2004. Complete sequence and comparative genome analysis of the dairy bacterium Streptococcus thermophilus. Nat. Biotechnol. 221554-1558. [PubMed] 4. Bolotin, A., B. Quinquis, A. Sorokin, and S. D. Ehrlich. 2005. Clustered regularly interspaced short palindrome repeats (CRISPRs) have spacers of extrachromosomal origin. Microbiology 1512551-2561. [PubMed] 5. Bult, C. J., O. White, G. J. Olsen, L. Zhou, R. D. Fleischmann, G. G. Sutton, J. A. Blake, L. M. FitzGerald, R. A. Clayton, J. D. Gocayne, A. R. Kerlavage, B. A. Dougherty, J. F. Tomb, M. D. Adams, C. I. Reich, R. Overbeek, E. F. Kirkness, K. G. Weinstock, J. M. Merrick, A. Glodek, J. L. Scott, N. S. M. Geoghagen, J. F. Weidman, J. L. Fuhrmann, D. Nguyen, T. R. Utterback, J. M. Kelley, J. D. Peterson, P. W. Sadow, M. C. Hanna, M. D. Cotton, K. M. Roberts, M. A. Hurst, B. P. Kaine, M. Borodovsky, H. P. Klenk, C. M. Fraser, H. O. Smith, C. R. Woese, and J. C. Venter. 1996. Complete genome sequence of the methanogenic archaeon, Methanococcus jannaschii. Science 2731058-1073. [PubMed] 6. Crooks, G. E., G. Hon, J. M. Chandonia, and S. E. Brenner. 2004. WebLogo: a sequence logo generator. Genome Res. 141188-1190. [PubMed] 7. Deveau, H., R. Barrangou, J. E. Garneau, J. Labonté, C. Fremaux, P. Boyaval, D. A. Romero, P. Horvath, and S. Moineau. 2008. Phage response to CRISPR-encoded resistance in Streptococcus thermophilus. J. Bacteriol. 1390-1400. 8. Godde, J. S., and A. Bickerton. 2006. The repetitive DNA elements called CRISPRs and their associated genes: evidence of horizontal transfer among prokaryotes. J. Mol. Evol. 62718-729. [PubMed] 9. Grissa, I., G. Vergnaud, and C. Pourcel. 2007. The CRISPRdb database and tools to display CRISPRs and to generate dictionaries of spacers and repeats. BMC Bioinform. 8172-182. 10. Grissa, I., G. Vergnaud, and C. Pourcel. 2007. CRISPRFinder: a web tool to identify clustered regularly interspaced short palindromic repeats. Nucleic Acids Res. 351-6. [PubMed] 11. Haft, D. H., J. Selengut, E. F. Mongodin, and K. E. Nelson. 2005. A guild of 45 CRISPR-associated (Cas) protein families and multiple CRISPR/Cas subtypes exist in prokaryotic genomes. PLoS Comput. Biol. 1474-483. 12. Hols, P., F. Hancy, L. Fontaine, B. Grossiord, D. Prozzi, N. Leblond-Bourget, B. Decaris, A. Bolotin, C. Delorme, S. D. Ehrlich, E. Guédon, V. Monnet, P. Renault, and M. Kleerebezem. 2005. New insights in the molecular biology and physiology of Streptococcus thermophilus revealed by comparative genomics. FEMS Microbiol. Rev. 29435-463. [PubMed] 13. Jansen, R., J. D. A. Van Embden, W. Gaastra, and L. M. Schouls. 2002. Identification of a novel family of sequence repeats among prokaryotes. OMICS 623-33. [PubMed] 14. Jansen, R., J. D. A. Van Embden, W. Gaastra, and L. M. Schouls. 2002. Identification of genes that are associated with DNA repeats in prokaryotes. Mol. Microbiol. 431565-1575. [PubMed] 15. Jeanmougin, F., J. D. Thompson, T. J. Gibson, M. Gouy, and D. G. Higgins. 1998. Multiple sequence alignment with CLUSTAL X. Trends Biochem. Sci. 23403-405. [PubMed] 16. Kunin, V., R. Sorek, and P. Hugenholtz. 2007. Evolutionary conservation of sequence and secondary structures in CRISPR repeats. Genome Biol. 8R61. [PubMed] 17. Lévesque, C., M. Duplessis, J. Labonté, S. Labrie, C. Fremaux, D. Tremblay, and S. Moineau. 2005. Genomic organization and molecular analysis of the virulent bacteriophage 2972 infecting an exopolysaccharide-producing Streptococcus thermophilus strain. Appl. Environ. Microbiol. 714057-4068. [PubMed] 18. Lillestøl, R., P. Redder, R. Garrett, and K. Brügger. 2006. A putative viral defense mechanism in archaeal cells. Archaea 259-72. [PubMed] 19. Makarova, K. S., N. V. Grishin, S. A. Shabalina, Y. I. Wolf, and E. V. Koonin. 2006. A putative RNA-interference-based immune system in prokaryotes: computational analysis of the predicted enzymatic machinery, functional analogies with eukaryotic RNAi, and hypothetical mechanisms of action. Biol. Direct. 11-26. [PubMed] 20. Makarova, K., A. Slesarev, Y. Wolf, A. Sorokin, B. Mirkin, E. Koonin, A. Pavlov, N. Pavlova, V. Karamychev, N. Polouchine, V. Shakhova, I. Grigoriev, Y. Lou, D. Rohksar, S. Lucas, K. Huang, D. Goodstein, T. Hawkins, V. Plengvidhya, D. Welker, J. Hughes, Y. Goh, A. Benson, K. Baldwin, J. Lee, I. Diaz-Muniz, B. Dosti, V. Smeianov, W. Wechter, R. Barabote, G. Lorca, E. Altermann, R. Barrangou, B. Ganesan, Y. Xie, H. Rawsthorne, D. Tamir, C. Parker, F. Breidt, J. Broadbent, R. Hutkins, D. O'Sullivan, J. Steele, G. Unlu, M. Saier, T. Klaenhammer, P. Richardson, S. Kozyavkin, B. Weimer, and D. Mills. 2006. Comparative genomics of the lactic acid bacteria. Proc. Natl. Acad. Sci. USA 10315611-15616. [PubMed] 21. Mojica, F. J., C. Diez-Villasenor, J. Garcia-Martinez, and E. Soria. 2005. Intervening sequences of regularly spaced prokaryotic repeats derive from foreign genetic elements. J. Mol. Evol. 60174-182. [PubMed] 22. Peng, X., K. Brügger, B. Shen, L. Chen, Q. She, and R. A. Garrett. 2003. Genus-specific protein binding to the large clusters of DNA repeats (short regularly spaced repeats) present in Sulfolobus genomes. J. Bacteriol. 1852410-2417. [PubMed] 23. Pourcel, C., G. Salvignol, and G. Vergnaud. 2005. CRISPR elements in Yersinia pestis acquire new repeats by preferential uptake of bacteriophage DNA, and provide additional tools for evolutionary studies. Microbiology 151653-663. [PubMed] 24. Turgeon, N., M. Frenette, and S. Moineau. 2004. Characterization of a theta replicating plasmid from Streptococcus thermophilus. Plasmid 5124-36. [PubMed] 25. Viswanathan, P., K. Murphy, B. Julien, A. G. Garza, and L. Kroos. 2007. Regulation of dev, an operon that includes genes essential for Myxococcus xanthus development and CRISPR-associated genes and repeats. J. Bacteriol. 1893738-3750. [PubMed] 26. Zuker, M. 2003. Mfold web server for nucleic acid folding and hybridization prediction. Nucleic Acids Res. 313406-3415. [PubMed] |
PubMed related articles
Your browsing activity is empty. Activity recording is turned off. |
|||||||||||||||||||||||
Nat Biotechnol. 2004 Dec; 22(12):1554-8.
[Nat Biotechnol. 2004]FEMS Microbiol Rev. 2005 Aug; 29(3):435-63.
[FEMS Microbiol Rev. 2005]Proc Natl Acad Sci U S A. 2006 Oct 17; 103(42):15611-6.
[Proc Natl Acad Sci U S A. 2006]Science. 2007 Mar 23; 315(5819):1709-12.
[Science. 2007]Microbiology. 2005 Aug; 151(Pt 8):2551-61.
[Microbiology. 2005]J Mol Evol. 2006 Jun; 62(6):718-29.
[J Mol Evol. 2006]OMICS. 2002; 6(1):23-33.
[OMICS. 2002]Archaea. 2006 Aug; 2(1):59-72.
[Archaea. 2006]Science. 1996 Aug 23; 273(5278):1058-73.
[Science. 1996]Mol Microbiol. 2002 Mar; 43(6):1565-75.
[Mol Microbiol. 2002]Science. 2007 Mar 23; 315(5819):1709-12.
[Science. 2007]Microbiology. 2005 Aug; 151(Pt 8):2551-61.
[Microbiology. 2005]Microbiology. 2005 Aug; 151(Pt 8):2551-61.
[Microbiology. 2005]Science. 2007 Mar 23; 315(5819):1709-12.
[Science. 2007]Nucleic Acids Res. 2007; 35(1):1-10.
[Nucleic Acids Res. 2007]Nucleic Acids Res. 1997 Sep 1; 25(17):3389-402.
[Nucleic Acids Res. 1997]Nucleic Acids Res. 2003 Jul 1; 31(13):3406-15.
[Nucleic Acids Res. 2003]Nat Biotechnol. 2004 Dec; 22(12):1554-8.
[Nat Biotechnol. 2004]Microbiology. 2005 Aug; 151(Pt 8):2551-61.
[Microbiology. 2005]Proc Natl Acad Sci U S A. 2006 Oct 17; 103(42):15611-6.
[Proc Natl Acad Sci U S A. 2006]OMICS. 2002; 6(1):23-33.
[OMICS. 2002]Mol Microbiol. 2002 Mar; 43(6):1565-75.
[Mol Microbiol. 2002]Nat Biotechnol. 2004 Dec; 22(12):1554-8.
[Nat Biotechnol. 2004]Proc Natl Acad Sci U S A. 2006 Oct 17; 103(42):15611-6.
[Proc Natl Acad Sci U S A. 2006]Science. 2007 Mar 23; 315(5819):1709-12.
[Science. 2007]Science. 2007 Mar 23; 315(5819):1709-12.
[Science. 2007]Proc Natl Acad Sci U S A. 2006 Oct 17; 103(42):15611-6.
[Proc Natl Acad Sci U S A. 2006]OMICS. 2002; 6(1):23-33.
[OMICS. 2002]Genome Biol. 2007; 8(4):R61.
[Genome Biol. 2007]Archaea. 2006 Aug; 2(1):59-72.
[Archaea. 2006]J Bacteriol. 2003 Apr; 185(8):2410-7.
[J Bacteriol. 2003]J Bacteriol. 2007 May; 189(10):3738-50.
[J Bacteriol. 2007]Nucleic Acids Res. 2003 Jul 1; 31(13):3406-15.
[Nucleic Acids Res. 2003]Genome Biol. 2007; 8(4):R61.
[Genome Biol. 2007]Microbiology. 2005 Aug; 151(Pt 8):2551-61.
[Microbiology. 2005]Science. 2007 Mar 23; 315(5819):1709-12.
[Science. 2007]Microbiology. 2005 Mar; 151(Pt 3):653-63.
[Microbiology. 2005]Science. 2007 Mar 23; 315(5819):1709-12.
[Science. 2007]Microbiology. 2005 Aug; 151(Pt 8):2551-61.
[Microbiology. 2005]Proc Natl Acad Sci U S A. 2006 Oct 17; 103(42):15611-6.
[Proc Natl Acad Sci U S A. 2006]Science. 2007 Mar 23; 315(5819):1709-12.
[Science. 2007]Appl Environ Microbiol. 2005 Jul; 71(7):4057-68.
[Appl Environ Microbiol. 2005]Science. 2007 Mar 23; 315(5819):1709-12.
[Science. 2007]Plasmid. 2004 Jan; 51(1):24-36.
[Plasmid. 2004]Nat Biotechnol. 2004 Dec; 22(12):1554-8.
[Nat Biotechnol. 2004]Proc Natl Acad Sci U S A. 2006 Oct 17; 103(42):15611-6.
[Proc Natl Acad Sci U S A. 2006]Science. 2007 Mar 23; 315(5819):1709-12.
[Science. 2007]Genome Biol. 2007; 8(4):R61.
[Genome Biol. 2007]Biol Direct. 2006 Jan 31; 1():1.
[Biol Direct. 2006]Science. 2007 Mar 23; 315(5819):1709-12.
[Science. 2007]Microbiology. 2005 Aug; 151(Pt 8):2551-61.
[Microbiology. 2005]J Mol Evol. 2005 Feb; 60(2):174-82.
[J Mol Evol. 2005]Microbiology. 2005 Mar; 151(Pt 3):653-63.
[Microbiology. 2005]Biol Direct. 2006 Jan 31; 1():1.
[Biol Direct. 2006]Nucleic Acids Res. 2003 Jul 1; 31(13):3406-15.
[Nucleic Acids Res. 2003]Trends Biochem Sci. 1998 Oct; 23(10):403-5.
[Trends Biochem Sci. 1998]Nucleic Acids Res. 2007; 35(1):1-10.
[Nucleic Acids Res. 2007]Genome Res. 2004 Jun; 14(6):1188-90.
[Genome Res. 2004]