![]() | ![]() |
Formats:
|
||||||||||||||||||||||||
Copyright © 2008, The American Society for Biochemistry and
Molecular Biology, Inc. A Novel Family of Sequence-specific Endoribonucleases Associated with the
Clustered Regularly Interspaced Short Palindromic
Repeats* ![]() ‡Banting and Best Department of Medical Research, University of Toronto, Toronto, Ontario M5G 1L6, Canada, §Structural Proteomics in Toronto, Ontario Cancer Institute, Max Bell Research Centre 5R407, Toronto, Ontario M5G 2C4, Canada, the ¶Department of Molecular Physiology and Biological Physics, University of Virginia, Charlottesville, Virginia 22908, the Midwest Center for Structural Genomics, Biosciences Division, Argonne National Laboratory, Argonne, Illinois 60439, and the **National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, Maryland 208941Supported by the Intramural Research Program of the National
Institutes of Health (National Library of
Medicine, National Center for Biotechnology Information). 2
To whom correspondence should be addressed. Tel.: 416-978-4013; Fax:
416-978-8528; E-mail:
a.iakounine/at/utoronto.ca.
Received April 28, 2008; Revised May 14, 2008. This article has been cited by other articles in PMC.Abstract Clustered regularly interspaced short palindromic repeats (CRISPRs)
together with the associated CAS proteins protect microbial cells from
invasion by foreign genetic elements using presently unknown molecular
mechanisms. All CRISPR systems contain proteins of the CAS2 family, suggesting
that these uncharacterized proteins play a central role in this process. Here
we show that the CAS2 proteins represent a novel family of endoribonucleases.
Six purified CAS2 proteins from diverse organisms cleaved single-stranded RNAs
preferentially within U-rich regions. A representative CAS2 enzyme, SSO1404
from Sulfolobus solfataricus, cleaved the phosphodiester linkage on
the 3′-side and generated 5′-phosphate- and
3′-hydroxyl-terminated oligonucleotides. The crystal structure of
SSO1404 was solved at 1.6Å resolution revealing the first ribonuclease
with a ferredoxin-like fold. Mutagenesis of SSO1404 identified six residues
(Tyr-9, Asp-10, Arg-17, Arg-19, Arg-31, and Phe-37) that are important for
enzymatic activity and suggested that Asp-10 might be the principal catalytic
residue. Thus, CAS2 proteins are sequence-specific endoribonucleases, and we
propose that their role in the CRISPR-mediated anti-phage defense might
involve degradation of phage or cellular mRNAs. Numerous prokaryotic genomes contain structures known as clustered
regularly interspaced short palindromic repeats
(CRISPRs),3 composed
of 25-50-bp direct repeats separated by intervening sequence spacers (or
inserts) of similar length
(1-3).
CRISPRs represent the most widely distributed family of repeats in prokaryotes
and are found in a great variety of bacteria and archaea, including many
pathogens (such as Mycobacterium tuberculosis, Streptococcus, Yersinia,
Neisseria, and Corynebacterium)
(4-8).
Most organisms possess multiple CRISPR clusters
(2-20)
that form tandem arrays containing from 4 to over 100 units (a repeat and a
spacer) (1,
9-11).
Sequence analysis of the CRISPR spacers in various organisms revealed that
some of the spacer sequences are homologous (up to 100% sequence identity) to
sequences in phage genomes, plasmids, or in the chromosome and have either
sense or antisense orientations
(5,
12-14).
Remarkably, phages and plasmids fail to infect the specific strains carrying
the cognate spacers, implying a relationship between CRISPRs and immunity
against specific foreign DNA. Recent experiments on Streptococcus
thermophilus have directly demonstrated that, after a phage challenge,
this bacterium adds new CRISPR spacers derived from the phage genome
(15,
16). Removal or addition of
the phage-specific spacers correlated with the cell resistance to the
respective phage implying that resistance specificity is determined by the
sequence identity between the phage genome and spacers
(15). On the chromosome, CRISPR loci are flanked by a large number of
cas (CRISPR-associated) genes encoding uncharacterized proteins. A
comprehensive bioinformatic analysis of the CAS system in sequenced genomes
resulted in a refined classification with 25 gene families and at least nine
types of the cas operon organization
(9,
17). Eight CAS protein
families have been predicted to possess nuclease activity; nine families have
been characterized as putative RNA-binding proteins (RAMP-domain proteins),
and two families have been predicted to possess helicase and DNA/RNA
polymerase activity (17). This
analysis, combined with the data on the homology of some CRISPR spacer
sequences to phage genes, led to the hypothesis that the CRISPRs and
cas-encoded proteins comprise a system of defense against invading
phages and plasmids and that this system might operate analogously to the
eukaryotic RNA interference systems
(17). The wide distribution of
the CRISPR-CAS system among archaea and bacteria, its apparent importance for
immunity of prokaryotes against infectious agents, and the predicted novel
mechanism have recently made this system a subject of intense interest
(3). Two cas genes (cas1 and cas2) are always located
near a CRISPR locus and are found only in species containing CRISPRs
suggesting that these proteins play a central role in the CRISPR system
(1,
17). The members of the CAS2
superfamily are small, uncharacterized proteins (80-120 residues), which
belong to COG1343 and COG3512 groups of the COG protein classification system
(18). CAS2 proteins contain
several conserved sequence motifs, in particular an N-terminal motif that
consists of a string of hydrophobic residues (a predicted β-strand), and
typically ends with an aspartate
(17). The CAS2 protein
sequences show some similarity to the sequences of the VapD family of
uncharacterized proteins that are functionally linked to the VapBC
toxin-antitoxin (TA) operon
(19). Based on the pattern of
conserved amino acid residues, in particular the presence of a conserved
aspartate after a predicted β-strand, and some functional clues on the TA
systems, it has been hypothesized that both CAS2 and VapD might possess RNase
activity (17). Here we report for the first time the results of biochemical and structural
characterization of a family of CRISPR-associated enzymes, the CAS2 family
proteins from five prokaryotes. We show that CAS2 proteins are
endoribonucleases that are specific to single-stranded (ss)RNAs and
preferentially cleave them within U-rich regions. The crystal structure of a
representative CAS2 protein, SSO1404 from Sulfolobus solfataricus,
was solved to a 1.6 Å resolution and revealed a ferredoxin-like fold
with the double split β-α-β motif, as well as the putative
active site. EXPERIMENTAL PROCEDURES Protein Overexpression, Purification, and Site-directed
Mutagenesis—The cloning of the genes encoding SSO1404 and other
CAS2 proteins (SSO8090, TM1796, AF1876, MTH1083, and NE0845) into the modified
pET15b was carried out as described previously
(20). The proteins were
expressed as a fusion with an N-terminal His6 tag in
Escherichia coli strain BL21 (DE3) and purified to more than 95%
homogeneity using metal-chelate affinity chromatography on nickel affinity
resin and gel filtration on a Superdex 200 26/60 column (Amersham Biosciences)
as described before (20,
21). Site-directed mutagenesis
of SSO1404 was performed as described previously
(21) using a protocol based on
the QuikChange site-directed mutagenesis kit (Stratagene). Preparation of RNA Substrates—The short RNA substrates
(Table 1) were purchased from
IDT. The oligonucleotides were 5′-end-labeled with
[γ-32P]ATP (6,000 Ci/mmol; Amersham Biosciences) and T4
polynucleotide kinase (PNK) (Fermentas) and then purified by denaturing PAGE
(15% polyacrylamide, 8 m urea gel). The labeled oligonucleotides
were eluted from the gel, precipitated with 2% LiClO4 in acetone,
washed with acetone, dried, and dissolved in diethyl pyrocarbonate-treated
Milli-Q water. The long RNA substrates were synthesized using the Ambion T7
RNA polymerase MAXIscript transcription kit. For the synthesis of the mouse
β-actin mRNA (304 nt) fragment, pTRI-actin-mouse DNA (MAXIscript kit,
Ambion) was used as a template. To produce long CRISPR RNA substrates, a DNA
template was prepared by cloning the 1,292-bp-long 5′-fragment of the
S. solfataricus CRISPR cluster-2 DNA into the pBluescript SK+ vector
(Stratagene). During the transcription reaction (MAXIscript kit, Ambion), this
DNA template (linearized by EcoRV) directed the synthesis of a 270-nt-long RNA
substrate containing the CRISPR cluster-2 upstream region, repeat-1, spacer-1,
repeat-2, and a 27-nt fragment of spacer-2. Long RNA substrates were uniformly
labeled during transcription using [α-32P]UTP (3,000
Ci/mmol). The labeled RNA substrates were purified by denaturing 8% PAA, 8
m urea gel, eluted, ethanol-precipitated, and stored at -70 °C.
Double-stranded RNA substrates were prepared by the incubation of equimolar
amounts (10 μm) of the 5′-32P-labeled sense and
unlabeled antisense RNA oligoribonucleotides (RNA5, RNA25, and RNA26) in 50
mm Tris-HCl (pH 7.0), 100 mm KCl at 90 °C for 1 min
and slow cooling at room temperature for 1.5 h. The formation of RNA duplexes
was verified by electrophoresis in 10% nondenaturing PAA gels.
Enzymatic Assays—The reaction mixture for RNase assays (10
μl) contained 0.1 μm [32P]RNA, 50 mm
Tris-HCl (pH 8.5), 100 mm KCl, 5 mm MgCl2, 1
mm dithiothreitol, and 0.01-0.1 μg of enzyme. The pH dependence
of SSO1404 was characterized using three buffers: MES-K (pH 5.5 to 6.5),
Tris-HCl (pH 7.0 to 9.0), and CAPS-K (pH 9.4 to 11.0). The reaction mixture
was incubated at 37 °C for the indicated period of time and quenched by
the addition of equal volume of formamide loading buffer (80% formamide,
0.025% bromphenol blue, 0.025% xylene cyanol, and 10 mm EDTA (pH
8.0)). The reaction products were resolved by electrophoresis in 15% PAA, 8
m urea gels using TBE (10 mm Tris borate (pH 8.3), and 2
mm EDTA) as a running buffer. As nucleotide size markers, an
imidazole ladder or a G-ladder produced by partial RNA cleavage by 2
m imidazole or RNase T1, respectively, was used
(22,
23). For the analysis of the
RNA product 5′-end, after RNase reaction, RNA products were precipitated
by 2% LiClO4, washed by acetone, dried, dissolved in Milli-Q water,
and phosphorylated with [γ-32P]ATP and T4 PNK using
conditions for forward or phosphate exchange reaction according to the
manufacturer's protocol (Fermentas). After the PNK reaction, RNA products were
analyzed using denaturing 15% PAA, 8 m urea gels as described above
for RNase assays. The reaction mixtures for DNase assays (40 μl) contained
50 mm HEPES-K buffer (pH 7.5), 100 mm KCl, 5
mm MgCl2, 1 mm dithiothreitol, 0.3 μg of
λ DNA (double-stranded DNA) or 0.75 μg of M13 DNA (ssDNA), and 1-4
μg of enzyme. After 1 h of incubation at 37 °C, the reactions were
quenched by the addition of 6× DNA loading dye (Fermentas) and analyzed
on EtBr-stained 1% agarose gels. Protein Crystallization and Structure Determination—SSO1404
crystals were grown using the hanging drop vapor diffusion method with the
drops containing a mixture of 2 μl of 10 mg/ml purified
selenomethionine-incorporated SSO1404 protein and 2 μl of reservoir buffer
(0.2 m NaI, 20% w/v PEG 3350, and 2% v/v isopropyl alcohol). For
diffraction studies, the crystals were stabilized with the crystallization
buffer supplemented with 20% ethylene glycol as a cryoprotectant and
flash-frozen in liquid nitrogen. A single crystal of
selenomethionine-incorporated SSO1404 was used to collect diffraction data at
beamline 19-BM of the Structural Biology Center of the Advanced Photon Source
(24) and was maintained at a
temperature of 100 K. A single-wavelength anomalous diffraction dataset was
collected at a wavelength of 0.9794 Å. Crystallographic data collection
and model refinement statistics are summarized in
Table 2. Reflection data were
collected, indexed, integrated, and scaled with HKL-3000
(25).
A two-site selenium substructure was determined; the structure was phased
by single-wavelength anomalous diffraction, and an initial model was built.
All structure solution and initial model building was performed by HKL-3000,
which is integrated with SHELXD, SHELXE, MLPHARE, DM, O, COOT, SOLVE, RESOLVE,
and ARP/wARP
(25-32).
The initial model was improved by iterative cycles of manual rebuilding in
COOT, followed by maximum likelihood refinement with REF-MAC5
(33). In later stages of
refinement, a multigroup TLS model generated by the TLSMD web server was used
to further improve the model. The final model was validated using Mol-probity
(34), SFCHECK
(35), and PROCHECK. The atomic
coordinates and structure factors for SSO1404 have been deposited in the
Protein Data Bank with the accession code 2i8e. RESULTS AND DISCUSSION Enzymatic Activity of CAS2 Proteins—To characterize the
biochemical activity of CAS2 proteins, we cloned and purified six members of
this family from different organisms as follows: SSO1404 and SSO8090 from
S. solfataricus, AF1876 from Archaeoglobus fulgidus, TM1796
from Thermotoga maritima, MTH1083 from Methanobacterium
thermoautotrophicum, and NE0845 from Nitrosomonas europaea.
Given the prediction that CAS2 proteins might possess nuclease activity
(17), the purified proteins
were tested for the presence of DNase or RNase activity against
single-stranded (ss) DNA (M13 DNA), double-stranded (ds) DNA (λ DNA),
or ssRNA as follows: RNA5, RNA6, and RNA14
(Table 1), which are identical
in sequence to the sense strands of the CRISPR repeats or spacers from S.
solfataricus, A. fulgidus, or T. maritima (36 or 39 nt). No
nuclease activity was found against either of the DNA substrates, but all
proteins degraded the ssRNAs (Fig.
1
Reaction Requirements and RNA Cleavage Products of
SSO1404—SSO1404 exhibited RNase activity over a broad pH range
(7.0-10.0) with maximum activity at pH 8.5-9.0
(Fig. 2A
RNases can be subdivided into two groups depending on the position of the
cleavage of the phosphodiester linkage
(36). Enzymes of the first
class cleave the bond on the 3′-side (producing a
5′-phosphorylated second product) and include numerous intracellular
endo- and exoribonucleases. The enzymes of the second class (e.g.
RNase A, RNase T1, and barnases) cleave the linkage on the 5′-side
releasing products containing a cyclic 2′,3′-phosphodiester bond.
The products of ssRNA cleavage by a representative CAS2 protein, SSO1404, were
characterized using the T4 PNK-catalyzed reactions of RNA phosphorylation (at
5′-hydroxyl termini) and the phosphate exchange between the
5′-phosphate groups of the oligonucleotide substrate and ATP
(37). After hydrolysis of the
RNA5 substrate by SSO1404, the reaction products were incubated with PNK and
[γ-32P]ATP and analyzed by denaturing PAGE and
autoradiography (Fig.
2C Endoribonuclease Activity of SSO1404 against Long CRISPR
Substrates—Previous analyses of small RNAs produced in S.
solfataricus and A. fulgidus revealed that the long CRISPR
transcripts are processed into fragments with the size of one repeat and one
spacer (62-75 nt) (39,
40). It has been proposed that
these short CRISPR RNAs are produced by an unknown cas RNase that
would cleave within the repeat sequence producing intact spacers flanked with
the repeat fragments (3). The
demonstration of ssRNase activity of SSO1404 suggested the possibility that
CAS2 could be the RNase that is responsible for processing CRISPR RNAs. Thus,
we tested SSO1404 for the ability to cleave the 270-nucleotide-long CRISPR
transcript of the S. solfataricus CRISPR cluster-2. The uniformly
labeled long CRISPR RNA substrate was prepared using T7 RNA polymerase and the
270-bp DNA fragment of the 5′-end of the S. solfataricus CRISPR
cluster-2 containing the upstream region, repeat-1, spacer-1, repeat-2, and a
27-nt fragment of spacer-2 as the template. If SSO1404 were a
CRISPR-processing endoribonuclease, it would be expected to cleave this
substrate within the repeat sequences and produce a minimal product containing
one spacer flanked by two repeat halves (63-66-nt-long), as well as a series
of products containing two or more repeat + spacer units. SSO1404 showed
detectable endoribonuclease activity with this long CRISPR RNA substrate, but
it produced products that were smaller than expected (23, 27, and 45 nt long),
as well as several longer fragments (65, 75, 95, 110, and 160 nt long)
(Fig. 3A
SSO1404 Cleaves ssRNAs Preferentially within U-rich
Regions—The substrate specificity of SSO1404 was further
characterized using an extended set of shorter synthetic ssRNA substrates, as
well as several dsRNAs. The RNA1 to RNA5 substrates were identical in sequence
to the sense strands of repeat-1 (24 nt), repeat-2 (25 nt), and three spacers
(1-3)
of the S. solfataricus CRISPR cluster-2
(Table 1). SSO1404 cleaved all
these substrates, but the highest activity was observed with RNA5, which was
preferentially cleaved between U9 and U10
(Fig. 3B Eight ssRNA substrates (RNA7 to RNA10 and RNA14 to RNA17) corresponded to
repeats and spacers from other CRISPR-containing organisms (S.
thermophilus, T. maritima, and Methanococcus jannaschii),
whereas RNA11, RNA12, and RNA13 were scrambled RNA substrates
(Table 1). SSO1404 demonstrated
endonucleolytic activity toward most of these substrates with the highest
activity toward RNA9, RNA10, and RNA13
(Fig. 3C Although several substrates shown in
Fig. 3
An ssRNA endoribonuclease activity with a preference for U-rich regions
also has been observed in the abortive phage infection determinant AbiB from
Lactococcus lactis and in mRNA interferases MazF-mt1 and MazF-mt6
from M. tuberculosis
(41,
42). The L. lactis
AbiB prevented growth of the sensitive phage bIL170 through the selective
degradation of phage mRNAs by endonucleolytic cleavage at U/U, A/U, and U/A
sites (41). The mRNA
interferases are toxin components of chromosomal TA modules that are abundant
in free-living prokaryotes and induce reversible cell cycle arrest or
programmed cell death in response to starvation or other stress conditions
(43-46).
These enzymes have different mRNA cleavage specificities, and their expression
in the cell causes the effective inhibition of protein synthesis leading to
temporal cell growth arrest. In this regard, it might be relevant that U-rich
and AU-rich regions have been identified upstream of the Shine-Dalgarno
sequence in prokaryotic and phage mRNAs and shown to enhance translation
(47,
48). The CAS2 proteins are
unrelated to mRNA interferases or AbiB but, as mentioned above, appear to be
homologous to VapD, a component of a distinct class of TA systems. Thus, CAS2
and VapD might represent a novel group of mRNA-specific endoribonucleases. Crystal Structure of SSO1404—To elucidate the molecular
basis for the ssRNase activity of CAS2, we determined the crystal structure of
SSO1404 at 1.6 Å resolution. The structure demonstrated that SSO1404 is
a homodimer (Fig. 5A
A search for structural homologs of SSO1404 using the Dali
(49) and secondary-structure
matching (50) programs
(Pro-Func data base (51)
identified two uncharacterized proteins, TT1823 from Thermus
thermophilus (Protein Data Bank code 1zpw, secondary-structure matching
Z-score = 7.7, r.m.s.d. = 1.3 Å) and PF1117 from Pyrococcus
furiosus (Protein Data Bank code 2i0x, secondary-structure matching
Z-score = 7.0, r.m.s.d. = 1.7 Å)
(Fig. 5, B-D Both Dali and VAST searches also recognized various proteins with the
ferredoxin-like fold as being structural relatives of each of the three CAS2
structures with similar, moderate scores and r.m.s.d. values (supplemental
Table 1), suggesting that CAS2 proteins comprise a new superfamily within the
ferredoxin-like fold. Indeed, this is how the available CAS2 structures have
been classified in the SCOP data base. The ferredoxin-like domains consist of one of the most populated protein
folds, with numerous structural and functional derivatives
(53). In particular, this fold
is present in numerous RNA-binding proteins, including the RNA-binding domain,
the anticodon-binding domain of PheRS, ribosomal proteins S6 and S10, and also
the prominent components of the CAS system, the RAMP superfamily proteins,
which possess a duplicated ferredoxin-like domain
(17,
54). However, to our
knowledge, no nucleases with the ferredoxin-like fold have been characterized
with the possible exception of the IS200 transposase of S.
solfataricus (SSO1474, Protein Data Bank code 2f5g). Like many ribosomal
proteins (55), SSO1404 exposes
several aromatic and hydrophobic residues to the solvent (Ile-32, Tyr-34,
Ile-75, Ile-84, Val-85, Ile-86, Phe-80, and Tyr-88), and some of these
residues are conserved in large subsets of the CAS2 family proteins
(supplemental Fig. 1), suggesting their involvement in the interaction with
the bases of RNA. The extended loop α2-β4 of SSO1404 also could be
a candidate for interaction with RNA. Mutational Analysis of SSO1404 and the Potential Catalytic
Site—The structure of the SSO1404 dimer revealed several major
cavities or grooves located at the interface of the two monomers, which might
represent potential catalytic sites (Fig.
6A
In the SSO1404 structure, five residues that are important for activity
(Tyr-9, Asp-10, Arg-17, Arg-31, and Phe-37) are located in the long cavity
formed by the α1 helix on one side, the β2 and β3 strands on
the other side, and the β1 strand at the bottom
(Fig. 6B The SSO1404 D65A protein had a 2-fold reduced activity
(Fig. 8 Conclusions—This work is the first step toward a
comprehensive biochemical and structural characterization of CAS proteins.
Recent bioinformatic analyses suggest that the CRISPR and CAS proteins might
use several dissimilar mechanisms to abrogate phage infection and that at
least one of these mechanisms could resemble eukaryotic RNA interference
(3,
17). This study shows that
SSO1404 and other CAS2 proteins represent a novel family of endoribonucleases,
which possess a ferredoxin-like fold and are specific to ssRNA substrates.
SSO1404 cleaves ssRNAs preferentially within U-rich regions and, in this
respect, resembles the phage abortive infection endoribonuclease AbiB from
L. lactis (41). A
similar mechanism of selective degradation of phage transcripts might be
proposed for SSO1404 and other CAS2 proteins as well. Under this hypothesis,
CAS2 would be the functional analog of the eukaryotic slicer nuclease, a
function that is performed by the PIWI domains that are unrelated to CAS2
(59). Considering the apparent
mechanism of CRISPR-associated anti-phage defense that involves integration of
sequences homologous to fragments of phage mRNAs into the CRISPR loci, the
CAS2 endoribonuclease activity might contribute to this process as well.
Another possible role of CAS2 proteins in anti-phage defense might be
associated with the global inhibition of translation by mRNA cleavage, a
mechanism that has been proposed for RelBE and several other TA systems
(MazEF, PemIK, and ChpBIK) that contain RNase components (mRNA interferases)
(45,
60,
61). These TA systems play
important roles in stress response to nutritional limitations or DNA damage,
and their expression results in growth arrest or programmed cell death
(43-45).
Moreover, it has been shown that the E. coli MazEF TA module prevents
multiplication of the phage P1 by promoting cell death
(62). A similar mechanism of
anti-phage response appears to be a possibility for CAS2, especially in light
of the previously described relationship with VapD, an uncharacterized protein
that is functionally linked to the VapBC toxin-antitoxin system
(17). S. solfataricus
and many other CRISPR-containing organisms have at least two CAS2 proteins
that might have different sequence specificities and could target distinct
sets of mRNAs. [Supplemental Data]
Acknowledgments We thank all members of the Structural Proteomics in Toronto (SPiT) Centre,
the Structural Biology Center at the Advanced Photon Source and Midwest Center
for Structural Genomics, and Andrzej Joachimiak for help in conducting
experiments and discussions. Crystallography results were derived from work
performed at Argonne National Laboratory, Structural Biology Center, at the
Advanced Photon Source. Argonne is operated by University of Chicago Argonne,
LLC, for the United States Department of Energy, Office of Biological and
Environmental Research, under Contract DE-AC02-06CH11357. Notes The atomic coordinates and structure factors (code 2i8e) have been
deposited in the Protein Data Bank, Research Collaboratory for Structural
Bioinformatics, Rutgers University, New Brunswick, NJ
(http://www.rcsb.org/). *This work was supported, in whole or in part, by National
Institutes of Health Grants
GM62414 and GM074942 from the
Protein Structure Initiative (Midwest Center
for Structural Genomics). This work was also supported by
Genome Canada (through the Ontario Genomics
Institute), the Ontario Research and Development
Fund, and the Canadian Foundation for
Innovation. The costs of publication of this article were
defrayed in part by the payment of page charges. This article must therefore
be hereby marked “advertisement” in accordance with 18
U.S.C. Section 1734 solely to indicate this fact. The on-line version of this article (available at
http://www.jbc.org)
contains supplemental Table 1, Fig. 1, and additional references. Footnotes 3The abbreviations used are: CRISPR, clustered regularly interspaced short
palindromic repeats; CAS, CRISPR-associated; PAA, polyacryalamide; PAGE,
polyacrylamide gel electrophoresis; PNK, polynucleotide kinase; ssRNA,
single-stranded RNA; TA, toxin-antitoxin; nt, nucleotide; r.m.s.d., root mean
square deviation; MES, 4-morpholineethanesulfonic acid; CAPS, 3
-(cyclohexylamino)propanesulfonic acid. References 1. Jansen, R., Embden, J. D., Gaastra, W., and Schouls, L. M.
(2002. ) Mol. Microbiol.
43
1565-1575 [PubMed] 2. Mojica, F. J., Diez-Villasenor, C., Soria, E., and Juez, G.
(2000. ) Mol. Microbiol.
36 244-246 [PubMed] 3. Sorek, R., Kunin, V., and Hugenholtz, P. (2008. )
Nat. Rev. Microbiol. 6
181-186 [PubMed] 4. Groenen, P. M., Bunschoten, A. E., van Soolingen, D., and van
Embden, J. D. (1993. ) Mol. Microbiol.
10
1057-1065 [PubMed] 5. Mojica, F. J., Diez-Villasenor, C., Garcia-Martinez, J., and Soria,
E. (2005. ) J. Mol. Evol.
60 174-182 [PubMed] 6. Masepohl, B., Gorlitz, K., and Bohme, H. (1996. )
Biochim. Biophys. Acta
1307 26-30 [PubMed] 7. Hoe, N., Nakashima, K., Grigsby, D., Pan, X., Dou, S. J., Naidich,
S., Garcia, M., Kahn, E., Bergmire-Sweat, D., and Musser, J. M.
(1999. ) Emerg. Infect. Dis.
5 254-263 [PubMed] 8. Ishino, Y., Shinagawa, H., Makino, K., Amemura, M., and Nakata, A.
(1987. ) J. Bacteriol.
169
5429-5433 [PubMed] 9. Haft, D. H., Selengut, J., Mongodin, E. F., and Nelson, K. E.
(2005. ) Plos Comput. Biol.
1 e60. [PubMed] 10. Grissa, I., Vergnaud, G., and Pourcel, C. (2007. )
BMC Bioinformatics 8
172. [PubMed] 11. Lillestol, R. K., Redder, P., Garrett, R. A., and Brugger, K.
(2006. ) Archaea
2 59-72 [PubMed] 12. Pourcel, C., Salvignol, G., and Vergnaud, G. (2005. )
Microbiology 151
653-663 [PubMed] 13. Bolotin, A., Quinquis, B., Sorokin, A., and Ehrlich, S. D.
(2005. ) Microbiology
151
2551-2561 [PubMed] 14. Horvath, P., Romero, D. A., Coute-Monvoisin, A. C., Richards, M.,
Deveau, H., Moineau, S., Boyaval, P., Fremaux, C., and Barrangou, R.
(2008. ) J. Bacteriol.
190
1401-1412 [PubMed] 15. Barrangou, R., Fremaux, C., Deveau, H., Richards, M., Boyaval, P.,
Moineau, S., Romero, D. A., and Horvath, P. (2007. )
Science 315
1709-1712 [PubMed] 16. Deveau, H., Barrangou, R., Garneau, J. E., Labonte, J., Fremaux,
C., Boyaval, P., Romero, D. A., Horvath, P., and Moineau, S.
(2008. ) J. Bacteriol.
190
1390-1400 [PubMed] 17. Makarova, K. S., Grishin, N. V., Shabalina, S. A., Wolf, Y. I., and
Koonin, E. V. (2006. ) Biol. Direct
1 7. [PubMed] 18. Tatusov, R. L., Fedorova, N. D., Jackson, J. D., Jacobs, A. R.,
Kiryutin, B., Koonin, E. V., Krylov, D. M., Mazumder, R., Mekhedov, S. L.,
Nikolskaya, A. N., Rao, B. S., Smirnov, S., Sverdlov, A. V., Vasudevan, S.,
Wolf, Y. I., Yin, J. J., and Natale, D. A. (2003. ) BMC
Bioinformatics 4
41. [PubMed] 19. Katz, M. E., Wright, C. L., Gartside, T. S., Cheetham, B. F.,
Doidge, C. V., Moses, E. K., and Rood, J. I. (1994. ) J.
Bacteriol. 176
2663-2669 [PubMed] 20. Zhang, R. G., Skarina, T., Katz, J. E., Beasley, S., Khachatryan,
A., Vyas, S., Arrowsmith, C. H., Clarke, S., Edwards, A., Joachimiak, A., and
Savchenko, A. (2001. ) Structure (Lond.)
9
1095-1106. 21. Proudfoot, M., Sanders, S. A., Singer, A., Zhang, R., Brown, G.,
Binkowski, A., Xu, L., Lukin, J. A., Murzin, A. G., Joachimiak, A.,
Arrowsmith, C. H., Edwards, A. M., Savchenko, A. V., and Yakunin, A. F.
(2008. ) J. Mol. Biol.
375
301-315 [PubMed] 22. Mironova, N. L., Pyshnyi, D. V., Shtadler, D. V., Fedorova, A. A.,
Vlassov, V. V., and Zenkova, M. A. (2007. ) Nucleic
Acids Res. 35
2356-2367 [PubMed] 23. Nishikawa, S., Morioka, H., Kim, H. J., Fuchimura, K., Tanaka, T.,
Uesugi, S., Hakoshima, T., Tomita, K., Ohtsuka, E., and Ikehara, M.
(1987. ) Bio-chemistry
26
8620-8624. 24. Rosenbaum, G., Alkire, R. W., Evans, G., Rotella, F. J., Lazarski,
K., Zhang, R. G., Ginell, S. L., Duke, N., Naday, I., Lazarz, J., Molitsky, M.
J., Keefe, L., Gonczy, J., Rock, L., Sanishvili, R., Walsh, M. A., Westbrook,
E., and Joachimiak, A. (2006. ) J. Synchrotron
Radiat. 13
30-45 [PubMed] 25. Minor, W., Cymborowski, M., Otwinowski, Z., and Chruszcz, M.
(2006. ) Acta Crystallogr. Sect. D. Biol.
Crystallogr. 62
859-866 [PubMed] 26. Jones, T. A., Zou, J. Y., Cowan, S. W., and Kjeldgaard, M.
(1991. ) Acta Crystallogr. Sect. A
47 110-119 [PubMed] 27. Terwilliger, T. C., and Berendzen, J. (1999. )
Acta Crystallogr. Sect. D. Biol. Crystallogr.
55 849-861 [PubMed] 28. Terwilliger, T. C. (2002. ) Acta Crystallogr.
Sect. D. Biol. Crystallogr. 58
1937-1940 [PubMed] 29. Perrakis, A., Morris, R., and Lamzin, V. S. (1999. )
Nat. Struct. Biol. 6
458-463 [PubMed] 30. Emsley, P., and Cowtan, K. (2004. ) Acta
Crystallogr. Sect. D. Biol. Crystallogr.
60
2126-2132 [PubMed] 31. Morris, R. J., Perrakis, A., and Lamzin, V. S. (2003. )
Methods Enzymol. 374
229-244 [PubMed] 32. Schneider, T. R., and Sheldrick, G. M. (2002. )
Acta Crystallogr. Sect. D. Biol. Crystallogr.
58
1772-1779 [PubMed] 33. Murshudov, G. N., Vagin, A. A., and Dodson, E. J.
(1997. ) Acta Crystallogr. Sect. D. Biol.
Crystallogr. 53
240-255 [PubMed] 34. Lovell, S. C., Davis, I. W., Arendall, W. B., III, de Bakker, P.
I., Word, J. M., Prisant, M. G., Richardson, J. S., and Richardson, D. C.
(2003. ) Proteins
50 437-450 [PubMed] 35. Vaguine, A. A., Richelle, J., and Wodak, S. J. (1999. )
Acta Crystallogr. Sect. D Biol. Crystallogr.
55 191-205 [PubMed] 36. Saida, F., Uzan, M., and Bontems, F. (2003. )
Nucleic Acids Res. 31
2751-2758 [PubMed] 37. Novogrodsky, A., Tal, M., Traub, A., and Hurwitz, J.
(1966. ) J. Biol. Chem.
241
2933-2943 [PubMed] 38. Berkner, K. L., and Folk, W. R. (1980. )
Methods Enzymol. 65
28-36 [PubMed] 39. Tang, T. H., Bachellerie, J. P., Rozhdestvensky, T., Bortolin, M.
L., Huber, H., Drungowski, M., Elge, T., Brosius, J., and Huttenhofer, A.
(2002. ) Proc. Natl. Acad. Sci. U. S. A.
99
7536-7541 [PubMed] 40. Tang, T. H., Polacek, N., Zywicki, M., Huber, H., Brugger, K.,
Garrett, R., Bachellerie, J. P., and Huttenhofer, A. (2005. )
Mol. Microbiol. 55
469-481 [PubMed] 41. Parreira, R., Ehrlich, S. D., and Chopin, M. C. (1996. )
Mol. Microbiol. 19
221-230 [PubMed] 42. Zhu, L., Zhang, Y., Teh, J. S., Zhang, J., Connell, N., Rubin, H.,
and Inouye, M. (2006. ) J. Biol. Chem.
281
18638-18643 [PubMed] 43. Hayes, F. (2003. ) Science
301
1496-1499 [PubMed] 44. Gerdes, K. (2000. ) J.
Bacteriol. 182
561-572 [PubMed] 45. Gerdes, K., Christensen, S. K., and Lobner-Olesen, A.
(2005. ) Nat. Rev. Microbiol.
3 371-382 [PubMed] 46. Inouye, M. (2006. ) J. Cell.
Physiol. 209
670-676 [PubMed] 47. Zhang, J., and Deutscher, M. P. (1992. )
Proc. Natl. Acad. Sci. U. S. A.
89
2605-2609 [PubMed] 48. Olins, P. O., and Rangwala, S. H. (1989. ) J.
Biol. Chem. 264
16973-16976 [PubMed] 49. Holm, L., and Sander, C. (1994. )
Proteins 19
165-173 [PubMed] 50. Krissinel, E., and Henrick, K. (2004. ) Acta
Crystallogr. Sect. D Biol. Crystallogr.
60
2256-2268 [PubMed] 51. Laskowski, R. A., Watson, J. D., and Thornton, J. M.
(2005. ) Nucleic Acids Res.
33 W89-W93 [PubMed] 52. Ramakrishnan, V., and White, S. W. (1998. )
Trends Biochem. Sci. 23
208-212 [PubMed] 53. Orengo, C. A., Jones, D. T., and Thornton, J. M.
(1994. ) Nature
372
631-634 [PubMed] 54. Ebihara, A., Yao, M., Masui, R., Tanaka, I., Yokoyama, S., and
Kuramitsu, S. (2006. ) Protein Sci.
15
1494-1499 [PubMed] 55. Draper, D. E. (1999. ) J. Mol.
Biol. 293
255-270 [PubMed] 56. Worrall, J. A., and Luisi, B. F. (2007. )
Curr. Opin. Struct. Biol.
17 128-137 [PubMed] 57. Redko, Y., Tock, M. R., Adams, C. J., Kaberdin, V. R., Grasby, J.
A., and McDowall, K. J. (2003. ) J. Biol.
Chem. 278
44001-44008 [PubMed] 58. Callaghan, A. J., Marcaida, M. J., Stead, J. A., McDowall, K. J.,
Scott, W. G., and Luisi, B. F. (2005. )
Nature 437
1187-1191 [PubMed] 59. Hutvagner, G., and Simard, M. J. (2008. )
Nat. Rev. Mol. Cell Biol.
9 22-32 [PubMed] 60. Zhang, J., Zhang, Y., Zhu, L., Suzuki, M., and Inouye, M.
(2004. ) J. Biol. Chem.
279
20678-20684 [PubMed] 61. Zhang, Y., Zhu, L., Zhang, J., and Inouye, M. (2005. )
J. Biol. Chem. 280
26080-26088 [PubMed] 62. Hazan, R., and Engelberg-Kulka, H. (2004. )
Mol. Genet. Genomics
272
227-234 [PubMed] 63. Poirot, O., Suhre, K., Abergel, C., O'Toole, E., and Notredame, C.
(2004. ) Nucleic Acids Res.
32 W37-W40 [PubMed] 64. Beitz, E. (2000. ) Bioinformatics
(Oxf.) 16
135-139. |
PubMed related articles
Your browsing activity is empty. Activity recording is turned off. |
|||||||||||||||||||||||
Mol Microbiol. 2002 Mar; 43(6):1565-75.
[Mol Microbiol. 2002]Nat Rev Microbiol. 2008 Mar; 6(3):181-6.
[Nat Rev Microbiol. 2008]Mol Microbiol. 1993 Dec; 10(5):1057-65.
[Mol Microbiol. 1993]J Bacteriol. 1987 Dec; 169(12):5429-33.
[J Bacteriol. 1987]Mol Microbiol. 2000 Apr; 36(1):244-6.
[Mol Microbiol. 2000]PLoS Comput Biol. 2005 Nov; 1(6):e60.
[PLoS Comput Biol. 2005]Biol Direct. 2006 Mar 16; 1():7.
[Biol Direct. 2006]Nat Rev Microbiol. 2008 Mar; 6(3):181-6.
[Nat Rev Microbiol. 2008]Mol Microbiol. 2002 Mar; 43(6):1565-75.
[Mol Microbiol. 2002]Biol Direct. 2006 Mar 16; 1():7.
[Biol Direct. 2006]BMC Bioinformatics. 2003 Sep 11; 4():41.
[BMC Bioinformatics. 2003]J Bacteriol. 1994 May; 176(9):2663-9.
[J Bacteriol. 1994]J Mol Biol. 2008 Jan 4; 375(1):301-15.
[J Mol Biol. 2008]Nucleic Acids Res. 2007; 35(7):2356-67.
[Nucleic Acids Res. 2007]J Synchrotron Radiat. 2006 Jan; 13(Pt 1):30-45.
[J Synchrotron Radiat. 2006]Acta Crystallogr D Biol Crystallogr. 2006 Aug; 62(Pt 8):859-66.
[Acta Crystallogr D Biol Crystallogr. 2006]Acta Crystallogr D Biol Crystallogr. 2006 Aug; 62(Pt 8):859-66.
[Acta Crystallogr D Biol Crystallogr. 2006]Acta Crystallogr D Biol Crystallogr. 2002 Oct; 58(Pt 10 Pt 2):1772-9.
[Acta Crystallogr D Biol Crystallogr. 2002]Acta Crystallogr D Biol Crystallogr. 1997 May 1; 53(Pt 3):240-55.
[Acta Crystallogr D Biol Crystallogr. 1997]Proteins. 2003 Feb 15; 50(3):437-50.
[Proteins. 2003]Acta Crystallogr D Biol Crystallogr. 1999 Jan; 55(Pt 1):191-205.
[Acta Crystallogr D Biol Crystallogr. 1999]Biol Direct. 2006 Mar 16; 1():7.
[Biol Direct. 2006]Nucleic Acids Res. 2003 Jun 1; 31(11):2751-8.
[Nucleic Acids Res. 2003]J Biol Chem. 1966 Jun 25; 241(12):2933-43.
[J Biol Chem. 1966]Methods Enzymol. 1980; 65(1):28-36.
[Methods Enzymol. 1980]Proc Natl Acad Sci U S A. 2002 May 28; 99(11):7536-41.
[Proc Natl Acad Sci U S A. 2002]Mol Microbiol. 2005 Jan; 55(2):469-81.
[Mol Microbiol. 2005]Nat Rev Microbiol. 2008 Mar; 6(3):181-6.
[Nat Rev Microbiol. 2008]Mol Microbiol. 2002 Mar; 43(6):1565-75.
[Mol Microbiol. 2002]Nat Rev Microbiol. 2008 Mar; 6(3):181-6.
[Nat Rev Microbiol. 2008]Mol Microbiol. 1996 Jan; 19(2):221-30.
[Mol Microbiol. 1996]J Biol Chem. 2006 Jul 7; 281(27):18638-43.
[J Biol Chem. 2006]Science. 2003 Sep 12; 301(5639):1496-9.
[Science. 2003]J Cell Physiol. 2006 Dec; 209(3):670-6.
[J Cell Physiol. 2006]Proc Natl Acad Sci U S A. 1992 Apr 1; 89(7):2605-9.
[Proc Natl Acad Sci U S A. 1992]Proteins. 1994 Jul; 19(3):165-73.
[Proteins. 1994]Acta Crystallogr D Biol Crystallogr. 2004 Dec; 60(Pt 12 Pt 1):2256-68.
[Acta Crystallogr D Biol Crystallogr. 2004]Nucleic Acids Res. 2005 Jul 1; 33(Web Server issue):W89-93.
[Nucleic Acids Res. 2005]Biol Direct. 2006 Mar 16; 1():7.
[Biol Direct. 2006]Trends Biochem Sci. 1998 Jun; 23(6):208-12.
[Trends Biochem Sci. 1998]Nature. 1994 Dec 15; 372(6507):631-4.
[Nature. 1994]Biol Direct. 2006 Mar 16; 1():7.
[Biol Direct. 2006]Protein Sci. 2006 Jun; 15(6):1494-9.
[Protein Sci. 2006]J Mol Biol. 1999 Oct 22; 293(2):255-70.
[J Mol Biol. 1999]Biol Direct. 2006 Mar 16; 1():7.
[Biol Direct. 2006]Curr Opin Struct Biol. 2007 Feb; 17(1):128-37.
[Curr Opin Struct Biol. 2007]J Biol Chem. 2003 Nov 7; 278(45):44001-8.
[J Biol Chem. 2003]Nature. 2005 Oct 20; 437(7062):1187-91.
[Nature. 2005]Nat Rev Microbiol. 2008 Mar; 6(3):181-6.
[Nat Rev Microbiol. 2008]Biol Direct. 2006 Mar 16; 1():7.
[Biol Direct. 2006]Mol Microbiol. 1996 Jan; 19(2):221-30.
[Mol Microbiol. 1996]Nat Rev Mol Cell Biol. 2008 Jan; 9(1):22-32.
[Nat Rev Mol Cell Biol. 2008]Nat Rev Microbiol. 2005 May; 3(5):371-82.
[Nat Rev Microbiol. 2005]Nucleic Acids Res. 2004 Jul 1; 32(Web Server issue):W37-40.
[Nucleic Acids Res. 2004]Nucleic Acids Res. 2004 Jul 1; 32(Web Server issue):W37-40.
[Nucleic Acids Res. 2004]