![]() | ![]() |
Formats:
|
||||||||||||||||||||
Copyright : © 2005 Sperisen et al. This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited. Stealth Proteins: In Silico Identification of a Novel Protein Family Rendering Bacterial Pathogens Invisible to Host Immune Defense 1 Swiss Institute of Bioinformatics, Epalinges, Switzerland 2 Swiss Institute for Experimental Cancer Research, Epalinges, Switzerland Peer Bork, Editor EMBL Heidelberg, Germany #Contributed equally. * To whom correspondence should be addressed. E-mail: philipp.bucher/at/isrec.ch ¤ Current address: Helvea, Geneva, Switzerland Received July 11, 2005; Accepted October 20, 2005. Abstract There are a variety of bacterial defense strategies to survive in a hostile environment. Generation of extracellular polysaccharides has proved to be a simple but effective strategy against the host's innate immune system. A comparative genomics approach led us to identify a new protein family termed Stealth, most likely involved in the synthesis of extracellular polysaccharides. This protein family is characterized by a series of domains conserved across phylogeny from bacteria to eukaryotes. In bacteria, Stealth (previously characterized as SacB, XcbA, or WefC) is encoded by subsets of strains mainly colonizing multicellular organisms, with evidence for a protective effect against the host innate immune defense. More specifically, integrating all the available information about Stealth proteins in bacteria, we propose that Stealth is a D-hexose-1-phosphoryl transferase involved in the synthesis of polysaccharides. In the animal kingdom, Stealth is strongly conserved across evolution from social amoebas to simple and complex multicellular organisms, such as Dictyostelium discoideum, hydra, and human. Based on the occurrence of Stealth in most Eukaryotes and a subset of Prokaryotes together with its potential role in extracellular polysaccharide synthesis, we propose that metazoan Stealth functions to regulate the innate immune system. Moreover, there is good reason to speculate that the acquisition and spread of Stealth could be responsible for future epidemic outbreaks of infectious diseases caused by a large variety of eubacterial pathogens. Our in silico identification of a homologous protein in the human host will help to elucidate the causes of Stealth-dependent virulence. At a more basic level, the characterization of the molecular and cellular function of Stealth proteins may shed light on fundamental mechanisms of innate immune defense against microbial invasion. Synopsis The immune system is a complex and highly developed system of specialized cells and organs that protects an organism against bacterial, parasitic, fungal, and viral infections. Broadly speaking, the different types of immune responses subdivide the immune system into two categories: innate (or nonadaptive) and adaptive immune system. The innate immune system serves as a first line of defense but lacks the ability to recognize certain pathogens and to provide the specific protective immunity that prevents reinfection. Just as metazoans have developed many different defenses against pathogens, so have pathogens evolved elaborate strategies to evade these defenses. Based on a comparative genomics approach and data mining, the authors have discovered a new family of proteins with a striking phylogenetic distribution, occurring in most eukaryotes and in subsets of mostly pathogenic or commensal prokaryotes. While the precise functions of these proteins remain unknown, prokaryotic versions have been implicated in the synthesis of extracellular polysaccharides known to be potent regulators of the innate immune system. This previously unrecognized link hints towards a potentially novel regulatory mechanism of the innate immune system. It remains to be shown if drugs selectively inhibiting Stealth in pathogens will help fight Stealth-mediated infections. Introduction Colonization of hosts by microorganisms is a complex process that determines if the microorganism will coexist with the host as commensal, become an invasive pathogen, or be efficiently eliminated by the host's immune defense [1,2]. Consequently, microorganisms have developed a variety of measures to cope with the increasingly sophisticated defense strategies of the host's immune system [3–7]. Amongst them, the generation of an extracellular coat made of polysaccharides has proved to be a simple but effective strategy. Bacterial surface polysaccharides can be either amorphous exopolysaccharides, anchored in the lipid layer (lipopolysaccharides, another known regulator of the immune system), or organized as a capsule (capsule polysaccharides [CPSs]). The latter have been shown to mediate adherence to cells and, more importantly, protection against the host's innate immune system [8–11]. Different strategies to escape host immune surveillance have evolved through vertical evolution but also through horizontal gene transfer [12–15]. Though a subject of long-standing controversy, there is increasing evidence suggesting that horizontal gene transfer also occurs from eukaryotes to prokaryotes [16]. Even though the recombined bacteria seemed to have preferentially retained individual domains of proteins [16], a first example was recently reported in which certain bacterial strains kept an entire open reading frame [17]. Here we describe a novel protein family named “Stealth.” Based on a comparative genomics approach, we propose a biological function and an evolutionary scenario for this new protein family. Results/Discussion Identification of Stealth In a screen of the human genome for Notch-related proteins, a novel protein containing two copies of Lin-12/Notch repeats was identified. The protein also showed strong sequence similarity to a number of animal and bacterial proteins, including several virulence factors of human pathogens published under different names. This previously unknown protein family was named “Stealth” because experimentally characterized members of this family appear to render bacterial and protozoan invaders invisible to the host's immune surveillance system. Stealth proteins are characterized by four conserved regions (CRs) referred to as CR1 to CR4 (Figure 1
Taxonomic Distribution Stealth proteins are found encoded in the genomes of chordates, echinodermates, hydras, fungi, and flies but appear to be absent from nematodes and plants. Interestingly, a few organisms contain multiple Stealth genes (Table 1). Stealth proteins also occur in the protist genomes of Dictyostelium, Giardia, Leishmania, Entamoeba, and Phytophthora, and among the hitherto sequenced bacteria, they are found in the following phyla: alpha-, beta-, and gamma-proteobacteria (mostly pathogens), firmicutes (mostly the commensals), and actinobacteria (some animal pathogens) (Table 1; Figure S1). It is noteworthy that the large majority of completely sequenced bacterial genomes do not harbor Stealth. The species that do contain a member of this family are not necessarily closely related, and include Gram-positive as well as Gram-negative bacteria.
Stealth in Bacteria Several of the documented bacterial Stealth genes belong to capsule group II biosynthesis operons generating carbohydrate-phosphodiester-containing CPSs [19–24]. In the case of Stealth-expressing bacteria, these CPSs turned out to inhibit complement-mediated lysis, as shown for serogroup A and X of Neisseria meningitidis [23,24] and to correlate with serum and phagocyte survival abilities as shown for Aeromonas hydrophila [25]. The majority of Stealth-expressing bacteria that have been analyzed so far for the composition of their exopolysaccharides turned out to build phosphoglycans consisting of phosphodiester-linked hexose mono- or disaccharide building blocks [26–29]. On the other hand, certain bacteria living in a biofilm community contain CPSs consisting of phosphodiester-linked hexa- or heptasaccharide repeating units [30,31]. These carbohydrates, also called receptor polysaccharides, are synthesized by a series of different glycosyltransferases, with Stealth amongst them [22]. Strains encoding Stealth carry a hexose phosphodiester linker [31] in their receptor polysaccharides, whereas strains lacking Stealth build receptor polysaccharides with a pentose phosphodiester linker. Definite proof for an essential function of Stealth in CPS biosynthesis was shown in N. meningitidis serogroup A by selective deletion of the gene sacB (i.e., Stealth), giving rise to virtually unencapsulated mutants [23], and by deletion of part of the gene xcbA (i.e., Stealth), together with flanking open reading frames in a serogroup X strain, which resulted in complement-sensitive mutants [24]. Moreover, when the gene cps1A (i.e., Stealth) was deleted in Actinobacillus pleuropneumoniae, the resulting strains lost their pathogenicity in pigs [20]. Taken together, all of the above data suggest that Stealth is a D-hexose-1-phosphoryl transferase that generates interglycosidic phosphate diester linkages. Characteristics of Metazoan Stealth Unlike the bacterial Stealth proteins, the vertebrate members of this family are not properly represented in current protein databases. We have manually reconstructed the gene and protein sequences for a number of species with the aid of EST sequences and cross-genome comparisons (Table 1). The human gene consists of 21 exons (Figure 2 Metazoan Stealth proteins are characterized by additional domains. There is a predicted signal peptide and, near the C-terminus, a transmembrane helix. One or two Notch/Lin-12 repeats [32] are inserted between CR2 and CR3, and an EF-hand domain [33] appears between CR3 and CR4. So far, all reconstructed Stealth proteins contain these domains, and in some of the cases where only pieces of sequences are available one can identify these motifs. The strong conservation of the Stealth domain architecture suggests that this protein plays an essential role. No experimental knowledge is available about the function of metazoan Stealth proteins today (note, however, that Stealth-deficient mice have been generated by O. Z. and coworkers and will be made available upon request). In view of the high degree of sequence similarity to their bacterial homologs, it is reasonable to speculate that they have a similar molecular function and thus are also implicated in exopolysaccharide synthesis. Public expression profiles derived from SAGE experiments indicate a rather broad tissue distribution. The Stealth-dependent polysaccharides could be host-specific structural surface elements exploited by the immune system for self-recognition. In this case, the Stealth-dependent resistance of human pathogens to complement-mediated lysis and other host defense mechanisms would be a straightforward case of molecular mimicry. Alternatively, host-encoded Stealth proteins may play an active role in down-regulating the immune response. The presence of Stealth in both insects and urochordates further suggests that this protein interferes with processes related to innate rather than adaptive immunity [34,35]. Stealth and Protists Although higher eukaryotes haven't yet been investigated for the presence of phosphoglycan structures similar to the CPSs, such structures have been identified in D. discoideum and in Leishmania species. In D. discoideum such polysaccharides were found on lysosomal cysteine proteinases and spore coat proteins [36,37]. The lysosomal enzymes of D. discoideum have two types of carbohydrate modifications [38,39] found in two separate sets of lysosomal vesicles [40,41]. The major component of Leishmania lipophosphoglycan is a heteropolymer of 10–40 phosphodiester-linked disaccharide units, depending on species and developmental stage [42]. Lipophosphoglycan is predominantly expressed by promastigotes, is essential for intracellular survival in macrophages and for the virulence of Leishmania major and L. donovani, and disappears when the pathogen intracellularly differentiates into amastigotes within host phagolysosomes [43–47]. The genes encoding these hexose-phosphoryl transferases have been identified neither in D. discoideum nor in Leishmania. Given, however, Stealth's presumed enzymatic activity and its comparative biochemical characterization from three different Leishmania species using synthetic acceptor substrate analogs [48], the two Stealth proteins found in Leishmania and those found in D. discoideum are good candidates for this function. Evolution of Stealth The peculiar taxonomic distribution of Stealth (Figure 3
Materials and Methods Sequence analysis. Multiple amino acid sequence alignments of the four CRs were generated using T-Coffee [52]. The signal peptides were predicted with SignalP v2.0 using the combined NN/HMM-based method [53,54], the transmembrane predictions were made using TMHMM v2.0 [55,56], and the Lin-12/Notch repeats were identified using the profile PS50258 in PROSITE [57]. The EF-hand domains were detected using the Pfam HMM PF00036 [58]. Sequence database searches. Other members of the Stealth protein family were identified by searching with either the human or the Streptomyces coelicolor CR2 using BLAST [18] on either nucleic acid or protein databases. Calculation of sequence trees. For each CR a separate multiple amino acid sequence alignment was generated. These multiple alignments were concatenated, resulting in a multiple alignment that represents the four CRs. CRs that are absent in certain species are represented as gaps in the multiple alignment. Processed alignments were used to derive tree topologies using Bayesian inference of phylogeny as implemented by MrBayes v3.0 [62,63]. MrBayes was used with four heated chains over 200,000 generations, sampling every 20 trees. The likelihoods of these trees were examined to estimate the length of the burn-in phase, and all trees sampled 20,000 generations later than this point were used to create a consensus tree using the 50% majority rule. MrBayes was used with the mixed model of amino acid substitution, assuming the presence of invariant sites and using a gamma distribution approximated by four different rate categories to model rate variation between sites, estimating amino acid frequencies from the alignment. The consensus tree was displayed using DRAWGRAM of the PHYLIP package [64]. Figure S1: Taxonomic Distribution of Stealth in Bacteria (57 KB DOC) Click here for additional data file.(58K, doc) Acknowledgments Part of this work has been supported by grant SKL 1125–02–2001 from the Swiss Cancer League (to OZ). We thank Denis-Luc Ardiet for stimulating discussions and prompting us to kreisler. Abbreviations
Footnotes Competing interests. The authors have declared that no competing interests exist. Author contributions. PS, CDS, PB, and OZ conceived and designed the experiments. PS and CDS performed the experiments. PS, CDS, PB, and OZ analyzed the data and wrote the paper. Citation: Sperisen P, Schmid CD, Bucher P, Zilian O (2005) Stealth proteins: In silico identification of a novel protein family rendering bacterial pathogens invisible to host immune defense. PLoS Comput Biol 1(6): e63. References
|
PubMed related articles
Your browsing activity is empty. Activity recording is turned off. |
|||||||||||||||||||
Nature. 2004 Jul 8; 430(6996):250-6.
[Nature. 2004]Nat Rev Microbiol. 2004 Oct; 2(10):833-41.
[Nat Rev Microbiol. 2004]C R Biol. 2004 Jun; 327(6):557-70.
[C R Biol. 2004]J Clin Invest. 2001 Jan; 107(1):27-30.
[J Clin Invest. 2001]Annu Rev Microbiol. 1987; 41():435-64.
[Annu Rev Microbiol. 1987]Microbiol Rev. 1994 Sep; 58(3):563-602.
[Microbiol Rev. 1994]APMIS Suppl. 1998; 84():37-42.
[APMIS Suppl. 1998]Annu Rev Microbiol. 2001; 55():709-42.
[Annu Rev Microbiol. 2001]Genome Biol. 2004; 5(6):R38.
[Genome Biol. 2004]J Mol Biol. 1990 Oct 5; 215(3):403-10.
[J Mol Biol. 1990]Proteins. 1999 Mar 1; 34(4):508-19.
[Proteins. 1999]Infect Immun. 2003 Dec; 71(12):7202-7.
[Infect Immun. 2003]Infect Immun. 2003 Dec; 71(12):6712-20.
[Infect Immun. 2003]J Bacteriol. 1998 Mar; 180(6):1533-9.
[J Bacteriol. 1998]Microbiology. 2003 Apr; 149(Pt 4):1051-60.
[Microbiology. 2003]J Biol Chem. 1971 Aug 10; 246(15):4703-12.
[J Biol Chem. 1971]Carbohydr Res. 1980 Mar; 79(2):308-12.
[Carbohydr Res. 1980]Glycobiology. 1994 Apr; 4(2):183-92.
[Glycobiology. 1994]Infect Immun. 1997 Dec; 65(12):5035-41.
[Infect Immun. 1997]J Bacteriol. 2003 Sep; 185(18):5419-30.
[J Bacteriol. 2003]J Bacteriol. 1998 Mar; 180(6):1533-9.
[J Bacteriol. 1998]Infect Immun. 2003 Dec; 71(12):6712-20.
[Infect Immun. 2003]Infect Immun. 2003 Jun; 71(6):3320-8.
[Infect Immun. 2003]Biochemistry. 2003 Jun 17; 42(23):7061-7.
[Biochemistry. 2003]Trends Biochem Sci. 1996 Jan; 21(1):14-7.
[Trends Biochem Sci. 1996]Curr Pharm Des. 2003; 9(2):119-31.
[Curr Pharm Des. 2003]Mol Immunol. 2004 Nov; 41(11):1077-87.
[Mol Immunol. 2004]Glycobiology. 1998 Aug; 8(8):799-811.
[Glycobiology. 1998]J Biol Chem. 2000 Apr 21; 275(16):12164-74.
[J Biol Chem. 2000]Methods Enzymol. 1984; 107():172-83.
[Methods Enzymol. 1984]J Biol Chem. 1996 May 3; 271(18):10897-903.
[J Biol Chem. 1996]J Cell Sci. 1997 Sep; 110 ( Pt 18)():2239-48.
[J Cell Sci. 1997]Mol Microbiol. 1991 Sep; 5(9):2255-60.
[Mol Microbiol. 1991]J Bacteriol. 1990 Mar; 172(3):1374-9.
[J Bacteriol. 1990]Infect Immun. 2003 Dec; 71(12):7202-7.
[Infect Immun. 2003]Microbiology. 2000 Nov; 146 ( Pt 11)():2793-802.
[Microbiology. 2000]Infect Immun. 2003 Dec; 71(12):6712-20.
[Infect Immun. 2003]J Mol Biol. 2000 Sep 8; 302(1):205-17.
[J Mol Biol. 2000]Protein Eng. 1997 Jan; 10(1):1-6.
[Protein Eng. 1997]Proc Int Conf Intell Syst Mol Biol. 1998; 6():122-30.
[Proc Int Conf Intell Syst Mol Biol. 1998]J Mol Biol. 2001 Jan 19; 305(3):567-80.
[J Mol Biol. 2001]Proc Int Conf Intell Syst Mol Biol. 1998; 6():175-82.
[Proc Int Conf Intell Syst Mol Biol. 1998]Genome Res. 2002 Jul; 12(7):1068-74.
[Genome Res. 2002]Nucleic Acids Res. 2004 Jan 1; 32(Database issue):D509-11.
[Nucleic Acids Res. 2004]J Mol Biol. 1990 Oct 5; 215(3):403-10.
[J Mol Biol. 1990]Bioinformatics. 2001 Aug; 17(8):754-5.
[Bioinformatics. 2001]Bioinformatics. 2003 Aug 12; 19(12):1572-4.
[Bioinformatics. 2003]