Numbers and features of predicted amino-terminal signal peptides found in (putative) exported proteins of B. subtilis. To estimate the number of exported proteins, the first 60 residues of all annotated proteins of B. subtilis in the SubtiList database (http://bioweb.pasteur.fr/GenoList/SubtiList) were used to predict amino-terminal signal peptides with the SignalP algorithm for the prediction of signal peptides of gram-positive eubacteria (194). Next, to distinguish between potential secretory proteins and multispanning membrane proteins, putative membrane-spanning segments in protein sequences with a putative signal peptide were predicted with the TopPred2 algorithm (61, 264). All proteins containing additional hydrophobic domains (upper cutoff, 1.0; lower cutoff, 0.6; window size top, 11; window size bottom, 21) were regarded as membrane proteins, and their amino termini were excluded from the primary set of signal peptides. Finally, all putative signal peptides were screened for the presence of a lipobox, twin-arginine motif, or a cleavage site for the prepilin SPase. On the basis of SPase cleavage sites, predicted signal peptides were divided into four distinct classes: A, secretory (Sec-type) signal peptides and twin-arginine signal peptides; B, lipoprotein signal peptides; C, prepilin-like signal peptides; and D, bacteriocin and pheromone signal peptides. The number of predicted B. subtilis signal peptides of each class and the SPases responsible for their cleavage are indicated. Most signal peptides have a tripartite structure: a positively charged N-domain (N), containing lysine and/or arginine residues (indicated with +); a hydrophobic H-domain (H, indicated by a black box); and a C-domain (C) which specifies the cleavage site for SPase, as indicated by the scissors symbol. The average lengths of the complete signal peptide, N-domain, H-domain, and consensus SPase recognition sequences are indicated. Furthermore, helix-breaking residues, mostly glycine or proline (G/P) in the H-domain of certain signal peptides, are indicated. These residues are thought to facilitate loopwise membrane insertion and cleavage by SPase I, respectively (Fig. 1) (202). Finally, where appropriate, the most frequently occurring first amino acid (aa) of the mature protein (+1) is indicated.