The lipocalin protein family consists mainly of small extracellular proteins that bind hydrophobic ligands and fulfill numerous biological functions including ligand transport, cryptic coloration, sensory transduction, the biosynthesis of prostaglandins, and the regulation of cellular homeostasis and immunity. The lipocalins can be structurally characterized as an 8-stranded antiparallel β-barrel followed by a C-terminal α-helix, as exemplified by serum retinol binding protein.1 The lipocalin fold is widely distributed among vertebrates and a few have been isolated from invertebrates and plants. Until 1995, when the first bacterial lipocalins were identified,2 the family was thought to be restricted to eukaryotes. The recent crystal structure of a bacterial lipocalin suggests that phospholipid derivatives are likely ligands.3 The existence of bacterial lipocalins provides insight into the origins of the lipocalin family and, combined with the powerful tools of bacterial genetics, provides fertile ground for the investigation of lipocalin structure and function.
The bacterial lipocalin Blc was first identified as an outer membrane lipoprotein of E. coli in 1995.2 Approximately 100 lipoproteins with varied structures and functions are encoded within the E. coli genome.4 Export of bacterial lipoproteins across the inner membrane is directed by a sec-dependent type-II signal peptide, which converts the amino terminal cysteine into N-acyl-S-sn-1,2-diacylglycerylcysteine. The lipidated protein remains anchored to the periplasmic surface of the inner membrane until it encounters the LolCDE ATP-binding cassette transport system, which can actively disengage the lipoprotein from the membrane and complex it with the periplasmic chaperone LolA. The LolA/lipoprotein complex moves across the periplasmic space and docks in the outer membrane with LolB, which is also a lipoprotein, to allow delivery of the lipoprotein from LolA into the periplasmic leaflet of the outer membrane.5 The outer membrane β-barrel proteins are similarly exported across the inner membrane in a process that subsequently employs dedicated chaperones to move them through the periplasmic space before folding into the outer membrane.6 Lipoproteins and integral membrane β-barrel proteins are the two major classes of proteins present in the bacterial outer membrane. Additionally, the outer membranes of Gram-negative bacteria are distinguished by an asymmetric lipid organization, with the inner and outer leaflets composed of phospholipids and lipopolysaccharides (LPS), respectively.
β-barrel membrane proteins are not normally found in the inner membrane, although there is no physical reason why they could not assemble in that location. The β-barrel structure is very effective at creating pores that would dissipate the proton motive force, suggesting that natural selection acts against their inner membrane localization. Indeed, several protein toxins act by forming β-barrel structures within energy-transducing membrane systems.7 The β-barrel structure satisfies the requirement of providing continuous hydrogen bonding between the amide nitrogen proton and carbonyl oxygen of the peptide bonds, which are otherwise too polar to interact stably with the hydrocarbon chains in a lipid bilayer membrane. Roughly 8 amino acids are sufficient to span the membrane in the extended β-conformation, and half of these residues project into the mostly polar β-barrel interior region.8 Only those residues whose registration projects them into the lipid milieu of the membrane need to possess hydrophobic side-chains. Consequently, β-barrel membrane proteins are only moderately hydrophobic, a fact that facilitates their coursing through the inner membrane and periplasmic space.9 Nature's other solution to provide continuous peptide hydrogen bonding is the transmembrane α-helix, which demands roughly 20 hydrophobic amino acid side chains in order to span the membrane.10 The greater hydrophobicity of transmembrane α-helical proteins dictates that their targeting to the inner membrane is essentially irreversible, a fact that explains why no transmembrane α-helical proteins are found in the outer membrane. The peptidoglycan exoskeleton that is sandwiched between the inner and outer membranes is believed to effectively block wholesale membrane exchange by vesicular trafficking. Consequently, outer membrane biogenesis likely depends on soluble periplasmic lipid-transfer proteins and/or localized contact bridges between inner and outer membranes.
The lipocalins represent a family of β-barrel proteins that are adapted to bind hydrophobic ligands, essentially chaperoning them through the aqueous compartments of the cell.11 Accumulating evidence supports the hypothesis that lipocalins originated in association with the development of bacterial outer membrane systems, and that bacterial lipocalins exert a function in the maintenance of the outer membrane structure. Here we update the phylogenetic distribution of bacterial lipocalins and describe some interesting new relationships between their structure and function in the outer membrane. Some aspects of bacterial lipocalin structure-function relationships that have been reviewed previously are not addressed again here.12 We suggest that interested readers consult this earlier article to gain a fuller picture of the bacterial lipocalin story.
In the year 2000, 20 lipocalins were identified in Gram-negative bacteria, but that number now exceeds 90. Among the newly identified sequences are three that belong to Gram-positive bacteria, namely, Nocardia and two species of Corynebacteria. These organisms belong to the so called CMN branch (for Corynebacteria, Mycobacteria, Nocardia) of the actinomycetes. The CMN branch represents the most highly derived of the Gram-positive bacteria and do not actually display Gram-positive staining. Instead, the presence of mycolic acids in these organisms generates a permeability barrier extrinsic to the inner membrane that requires staining by the so-called “acid-fast” procedure. The CMN organisms are regarded by some as being intermediate between Gram-negative and Gram-positive bacteria, because the mycolic acid layer requires porin channels to facilitate nutrient uptake and these adopt a transmembrane β-barrel architecture13 - a decidedly Gram-negative characteristic.14 Despite this similarity, the 16s rRNA phylogeny places the CMN branch firmly within the Gram-positive actinomycetes. Therefore, the bacterial lipocalins are associated with organisms that produce either LPS or mycolic acids, the two classes of lipids associated with bacterial permeability barriers that harbor β-barrel membrane proteins. Although lipocalins do not appear among the Mycobacteria, the fatty acid binding-protein fold that resembles lipocalins has been identified in these organisms (Pedro Alzari, personal communication). The mycolic acids of Mycobacteria are unusually long and create a thicker permeability barrier compared to Nocardia and Corynebacteria. To date, no lipocalins have been identified among the Gram-positive or archaebacterial organisms that lack the defining characteristics of outer membranes.
| Organisms | Signal Peptideb | βB-βC Cysteines | Accession Number |
|---|---|---|---|
| Bacteria | |||
| Alpha-Proteobacteria (α) | |||
| Agrobacterium tumefaciens | SPI | Yes | NC_003063 |
| Caulobacter crescentus (1) | SPII | Yes | NC_002696 |
| Caulobacter crescentus (2) | SPI | Yes | NC_002696 |
| Hyphomonas neptunium | SPII | Yes | TIGR (incomplete) |
| Mesorhizobium loti (1) | None | Yes | NC_002678 |
| Mesorhizobium loti (2) | None | Yes | NC_002678 |
| Rhodobacter sphaeroides | SPII | Yes | AAAE01000158 |
| Rhodospirillum rubrum | SPII | No | AAAG02000001 |
| Silicibacter | SPI | Yes | NZ_AAFG01000010 |
| Beta-Proteobacteria (β) | |||
| Acidovorax (partial) | SPII | No | AB044565 |
| Bordetella bronchiseptica | SPI | Yes | NC_002927 |
| Bordetella parapertussis | SPI | Yes | NC_002928 |
| Burkholderia cepacia (1) | SPII | Yes | AAEH01000003 |
| Burkholderia cepacia (2) | SPII | No | AAEH01000003 |
| Burkholderia fungorum (1) | SPII | No | NZ_AAAJ03000001 |
| Burkholderia fungorum (2) | SPII | No | NZ_AAAJ03000001 |
| Burkholderia fungorum (3) | SPII | Yes | NZ_AAAJ03000004 |
| Chromobacterium violaceum | SPII | No | NC_005085 |
| Dechloromonas aromatica | SPI | Yes | AADF01000001 |
| Methylobacillus flagellatus | SPII | No | AADX01000001 |
| Ralstonia eutropha | SPII | No | AADY01000001 |
| Rubrivivax gelatinosus | SPI | Yes | AAEM01000005 |
| Thiobacillus denitrificans | SPII | Yes | AAFH01000001 |
| Gamma-Proteobacteria (γ) | |||
| Acinetobacter (1) | SPI | Yes | CR543861 |
| Acinetobacter (2) | SPII | Yes | CR543861 |
| Azotobacter vinelandii | SPII | Yes | NZ_AAAU02000004 |
| Citrobacter braakii (partial) | SPII | No | AF492447 |
| Citrobacter freundii | SPII | No | U21727 |
| Citrobacter murliniae | SPII | No | AJ607409 |
| Citrobacter werkmanii (partial) | SPII | No | AF492448 |
| Colwellia psychroerythraea | SPII | No | TIGR (Incomplete) |
| Enterobacter cancerogenus (partial) | SPII | No | AF492446 |
| Enterobacter nimipressuralis | SPII | No | AJ487975 |
| Erwinia carotovora | SPII | No | BX950851 |
| Escherichia coli | SPII | No | P39281 |
| Francisella tularensis | SPII | No | AY774926 |
| Idiomarina loihiensis | SPII | No | NC_006512 |
| Klebsiella oxytoca | SPII | No | Y17716 |
| Klebsiella pneumoniae | SPII | No | Incomplete |
| Pseudomonas aeruginosa | SPII | Yes | AABQ07000004 |
| Pseudomonas fluorescens | SPII | Yes | AAAT03000001 |
| Pseudomonas putida (1) | SPII | Yes | NC_002947 |
| Pseudomonas putida (2) | SPII | Yes | NC_002947 |
| Gamma-Proteobacteria (γ) | |||
| Pseudomonas syringae | SPI | Yes | AABP02000002 |
| Salmonella paratyphi | SPII | No | Incomplete |
| Salmonella typhi | SPII | No | NC_006511 |
| Salmonella typhimurium | SPII | No | AE008903 |
| Shewanella oneidensis | SPII | No | AE015615 |
| Shigella flexneri | SPII | No | NC_004741 |
| Vibrio cholerae (1) | SPII | No | NC_002506 |
| Vibrio cholerae (2) | SPII | No | NC_002506 |
| Vibrio cholerae (3) | SPII | No | NC_002506 |
| Vibrio cholerae (4) | SPII | No | NC_002506 |
| Vibrio parahaemolyticus | SPII | No | NC_004605 |
| Vibrio vulnificus | SPII | No | NC_004460 |
| Xanthomonas axonopodis (1) | SPI | No | NC_003919 |
| Xanthomonas axonopodis (2) | SPI | No | NC_003919 |
| Xanthomonas axonopodis (3) | None | Yes | NC_003919 |
| Xanthomonas axonopodis (4) | SPI | Yes | NC_003919 |
| Xanthomonas campestris (1) | SPI | No | NC_003902 |
| Xanthomonas campestris (2) | SPI | No | NC_003902 |
| Xanthomonas campestris (3) | None | Yes | NC_003902 |
| Xanthomonas campestris (4) | SPI | Yes | NC_003902 |
| Yersinia enterocolitica | SPII | No | Sanger (Incomplete) |
| Delta-Proteobacteria (δ) | |||
| Bdellovibrio bacteriovorus (1) | SPI | Yes | NC_005363 |
| Bdellovibrio bacteriovorus (2) | SPI | Yes | NC_005363 |
| Desulfotalea psychrophila | SPII | No | NC_006138 |
| Geobacter sulfurreducens | SPII | Yes | NC_002939 |
| Epsilon-Proteobacteria (ε) | |||
| Campylobacter jejuni | None | Yes | AL139078 |
| Campylobacter lari | None | Yes | NZ_AAFK01000002 |
| Chlorobi | |||
| Chlorobium tepidum (1) | SPII | No | NC_002932 |
| Chlorobium tepidum (2) | None | Yes | NC_002932 |
| Bacteroidetes | |||
| Bacteroides fragilis | None | No | NC_006347 |
| Bacteroides thetaiotaomicron | None | No | NC_004663 |
| Cytophaga hutchinsonii | SPI | No | AABD03000002 |
| Verrucomicrobia | |||
| Verrucomicrobium spinosum (1) | SPII | No | TIGR (incomplete) |
| Verrucomicrobium spinosum (2) | SPII | No | TIGR (incomplete) |
| Chlamydiae | |||
| Parachlamydia | SPI | Yes | NC_005861 |
| Cyanobacteria | |||
| Gloeobacter violaceus | SPII | Yes | NC_005125 |
| Actinobacteria | |||
| Corynebacterium efficiens | SPI | Yes | NC_004369 |
| Corynebacterium glutamicum | SPI | Yes | NC_003450 |
| Nocardia farcinica | SPI | Yes | NC_006361 |
| Eukaryota | |||
| Viridiplantae | |||
| Arabidopsis thaliana | None | No | AY085685 |
| Capsicum annuum | None | No | AY568589 |
| Lycopersicon esculentum | None | No | BT013102 |
| Oryza sativa | None | No | XM_466697 |
| Triticum aestivum | None | No | AY077702 |
| Zea mays | None | No | AY107141 |
| Saccharomycetes | |||
| Debaryomyces hansenii | ER | Yes | XM_460369 |
| Metazoa (Outgroup) | |||
| ApoD | ER | No | P05090 |
| Lazarillo | ER/GPI | No | P49291 |
a A TBLASTN44 search using the E. coli Blc protein sequence was performed on both the microbial genome databases and the non-redundant database at the NCBI website (http://www.ncbi.nlm.nih.gov). The genome sequence 500 bp upstream and downstream of each hit was retrieved. The six resulting translations were evaluated to identify both the full-length lipocalin sequence and the presence of the 16s ribosomal RNA binding site. The taxonomy of each organism was also retrieved using the NCBI TaxBrowser. b Each amino acid sequence was evaluated for the presence of signal peptides and these were assigned using SIGNALP.45 SPI: signal peptidase I; SPII: signal peptidase II; ER: endoplasmic reticulum; GPI: glycosylphosphatidylinositol.
Relationships between apparent disulfide bonding patterns in lazarillo, ApoD, B. fungorum2, and G. violaceus lipocalins. Amino acid residues in the surface loop between β-strands G and H are shown for lazarillo, and as a consensus sequence for the ApoDs; however, the intermolecular disulfide bond with ApoA2 is unique to human ApoD. The apparent disulfide bond flanking β-strands B and C in the G. violaceus lipocalin is present in the majority of Clade IB lipocalins. The putative secondary structural elements are indicated as β-sheet (blue arrows), α-helix (red rectangles), or signal sequences (green rectangles). Diglyceride moieties of the lipid anchors in Blc and lazarillo are shown as black and yellow rectangles. The sugars of the GPI anchor of lazarillo are mannose (green squares), glucosamine (red circle), and inositol (blue diamond).
In a previous analysis, we interpreted the absence of a homologue from an archaebacterium as indicating that lipocalins could not have existed in the last common ancestor of the three domains of life, and that they were probably acquired in eukaryotes horizontally by endosymbiosis.12 Horizontal transfer is probably partly correct, especially in light of the bacterial-like lipocalins of plants and yeast described here. However, a recent study from Rivera and Lake suggests that a genome fusion event occurred between an archaebacterium and a proteobacterium to create the first eukaryotic cell; this framework provides a more parsimonious explanation for the acquisition of most eukaryotic lipocalins by vertical descent from the proteobacterial component of the ancestral chimera.21 This model builds on ideas from Gupta,22 who proposes that monoderm Gram-positive bacteria are ancestral to both the monoderm archaebacteria and the diderm Gram-negative bacteria/CMN actinomycetes.23 Our observations clearly fit with a model where the lipocalins represent a late development in bacterial evolution, having arisen in the diderm bacteria. However, this is not so late a development as to preclude ancestry to the first eukaryotic cell according to the archaebacterial-proteobacterial fusion model.21 The earlier three domain model predicts that proteins acquired vertically by eukaryotes (Eukarya) will likely have ancestors in the archaebacteria (Archaea) if ancestors already exist in the eubacteria (Bacteria).24
Comparison of PagP and serum retinol-binding protein. PagP (A) is shown in green (Protein Data Bank code 1THQ), and RBP (B) is shown in red (Protein Data Bank code 1AQB). The lauroyldimethylamine-N-oxide and retinol ligands are represented as spheres using CPK colors. PagP has an N-terminal α-helix, while most lipocalin structures, including RBP, have a single α-helix at the C-terminus. Both proteins are oriented so the β-strands correspond to each other; note that the first two β-strands of PagP are separated by the disordered L1 surface loop.
We cannot rule out the possibility that this structural similarity represents a case of convergent evolution. However, based on the absence of lipocalins from those bacteria that also lack β-barrel membrane proteins, we cannot currently exclude the hypothesis that PagP and lipocalins share a common ancestor. This hypothesis is supported by the observation that many bacterial lipocalins are anchored in the outer membrane as lipoproteins.2 The lipocalin-lipoprotein might represent an intermediate state in the adaptation between membrane-bound and soluble globular domains. Outer membrane β-barrel domains should be suitably adapted to occupy both soluble and membrane-bound conformations because both environments are encountered during the outer membrane assembly process.6
X-ray structure of Blc in complex with vaccenic acid. A) Ribbon view of Blc with vaccenic acid inside (spheres). Monomer B (left) is pink; monomer A (right) is rainbow colored from N- (blue) to C-terminus (red). B) Compact view of the Blc dimer attached to the membrane with anchor lipid S-sn-1,2-diacylglycerylcysteine attached to the first cysteine (computer model). Monomer B (right) is pink and monomer A (left) is blue; vaccenic acid and lipidic anchors are represented as spheres.
The comparison of subunits A and B indicate significant conformational differences that might be necessary for dimerization. In order to fit within the other monomer, strands F and G, and particularly the loops between them, move up to 5 Å in one monomer relative to the other. The dimer interface involves in large part these loops, and other residues among which the aromatics Tyr113, Tyr137, and several surface-exposed phenylalanines including Phe53 A & B, Phe108 A & B, Phe109 A & B and Phe112 A & B together form an inter-dimer hydrophobic core. The presence of many exposed phenylalanines is an uncommon feature at the surface of a monomeric globular protein, and has to be regarded as a hallmark of protein-protein or protein-lipid interactions. Both subunits are involved in the binding site of Blc, accounting for the stoichiometry of one vaccenic acid molecule per dimer. The vaccenic acid interacts with both subunits and covers 89 Å2 and 171 Å2 of the WAS area of monomers A and B, respectively.
Tryptophan fluorescence quenching studies indicate that dimeric Blc binds fatty acids and phospholipids in a micromolar Kd range. An exposed and unfilled pocket seemingly suited to bind a polar group extending from the fatty acid prompted investigation of lysophospholipids (LPLs), which were found to bind in a nanomolar Kd range. Favorable steric and electrostatic interactions are observed when a glycerophosphoethanolamine moiety is modeled into this region of the structure. Given the high affinity of Blc for LPLs, it seems likely that Blc might fulfil a role in cell envelope LPL transport. Although LPLs are key inner membrane intermediates of phospholipid metabolism, we do not know of any evidence to indicate that they are exported to the outer membrane. However, exogenously supplied LPLs can be taken up by deep-rough LPS mutants and converted by reacylation into glycerophospholipids using inner membrane-associated enzymes.32-34 To date, no accessory factors needed for LPL uptake have been identified. In wild-type cells, at least two enzymes can generate LPLs in the outer membrane, namely, the phospholipase OMPLA35 and the lipid A palmitoyltransferase PagP.36 Both enzymes preferentially generate the sn-1 LPL regioisomers, but these are known to spontaneously rearrange into the more stable sn-2 LPLs.37 Only sn-2 LPLs were tested for binding to Blc.
The LPL products sn-2-lysophosphatidic acid and sphingosine-1-phosphate, together with the structurally related platelet activating factors, have been shown to function as potent biological mediators.38,39 In eukaryotic cells, sn-2 LPLs can be generated by phospholipases that mobilize arachidonic acid for eicosanoid-mediated signal transduction. Additionally, sn-2 LPLs are enzymatic products of the lecithin:cholesterol acyltransferase LCAT, which is, together with ApoD, a key component of the plasma high density lipoprotein particle. Mammalian ApoD and Drosophila lazarillo are the closest eukaryotic homologues of Blc and the only eukaryotic lipocalins that, like Blc, are anchored to lipid membranes. The ligand for lazarillo is unknown, but ApoD binds a variety of ligands including arachidonic acid.40 Perhaps the high affinity of Blc for LPLs might help shed some light on the enigmatic functions of lazarillo and ApoD.
The bacterial lipocalin Blc is distinguished from eukaryotic lipocalins by its dimeric organization with a lipid-binding pocket at the subunit interface and by its high affinity for LPLs. We have indicated that the structural and functional organization of Blc shares some common features with β-barrel membrane proteins. The structural similarity between PagP and lipocalins is illustrative of this point, but, in the absence of measurable amino acid sequence identity, structural similarity does not by any means indicate common ancestry. In fact, we would normally regard the similarity between PagP and lipocalins as an example of convergent evolution, and other similarities between lipocalins and a polyisoprenoid-binding protein, for example,41 indicate that this structural organization has probably arisen independently on more than one occasion in the past; even the lipoprotein chaperones LolA and LolB are lipid-interactive β-barrel proteins.5 However, the tight association of bacterial lipocalins with those organisms that encode outer membrane β-barrel proteins like PagP makes it difficult to pronounce a verdict of convergent evolution. Combined with the functional association of bacterial lipocalins with outer membranes, we cannot ignore the possibility that lipocalins could have either engendered β-barrel membrane proteins, or vice versa. Whatever the case may be, bacterial lipocalins promise to reveal molecular details of lipid membrane biogenesis relevant to both bacteria and higher organisms, and we hope our present insight into their plausible origins will serve to guide experiments aimed at addressing this subject.
Work in the laboratories of REB, GGP, and ERMT was supported by operating grants from the Canadian Institutes of Health Research. Work in the laboratory of CC was supported by the French Ministry of Industry (grant ASG) and the Marseille-Nice Genopole.
Free Full text in PMC]
Free Full text in PMC]
Free Full text in PMC]
Free Full text in PMC]
Free Full text in PMC]
Free Full text in PMC]
Free Full text in PMC]
Free Full text in PMC]
Free Full text in PMC]
Free Full text in PMC]
Free Full text in PMC]