NCBI » Bookshelf » Madame Curie Bioscience Database » Cell Metabolism » Bacterial Lipocalins: Origin, Structure, and Function
 
eurekah
Madame Curie Bioscience Database
Landes Bioscience
biochemistrycancercell biologydevelopmental biologygeneticsimmunologyinfectious diseasesmedical geneticsmicrobiologymolecular biologyneuroscience

Bacterial Lipocalins: Origin, Structure, and Function

Russell E Bishop, Christian Cambillau, Gilbert G. Privé, Derek Hsi, Desiree Tillo, and Elisabeth R. M. Tillier
Corresponding Author: Departments of Laboratory Medicine and Pathobiology, and Biochemistry, Faculty of Medicine, University of Toronto, 6213 Medical Sciences Building, 1 King's College Circle, Toronto, Ontario M5S 1A8, Canada. Email: russell.bishop@utoronto.ca
A62567

The bacterial lipocalins were discovered in 1995 and first reviewed in the year 2000. In the subsequent 5 years, two important developments have been made. First, an explosion of molecular sequence information from microbial genome projects has uncovered more than 90 bacterial lipocalin sequences. The phylogenetic distribution indicates that lipocalins are absent from the archaebacteria and nonmycolic acid-producing Gram-positive bacteria, which lack a permeability barrier located extrinsic to the inner (cytoplasmic) membrane, a point of contrast with the lipocalin-encoding Gram-negative bacteria and mycolic acid-producing actinobacteria. This observation strongly supports the conclusion that bacterial lipocalins originated in association with the development of the bacterial outer membrane structure. Combined with the recent finding that the integral outer membrane enzyme PagP displays a compact lipocalin-like 8-stranded anti-parallel β-barrel fold, complete with an internal lipid-binding pocket, it becomes difficult to ignore the possibility that lipocalins might share a common origin with bacterial β-barrel membrane proteins. Second, the high resolution crystal structure of the prototypical bacterial lipocalin Blc, an outer membrane lipoprotein from E. coli, has now been solved. A recent reappraisal of the crystal packing contacts in this structure have led to the conclusion that Blc is a functional dimer with a lipid acyl-chain binding-site buried at the subunit interface. Tryptophan fluorescence-quenching experiments indicate micromolar affinities for various fatty acids and phospholipids, but nanomolar affinities for lysophospholipids. Outer membrane enzymes like PagP generate lysophospholipids as enzymatic products, suggesting that Blc might fulfil a role in outer membrane biogenesis or repair related to lysophospholipid metabolism.

Introduction

The lipocalin protein family consists mainly of small extracellular proteins that bind hydrophobic ligands and fulfill numerous biological functions including ligand transport, cryptic coloration, sensory transduction, the biosynthesis of prostaglandins, and the regulation of cellular homeostasis and immunity. The lipocalins can be structurally characterized as an 8-stranded antiparallel β-barrel followed by a C-terminal α-helix, as exemplified by serum retinol binding protein.1 The lipocalin fold is widely distributed among vertebrates and a few have been isolated from invertebrates and plants. Until 1995, when the first bacterial lipocalins were identified,2 the family was thought to be restricted to eukaryotes. The recent crystal structure of a bacterial lipocalin suggests that phospholipid derivatives are likely ligands.3 The existence of bacterial lipocalins provides insight into the origins of the lipocalin family and, combined with the powerful tools of bacterial genetics, provides fertile ground for the investigation of lipocalin structure and function.

The bacterial lipocalin Blc was first identified as an outer membrane lipoprotein of E. coli in 1995.2 Approximately 100 lipoproteins with varied structures and functions are encoded within the E. coli genome.4 Export of bacterial lipoproteins across the inner membrane is directed by a sec-dependent type-II signal peptide, which converts the amino terminal cysteine into N-acyl-S-sn-1,2-diacylglycerylcysteine. The lipidated protein remains anchored to the periplasmic surface of the inner membrane until it encounters the LolCDE ATP-binding cassette transport system, which can actively disengage the lipoprotein from the membrane and complex it with the periplasmic chaperone LolA. The LolA/lipoprotein complex moves across the periplasmic space and docks in the outer membrane with LolB, which is also a lipoprotein, to allow delivery of the lipoprotein from LolA into the periplasmic leaflet of the outer membrane.5 The outer membrane β-barrel proteins are similarly exported across the inner membrane in a process that subsequently employs dedicated chaperones to move them through the periplasmic space before folding into the outer membrane.6 Lipoproteins and integral membrane β-barrel proteins are the two major classes of proteins present in the bacterial outer membrane. Additionally, the outer membranes of Gram-negative bacteria are distinguished by an asymmetric lipid organization, with the inner and outer leaflets composed of phospholipids and lipopolysaccharides (LPS), respectively.

β-barrel membrane proteins are not normally found in the inner membrane, although there is no physical reason why they could not assemble in that location. The β-barrel structure is very effective at creating pores that would dissipate the proton motive force, suggesting that natural selection acts against their inner membrane localization. Indeed, several protein toxins act by forming β-barrel structures within energy-transducing membrane systems.7 The β-barrel structure satisfies the requirement of providing continuous hydrogen bonding between the amide nitrogen proton and carbonyl oxygen of the peptide bonds, which are otherwise too polar to interact stably with the hydrocarbon chains in a lipid bilayer membrane. Roughly 8 amino acids are sufficient to span the membrane in the extended β-conformation, and half of these residues project into the mostly polar β-barrel interior region.8 Only those residues whose registration projects them into the lipid milieu of the membrane need to possess hydrophobic side-chains. Consequently, β-barrel membrane proteins are only moderately hydrophobic, a fact that facilitates their coursing through the inner membrane and periplasmic space.9 Nature's other solution to provide continuous peptide hydrogen bonding is the transmembrane α-helix, which demands roughly 20 hydrophobic amino acid side chains in order to span the membrane.10 The greater hydrophobicity of transmembrane α-helical proteins dictates that their targeting to the inner membrane is essentially irreversible, a fact that explains why no transmembrane α-helical proteins are found in the outer membrane. The peptidoglycan exoskeleton that is sandwiched between the inner and outer membranes is believed to effectively block wholesale membrane exchange by vesicular trafficking. Consequently, outer membrane biogenesis likely depends on soluble periplasmic lipid-transfer proteins and/or localized contact bridges between inner and outer membranes.

The lipocalins represent a family of β-barrel proteins that are adapted to bind hydrophobic ligands, essentially chaperoning them through the aqueous compartments of the cell.11 Accumulating evidence supports the hypothesis that lipocalins originated in association with the development of bacterial outer membrane systems, and that bacterial lipocalins exert a function in the maintenance of the outer membrane structure. Here we update the phylogenetic distribution of bacterial lipocalins and describe some interesting new relationships between their structure and function in the outer membrane. Some aspects of bacterial lipocalin structure-function relationships that have been reviewed previously are not addressed again here.12 We suggest that interested readers consult this earlier article to gain a fuller picture of the bacterial lipocalin story.

Phylogenetic Distribution of Bacterial Lipocalins

Bacterial Lipocalins Are Codistributed with Outer Membrane Proteins

In the year 2000, 20 lipocalins were identified in Gram-negative bacteria, but that number now exceeds 90. Among the newly identified sequences are three that belong to Gram-positive bacteria, namely, Nocardia and two species of Corynebacteria. These organisms belong to the so called CMN branch (for Corynebacteria, Mycobacteria, Nocardia) of the actinomycetes. The CMN branch represents the most highly derived of the Gram-positive bacteria and do not actually display Gram-positive staining. Instead, the presence of mycolic acids in these organisms generates a permeability barrier extrinsic to the inner membrane that requires staining by the so-called “acid-fast” procedure. The CMN organisms are regarded by some as being intermediate between Gram-negative and Gram-positive bacteria, because the mycolic acid layer requires porin channels to facilitate nutrient uptake and these adopt a transmembrane β-barrel architecture13 - a decidedly Gram-negative characteristic.14 Despite this similarity, the 16s rRNA phylogeny places the CMN branch firmly within the Gram-positive actinomycetes. Therefore, the bacterial lipocalins are associated with organisms that produce either LPS or mycolic acids, the two classes of lipids associated with bacterial permeability barriers that harbor β-barrel membrane proteins. Although lipocalins do not appear among the Mycobacteria, the fatty acid binding-protein fold that resembles lipocalins has been identified in these organisms (Pedro Alzari, personal communication). The mycolic acids of Mycobacteria are unusually long and create a thicker permeability barrier compared to Nocardia and Corynebacteria. To date, no lipocalins have been identified among the Gram-positive or archaebacterial organisms that lack the defining characteristics of outer membranes.

Not All Bacterial Lipocalins Are Bacterial

Table 1

Listing of all Clade I lipocalin sequences (2005)a
OrganismsSignal PeptidebβB-βC CysteinesAccession Number
Bacteria
Alpha-Proteobacteria (α)
Agrobacterium tumefaciensSPIYesNC_003063
Caulobacter crescentus (1)SPIIYesNC_002696
Caulobacter crescentus (2)SPIYesNC_002696
Hyphomonas neptuniumSPIIYesTIGR (incomplete)
Mesorhizobium loti (1)NoneYesNC_002678
Mesorhizobium loti (2)NoneYesNC_002678
Rhodobacter sphaeroidesSPIIYesAAAE01000158
Rhodospirillum rubrumSPIINoAAAG02000001
SilicibacterSPIYesNZ_AAFG01000010
Beta-Proteobacteria (β)
Acidovorax (partial)SPIINoAB044565
Bordetella bronchisepticaSPIYesNC_002927
Bordetella parapertussisSPIYesNC_002928
Burkholderia cepacia (1)SPIIYesAAEH01000003
Burkholderia cepacia (2)SPIINoAAEH01000003
Burkholderia fungorum (1)SPIINoNZ_AAAJ03000001
Burkholderia fungorum (2)SPIINoNZ_AAAJ03000001
Burkholderia fungorum (3)SPIIYesNZ_AAAJ03000004
Chromobacterium violaceumSPIINoNC_005085
Dechloromonas aromaticaSPIYesAADF01000001
Methylobacillus flagellatusSPIINoAADX01000001
Ralstonia eutrophaSPIINoAADY01000001
Rubrivivax gelatinosusSPIYesAAEM01000005
Thiobacillus denitrificansSPIIYesAAFH01000001
Gamma-Proteobacteria (γ)
Acinetobacter (1)SPIYesCR543861
Acinetobacter (2)SPIIYesCR543861
Azotobacter vinelandiiSPIIYesNZ_AAAU02000004
Citrobacter braakii (partial)SPIINoAF492447
Citrobacter freundiiSPIINoU21727
Citrobacter murliniaeSPIINoAJ607409
Citrobacter werkmanii (partial)SPIINoAF492448
Colwellia psychroerythraeaSPIINoTIGR (Incomplete)
Enterobacter cancerogenus (partial)SPIINoAF492446
Enterobacter nimipressuralisSPIINoAJ487975
Erwinia carotovoraSPIINoBX950851
Escherichia coliSPIINoP39281
Francisella tularensisSPIINoAY774926
Idiomarina loihiensisSPIINoNC_006512
Klebsiella oxytocaSPIINoY17716
Klebsiella pneumoniaeSPIINoIncomplete
Pseudomonas aeruginosaSPIIYesAABQ07000004
Pseudomonas fluorescensSPIIYesAAAT03000001
Pseudomonas putida (1)SPIIYesNC_002947
Pseudomonas putida (2)SPIIYesNC_002947
Gamma-Proteobacteria (γ)
Pseudomonas syringaeSPIYesAABP02000002
Salmonella paratyphiSPIINoIncomplete
Salmonella typhiSPIINoNC_006511
Salmonella typhimuriumSPIINoAE008903
Shewanella oneidensisSPIINoAE015615
Shigella flexneriSPIINoNC_004741
Vibrio cholerae (1)SPIINoNC_002506
Vibrio cholerae (2)SPIINoNC_002506
Vibrio cholerae (3)SPIINoNC_002506
Vibrio cholerae (4)SPIINoNC_002506
Vibrio parahaemolyticusSPIINoNC_004605
Vibrio vulnificusSPIINoNC_004460
Xanthomonas axonopodis (1)SPINoNC_003919
Xanthomonas axonopodis (2)SPINoNC_003919
Xanthomonas axonopodis (3)NoneYesNC_003919
Xanthomonas axonopodis (4)SPIYesNC_003919
Xanthomonas campestris (1)SPINoNC_003902
Xanthomonas campestris (2)SPINoNC_003902
Xanthomonas campestris (3)NoneYesNC_003902
Xanthomonas campestris (4)SPIYesNC_003902
Yersinia enterocoliticaSPIINoSanger (Incomplete)
Delta-Proteobacteria (δ)
Bdellovibrio bacteriovorus (1)SPIYesNC_005363
Bdellovibrio bacteriovorus (2)SPIYesNC_005363
Desulfotalea psychrophilaSPIINoNC_006138
Geobacter sulfurreducensSPIIYesNC_002939
Epsilon-Proteobacteria (ε)
Campylobacter jejuniNoneYesAL139078
Campylobacter lariNoneYesNZ_AAFK01000002
Chlorobi
Chlorobium tepidum (1)SPIINoNC_002932
Chlorobium tepidum (2)NoneYesNC_002932
Bacteroidetes
Bacteroides fragilisNoneNoNC_006347
Bacteroides thetaiotaomicronNoneNoNC_004663
Cytophaga hutchinsoniiSPINoAABD03000002
Verrucomicrobia
Verrucomicrobium spinosum (1)SPIINoTIGR (incomplete)
Verrucomicrobium spinosum (2)SPIINoTIGR (incomplete)
Chlamydiae
ParachlamydiaSPIYesNC_005861
Cyanobacteria
Gloeobacter violaceusSPIIYesNC_005125
Actinobacteria
Corynebacterium efficiensSPIYesNC_004369
Corynebacterium glutamicumSPIYesNC_003450
Nocardia farcinicaSPIYesNC_006361
Eukaryota
Viridiplantae
Arabidopsis thalianaNoneNoAY085685
Capsicum annuumNoneNoAY568589
Lycopersicon esculentumNoneNoBT013102
Oryza sativaNoneNoXM_466697
Triticum aestivumNoneNoAY077702
Zea maysNoneNoAY107141
Saccharomycetes
Debaryomyces hanseniiERYesXM_460369
Metazoa (Outgroup)
ApoDERNoP05090
LazarilloER/GPINoP49291

a A TBLASTN44 search using the E. coli Blc protein sequence was performed on both the microbial genome databases and the non-redundant database at the NCBI website (http://www.ncbi.nlm.nih.gov). The genome sequence 500 bp upstream and downstream of each hit was retrieved. The six resulting translations were evaluated to identify both the full-length lipocalin sequence and the presence of the 16s ribosomal RNA binding site. The taxonomy of each organism was also retrieved using the NCBI TaxBrowser. b Each amino acid sequence was evaluated for the presence of signal peptides and these were assigned using SIGNALP.45 SPI: signal peptidase I; SPII: signal peptidase II; ER: endoplasmic reticulum; GPI: glycosylphosphatidylinositol.

BLAST searches using E. coli Blc as a probe can usually discern between bacterial and eukaryotic lipocalins, because the latter sequences display a number of obvious insertions/ deletions. For example, apolipoprotein D (ApoD) and lazarillo, members of a group of eukaryotic lipocalins closely related to Blc, possess an insertion between β-strands G and H that is absent from Blc.12 However, several eukaryotic lipocalins cannot easily be distinguished from the bacterial homologues (Table 1). Most striking among these are a group of six bacterial-like lipocalins from plants. Chromosome 5 of Arabidopsis thaliana encodes a lipocalin that appears to contain a single 84 nucleotide intron with the typical 5'-donor and 3'-acceptor splice sites. Removal of the intron yields a 186 amino acid polypeptide, which exhibits greatest similarity with the bacterial lipocalins. Plant genomes are unique among eukaryotes in that some encode most of the enzymes needed to make the hydrophobic anchor of Gram-negative LPS known as lipid A or endotoxin.15 Green plants are thought to have acquired plastids by a recent endosymbiosis with Gram-negative cyanobacteria, and might consequently have retained some outer membrane components.16 These same components were apparently lost from the mitochondrion after it was derived by an earlier endosymbiosis with a Gram-negative α-proteobacterium.17 Indeed, at least one lipocalin is encoded in the genome of the cyanobacterium G. violaceus. The six bacterial-like lipocalins of plants lack any obvious signal sequence, suggesting they are soluble cytosolic proteins. The eukaryotic acquisition of bacterial genes that code for cytosolic functions is now thought to be an inevitable consequence of endosymbiosis.18 Another bacterial-like lipocalin is derived from the marine yeast D. hansenii, and this protein exhibits a typical eukaryotic signal peptide that likely targets it to the endoplasmic reticulum.

Two Sub-Clades of Bacterial Lipocalins

An external file that holds a picture, illustration, etc., usually as some form of binary object. The name of referred object is ch150f1.jpg.

Figure 1

.

Evolutionary relationships between bacterial lipocalins and their closest eukaryotic descendants. Signal peptides were removed from the lipocalin sequences listed in (Table 1). Partial sequences were excluded from the multiple sequence alignment, which was constructed using CLUSTAL W 1.83,42 with default parameters (BLOSUM62 scoring matrix, gap opening penalty = -12, gap extension penalty = -2). In order to obtain an alignment that reflects lipocalin structure, a profile alignment and a gap penalty mask were constructed using the structural alignment and secondary structure information obtained from the published Blc structure.3 Sequences were then added to the profile alignment one at a time, in increasing order of divergence from the E. coli Blc sequence. Once the multiple sequence alignment was complete, the profile alignment and gap penalty mask were removed. The neighbor-joining distance tree was then constructed using PROTDIST, with the Dayhoff option invoked, and NEIGHBOR, both from the PHYLIP 3.63 suite of programs (http://evolution.genetics.washington.edu/phylip.html). Confidence limits of branches were estimated from 100 bootstrap replications using SEQBOOT and CONSENSE. Bootstrap values greater than 50 are shown at the nodes. The tree was visualized using TREEVIEW43 and rooted with ApoD and lazarillo. Asterisks mark those Clade IB lipocalins that possess conserved cysteines flanking β-strands B and C. Colors serve to identify secreted eukaryotic proteins (red), bacterial lipoproteins (black), secreted bacterial proteins (blue), and cytosolic proteins (green). Brackets identify taxonomic associations and the Greek symbols signify subdivisions of proteobacteria; those without brackets belong to the γ-proteobacteria.

An external file that holds a picture, illustration, etc., usually as some form of binary object. The name of referred object is ch150f2.jpg.

Figure 2

.

Relationships between apparent disulfide bonding patterns in lazarillo, ApoD, B. fungorum2, and G. violaceus lipocalins. Amino acid residues in the surface loop between β-strands G and H are shown for lazarillo, and as a consensus sequence for the ApoDs; however, the intermolecular disulfide bond with ApoA2 is unique to human ApoD. The apparent disulfide bond flanking β-strands B and C in the G. violaceus lipocalin is present in the majority of Clade IB lipocalins. The putative secondary structural elements are indicated as β-sheet (blue arrows), α-helix (red rectangles), or signal sequences (green rectangles). Diglyceride moieties of the lipid anchors in Blc and lazarillo are shown as black and yellow rectangles. The sugars of the GPI anchor of lazarillo are mannose (green squares), glucosamine (red circle), and inositol (blue diamond).

The bacterial lipocalins are defined as belonging to Clade I,19 but we can divide them further into two groups distinguished by the absence (Clade IA) or presence (Clade IB) of a lipocalin subgroup that displays a pair of cysteine residues flanking β-strands B and C in an apparent disulfide-bonding configuration (Fig. 1). The bacterial-like lipocalins from plants belong to a subgroup within Clade IB that appears to have lost this cysteine pair, whereas the D. hansenii lipocalin displays the apparent disulfide. These observations can be rationalized with the reducing cytosolic and oxidizing extracellular environments to which these lipocalins are targeted. Inexplicably, some cytosolic lipocalins within Clade IB still retain this pair of cysteine residues. The two nonconsecutive disulfides found in the closely related group of eukaryotic lipocalins that include ApoD and lazarillo each appear to have their own antecedents within the B. fungorum2 and G. violaceus lipocalins from Clade IB (Fig. 2). Thus, it appears that Clade IB lipocalins are more closely related to the eukaryotic lipocalins than are those from Clade IA. An interesting feature within Clade IA is the N-terminal fusion with a short-chain dehydrogenase/reductase domain in two Bacteroides lipocalins. Similar domains are employed for carbohydrate epimerization during the synthesis of sugars needed for LPS biosynthesis.20 Intriguingly, the Chlorobium genome reveals examples of both Clade IA and Clade IB bacterial lipocalins, suggesting that an early gene duplication event produced these two extant groups.

Vertical versus Horizontal Descent

In a previous analysis, we interpreted the absence of a homologue from an archaebacterium as indicating that lipocalins could not have existed in the last common ancestor of the three domains of life, and that they were probably acquired in eukaryotes horizontally by endosymbiosis.12 Horizontal transfer is probably partly correct, especially in light of the bacterial-like lipocalins of plants and yeast described here. However, a recent study from Rivera and Lake suggests that a genome fusion event occurred between an archaebacterium and a proteobacterium to create the first eukaryotic cell; this framework provides a more parsimonious explanation for the acquisition of most eukaryotic lipocalins by vertical descent from the proteobacterial component of the ancestral chimera.21 This model builds on ideas from Gupta,22 who proposes that monoderm Gram-positive bacteria are ancestral to both the monoderm archaebacteria and the diderm Gram-negative bacteria/CMN actinomycetes.23 Our observations clearly fit with a model where the lipocalins represent a late development in bacterial evolution, having arisen in the diderm bacteria. However, this is not so late a development as to preclude ancestry to the first eukaryotic cell according to the archaebacterial-proteobacterial fusion model.21 The earlier three domain model predicts that proteins acquired vertically by eukaryotes (Eukarya) will likely have ancestors in the archaebacteria (Archaea) if ancestors already exist in the eubacteria (Bacteria).24

Bacterial Lipocalin Structure and Function

Structural Similarities between PagP and Retinol Binding Protein

An external file that holds a picture, illustration, etc., usually as some form of binary object. The name of referred object is ch150f3.jpg.

Figure 3

.

Comparison of PagP and serum retinol-binding protein. PagP (A) is shown in green (Protein Data Bank code 1THQ), and RBP (B) is shown in red (Protein Data Bank code 1AQB). The lauroyldimethylamine-N-oxide and retinol ligands are represented as spheres using CPK colors. PagP has an N-terminal α-helix, while most lipocalin structures, including RBP, have a single α-helix at the C-terminus. Both proteins are oriented so the β-strands correspond to each other; note that the first two β-strands of PagP are separated by the disordered L1 surface loop.

A striking similarity is seen between the structures of the β-barrel outer membrane protein PagP and the prototypic lipocalin serum retinol-binding protein (RBP).25,26 Both PagP and RBP are 8-stranded antiparallel β-barrels with a deep lipid-binding pocket located inside one end of the barrel and a flanking α-helix at the other end (Fig. 3A and B). The structures superpose with the proper alignment of the β-strands (i.e., A-H of PagP superposes with A-H of RBP) with an r.m.s.d. of 2.9 Å over 93 Cα atoms. A structural similarity between the outer membrane protein OmpA and the lipocalins has also been noted, and several of the key differences between these proteins was pointed out, including the length of the barrels and the distinctly different protein interiors.27 However, our case for similarity is greater because the PagP and RBP barrels have a higher percentage of alignable positions, and have similar inner lipid-binding pockets. Furthermore, in the lipocalins, a long loop spans strands A and B and typically forms a lid that partially closes the ligand-binding site.11 It is intriguing that the long L1 loop from PagP is at an equivalent position and is strongly implicated in the function of this enzyme,28,29 however, additional structural data will be required to test whether this loop interacts directly with ligand and/or the upper region of the barrel.

We cannot rule out the possibility that this structural similarity represents a case of convergent evolution. However, based on the absence of lipocalins from those bacteria that also lack β-barrel membrane proteins, we cannot currently exclude the hypothesis that PagP and lipocalins share a common ancestor. This hypothesis is supported by the observation that many bacterial lipocalins are anchored in the outer membrane as lipoproteins.2 The lipocalin-lipoprotein might represent an intermediate state in the adaptation between membrane-bound and soluble globular domains. Outer membrane β-barrel domains should be suitably adapted to occupy both soluble and membrane-bound conformations because both environments are encountered during the outer membrane assembly process.6

Structure and Function of the Bacterial Lipocalin Blc

An external file that holds a picture, illustration, etc., usually as some form of binary object. The name of referred object is ch150f4.jpg.

Figure 4

.

X-ray structure of Blc in complex with vaccenic acid. A) Ribbon view of Blc with vaccenic acid inside (spheres). Monomer B (left) is pink; monomer A (right) is rainbow colored from N- (blue) to C-terminus (red). B) Compact view of the Blc dimer attached to the membrane with anchor lipid S-sn-1,2-diacylglycerylcysteine attached to the first cysteine (computer model). Monomer B (right) is pink and monomer A (left) is blue; vaccenic acid and lipidic anchors are represented as spheres.

E. coli Blc was recently cocrystallized in the presence of the 18-carbon unsaturated fatty acid known as vaccenic acid, and data collected at 1.8 Å resolution (V. Campanacci, R.E. Bishop, L. Reese, S. Blangy, M. Tegoni, and C. Cambillau; submitted). The complex is isostructural with that of native Blc and two monomers are contained in the asymmetric unit. The Blc monomer has a typical lipocalin fold consisting of a β-barrel with eight anti-parallel strands and an a-helix at the C-terminus. Inside the cavity of monomer B, a well defined elongated electron density could easily accommodate a vaccenic acid molecule (Fig. 4A). The molecule is bent at the position of the cis-double bond (Z11-12). Careful examination of the relationship between monomers A and B revealed that they interact tightly together: the interaction covers 786 Å2 and 825 Å2 of water accessible surface (WAS) area, on monomer A and B, respectively. Given that the total WAS area of each monomer is 7800 Å2, the buried WAS area represents ˜10% of the total, a value indicating that dimerization is not a crystallization artefact.30 For comparison, these values of interacting surface area are comparable to those observed in immunoglobulin fragments/protein complexes.31 The interaction between the two monomers was present and essentially identical in the original native structure, but it escaped earlier observation and was not described.3

The comparison of subunits A and B indicate significant conformational differences that might be necessary for dimerization. In order to fit within the other monomer, strands F and G, and particularly the loops between them, move up to 5 Å in one monomer relative to the other. The dimer interface involves in large part these loops, and other residues among which the aromatics Tyr113, Tyr137, and several surface-exposed phenylalanines including Phe53 A & B, Phe108 A & B, Phe109 A & B and Phe112 A & B together form an inter-dimer hydrophobic core. The presence of many exposed phenylalanines is an uncommon feature at the surface of a monomeric globular protein, and has to be regarded as a hallmark of protein-protein or protein-lipid interactions. Both subunits are involved in the binding site of Blc, accounting for the stoichiometry of one vaccenic acid molecule per dimer. The vaccenic acid interacts with both subunits and covers 89 Å2 and 171 Å2 of the WAS area of monomers A and B, respectively.

It should be stressed that Blc is anchored to membranes by a covalently attached lipid modification at the amino terminus of the protein. The manner in which the lipid-anchor of each monomer is arranged relative to the other is a critical issue. The Blc construct employed has the signal peptide and the first four residues (CSSP) removed and replaced by the Gateway ATTB1 sequence.3 We have modelled into the dimeric structure the four original residues of Blc, as well as the major part of the anchoring lipid, S-sn-1,2-diacylglycerylcysteine, that we included at each N-terminal cysteine. This model reveals that the lipid-anchors of each of the 2 monomers are close to each other in the dimer, and are located on the same face in a geometry compatible with the insertion and stabilization of the dimer into the membrane (Fig. 4B). The binding site for vaccenic acid is located opposite to the membrane insertion site where it is expected to face the periplasmic space.

Tryptophan fluorescence quenching studies indicate that dimeric Blc binds fatty acids and phospholipids in a micromolar Kd range. An exposed and unfilled pocket seemingly suited to bind a polar group extending from the fatty acid prompted investigation of lysophospholipids (LPLs), which were found to bind in a nanomolar Kd range. Favorable steric and electrostatic interactions are observed when a glycerophosphoethanolamine moiety is modeled into this region of the structure. Given the high affinity of Blc for LPLs, it seems likely that Blc might fulfil a role in cell envelope LPL transport. Although LPLs are key inner membrane intermediates of phospholipid metabolism, we do not know of any evidence to indicate that they are exported to the outer membrane. However, exogenously supplied LPLs can be taken up by deep-rough LPS mutants and converted by reacylation into glycerophospholipids using inner membrane-associated enzymes.32-34 To date, no accessory factors needed for LPL uptake have been identified. In wild-type cells, at least two enzymes can generate LPLs in the outer membrane, namely, the phospholipase OMPLA35 and the lipid A palmitoyltransferase PagP.36 Both enzymes preferentially generate the sn-1 LPL regioisomers, but these are known to spontaneously rearrange into the more stable sn-2 LPLs.37 Only sn-2 LPLs were tested for binding to Blc.

The LPL products sn-2-lysophosphatidic acid and sphingosine-1-phosphate, together with the structurally related platelet activating factors, have been shown to function as potent biological mediators.38,39 In eukaryotic cells, sn-2 LPLs can be generated by phospholipases that mobilize arachidonic acid for eicosanoid-mediated signal transduction. Additionally, sn-2 LPLs are enzymatic products of the lecithin:cholesterol acyltransferase LCAT, which is, together with ApoD, a key component of the plasma high density lipoprotein particle. Mammalian ApoD and Drosophila lazarillo are the closest eukaryotic homologues of Blc and the only eukaryotic lipocalins that, like Blc, are anchored to lipid membranes. The ligand for lazarillo is unknown, but ApoD binds a variety of ligands including arachidonic acid.40 Perhaps the high affinity of Blc for LPLs might help shed some light on the enigmatic functions of lazarillo and ApoD.

Conclusions

The bacterial lipocalin Blc is distinguished from eukaryotic lipocalins by its dimeric organization with a lipid-binding pocket at the subunit interface and by its high affinity for LPLs. We have indicated that the structural and functional organization of Blc shares some common features with β-barrel membrane proteins. The structural similarity between PagP and lipocalins is illustrative of this point, but, in the absence of measurable amino acid sequence identity, structural similarity does not by any means indicate common ancestry. In fact, we would normally regard the similarity between PagP and lipocalins as an example of convergent evolution, and other similarities between lipocalins and a polyisoprenoid-binding protein, for example,41 indicate that this structural organization has probably arisen independently on more than one occasion in the past; even the lipoprotein chaperones LolA and LolB are lipid-interactive β-barrel proteins.5 However, the tight association of bacterial lipocalins with those organisms that encode outer membrane β-barrel proteins like PagP makes it difficult to pronounce a verdict of convergent evolution. Combined with the functional association of bacterial lipocalins with outer membranes, we cannot ignore the possibility that lipocalins could have either engendered β-barrel membrane proteins, or vice versa. Whatever the case may be, bacterial lipocalins promise to reveal molecular details of lipid membrane biogenesis relevant to both bacteria and higher organisms, and we hope our present insight into their plausible origins will serve to guide experiments aimed at addressing this subject.

Acknowledgements

Work in the laboratories of REB, GGP, and ERMT was supported by operating grants from the Canadian Institutes of Health Research. Work in the laboratory of CC was supported by the French Ministry of Industry (grant ASG) and the Marseille-Nice Genopole.

References
1.
Newcomer ME, Jones TA, Aqvist J. et al. The three-dimensional structure of retinol-binding protein. Embo J. 1984; 3: 14511454. [PubMed] [Free Full Text in PMC icon.Free Full text in PMC]
2.
Bishop RE, Penfold SS, Frost LS. et al. Stationary phase expression of a novel Escherichia coli outer membrane lipoprotein and its relationship with mammalian apolipoprotein D. Implications for the origin of lipocalins. J Biol Chem. 1995; 270: 2309723103. [PubMed]
3.
Campanacci V, Nurizzo D, Spinelli S. et al. The crystal structure of the Escherichia coli lipocalin Blc suggests a possible role in phospholipid binding. FEBS Lett. 2004; 562: 183188. [PubMed]
4.
Gonnet P, Rudd KE, Lisacek F. Fine-tuning the prediction of sequences cleaved by signal peptidase II: A curated set of proven and predicted lipoproteins of Escherichia coli K-12. Proteomics. 2004; 4: 15971613. [PubMed]
5.
Tokuda H, Matsuyama S. Sorting of lipoproteins to the outer membrane in E. coli. Biochim Biophys Acta. 2004; 1693: 513. [PubMed]
6.
Mogensen JE, Otzen DE. Interactions between folding factors and bacterial outer membrane proteins. Mol Microbiol. 2005; 57: 326346. [PubMed]
7.
Montoya M, Gouaux E. Beta-barrel membrane protein folding and structure viewed through the lens of alpha-hemolysin. Biochim Biophys Acta. 2003; 1609: 1927. [PubMed]
8.
Schulz GE. The structure of bacterial outer membrane proteins. Biochim Biophys Acta. 2002; 1565: 308317. [PubMed]
9.
Tamm LK, Hong H, Liang B. Folding and assembly of beta-barrel membrane proteins. Biochim Biophys Acta. 2004; 1666: 250263. [PubMed]
10.
White SH, Ladokhin AS, Jayasinghe S. et al. How membranes shape protein structure. J Biol Chem. 2001; 276: 3239532398. [PubMed]
11.
Flower DR, North AC, Sansom CE. The lipocalin protein family: Structural and sequence overview. Biochim Biophys Acta. 2000; 1482: 924. [PubMed]
12.
Bishop RE. The bacterial lipocalins. Biochim Biophys Acta. 2000; 1482: 7383. [PubMed]
13.
Faller M, Niederweis M, Schulz GE. The structure of a mycobacterial outer-membrane channel. Science. 2004; 303: 11891192. [PubMed]
14.
Nikaido H. Molecular basis of bacterial outer membrane permeability revisited. Microbiol Mol Biol Rev. 2003; 67: 593656. [PubMed] [Free Full Text in PMC icon.Free Full text in PMC]
15.
Raetz CR, Whitfield C. Lipopolysaccharide endotoxins. Annu Rev Biochem. 2002; 71: 635700. [PubMed] [Free Full Text in PMC icon.Free Full text in PMC]
16.
Delwiche CF, Palmer JD. The origin of plastids and their spread via secondary symbiosis. Pl Syst Evol. 1997; 11(Suppl): 5386.
17.
Gray MW, Burger G, Lang BF. Mitochondrial evolution. Science. 1999; 283: 14761481. [PubMed]
18.
Doolittle WF. You are what you eat: A gene transfer ratchet could account for bacterial genes in eukaryotic nuclear genomes. Trends Genet. 1998; 14: 307311. [PubMed]
19.
Gutierrez G, Ganfornina MD, Sanchez D. Evolution of the lipocalin family as inferred from a protein sequence phylogeny. Biochim Biophys Acta. 2000; 1482: 3545. [PubMed]
20.
Breazeale SD, Ribeiro AA, Raetz CR. Oxidative decarboxylation of UDP-glucuronic acid in extracts of polymyxin-resistant Escherichia coli. Origin of lipid A species modified with 4-amino-4-deoxy-L-arabinose. J Biol Chem. 2002; 277: 28862896. [PubMed]
21.
Rivera MC, Lake JA. The ring of life provides evidence for a genome fusion origin of eukaryotes. Nature. 2004; 431: 152155. [PubMed]
22.
Golding GB, Gupta RS. Protein-based phylogenies support a chimeric origin for the eukaryotic genome. Mol Biol Evol. 1995; 12: 16. [PubMed]
23.
Gupta RS. Protein phylogenies and signature sequences: A reappraisal of evolutionary relationships among archaebacteria, eubacteria, and eukaryotes. Microbiol Mol Biol Rev. 1998; 62: 14351491. [PubMed] [Free Full Text in PMC icon.Free Full text in PMC]
24.
Woese CR. Bacterial evolution. Microbiol Rev. 1987; 51: 221271. [PubMed] [Free Full Text in PMC icon.Free Full text in PMC]
25.
Ahn VE, Lo EI, Engel CK. et al. A hydrocarbon ruler measures palmitate in the enzymatic acylation of endotoxin. Embo J. 2004; 23: 29312941. [PubMed] [Free Full Text in PMC icon.Free Full text in PMC]
26.
Bishop RE. The lipid A palmitoyltransferase PagP: Molecular mechanisms and role in bacterial pathogenesis. Mol Microbiol. 2005; 57: 900912. [PubMed]
27.
Pautsch A, Schulz GE. High-resolution structure of the OmpA membrane domain. J Mol Biol. 2000; 298: 273282. [PubMed]
28.
Hwang PM, Choy WY, Lo EI. et al. Solution structure and dynamics of the outer membrane enzyme PagP by NMR. Proc Natl Acad Sci USA. 2002; 99: 1356013565. [PubMed] [Free Full Text in PMC icon.Free Full text in PMC]
29.
Hwang PM, Bishop RE, Kay LE. The integral membrane enzyme PagP alternates between two dynamically distinct states. Proc Natl Acad Sci USA. 2004; 101: 96189623. [PubMed] [Free Full Text in PMC icon.Free Full text in PMC]
30.
Miller S, Lesk AM, Janin J. et al. The accessible surface area and stability of oligomeric proteins. Nature. 1987; 328: 834836. [PubMed]
31.
Desmyter A, Spinelli S, Payan F. et al. Three camelid VHH domains in complex with porcine pancreatic alpha-amylase. Inhibition and versatility of binding topology. J Biol Chem. 2002; 277: 2364523650. [PubMed]
32.
McIntyre TM, Bell RM. Escherichia coli mutants defective in membrane phospholipid synthesis: Binding and metabolism of 1-oleoylglycerol 3-phosphate by a plsB deep rough mutant. J Bacteriol. 1978; 135: 215226. [PubMed] [Free Full Text in PMC icon.Free Full text in PMC]
33.
Hsu L, Jackowski S, Rock CO. Uptake and acylation of 2-acyl-lysophospholipids by Escherichia coli. J Bacteriol. 1989; 171: 12031205. [PubMed] [Free Full Text in PMC icon.Free Full text in PMC]
34.
Hsu L, Jackowski S, Rock CO. Isolation and characterization of Escherichia coli K-12 mutants lacking both 2-acyl-glycerophosphoethanolamine acyltransferase and acyl-acyl carrier protein synthetase activity. J Biol Chem. 1991; 266: 1378313788. [PubMed]
35.
Snijder HJ, Ubarretxena-Belandia I, Blaauw M. et al. Structural evidence for dimerization-regulated activation of an integral membrane phospholipase. Nature. 1999; 401: 717721. [PubMed]
36.
Jia W, Zoeiby AE, Petruzziello TN. et al. Lipid trafficking controls endotoxin acylation in outer membranes of Escherichia coli. J Biol Chem. 2004; 279: 4496644975. [PubMed]
37.
Pluckthun A, Dennis EA. Acyl and phosphoryl migration in lysophospholipids: Importance in phospholipid synthesis and phospholipase specificity. Biochemistry. 1982; 21: 17431750. [PubMed]
38.
Anliker B, Chun J. Lysophospholipid G protein-coupled receptors. J Biol Chem. 2004; 279: 2055520558. [PubMed]
39.
Prescott SM, Zimmerman GA, Stafforini DM. et al. Platelet-activating factor and related lipid mediators. Annu Rev Biochem. 2000; 69: 419445. [PubMed]
40.
Morais Cabral JH, Atkins GL, Sanchez LM. et al. Arachidonic acid binds to apolipoprotein D: Implications for the protein's function. FEBS Lett. 1995; 366: 5356. [PubMed]
41.
Handa N, Terada T, Doi-Katayama Y. et al. Crystal structure of a novel polyisoprenoid-binding protein from Thermus thermophilus HB8. Protein Sci. 2005; 14: 10041010. [PubMed] [Free Full Text in PMC icon.Free Full text in PMC]
42.
Thompson JD, Higgins DG, Gibson TJ. CLUSTAL W: Improving the sensitivity of progressive multiple sequence alignment through sequence weighting, position-specific gap penalties and weight matrix choice. Nucleic Acids Res. 1994; 22: 46734680. [PubMed] [Free Full Text in PMC icon.Free Full text in PMC]
43.
Page RD. TreeView: An application to display phylogenetic trees on personal computers. Comput Appl Biosci. 1996; 12: 357358. [PubMed]
44.
Altschul SF, Gish W, Miller W. et al. Basic local alignment search tool. J Mol Biol. 1990; 215: 403410. [PubMed]
45.
Nielsen H, Brunak S, von Heijne G. Machine learning approaches for the prediction of signal peptides and other protein sorting signals. Protein Eng. 1999; 12: 39. [PubMed]
Help ǀ Contact Bookshelf
Madame Curie Bioscience Database
(navigation arrows) Go to previous chapter Go to next chapter Go to top of this page Go to bottom of this page Go to Table of Contents