![]() | ![]() |
Formats:
|
|||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Copyright : © 2006 Dahlbäck et al. This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited. Epitope Mapping and Topographic Analysis of VAR2CSA DBL3X Involved in P. falciparum Placental Sequestration 1Centre for Medical Parasitology, University of Copenhagen and Copenhagen University Hospital (Rigshospitalet), Copenhagen, Denmark 2Center for Biological Sequence Analysis, BioCentrum, Technical University of Denmark, Kongens Lyngby, Denmark 3Institut de Recherche pour le Développement, Faculté de Pharmacie, Paris, France Alister Craig, Editor Liverpool School of Tropical Medicine, United Kingdom #Contributed equally. * To whom correspondence should be addressed. E-mail: salanti/at/cmp.dk Received August 2, 2006; Accepted October 10, 2006. This article has been cited by other articles in PMC.Abstract Pregnancy-associated malaria is a major health problem, which mainly affects primigravidae living in malaria endemic areas. The syndrome is precipitated by accumulation of infected erythrocytes in placental tissue through an interaction between chondroitin sulphate A on syncytiotrophoblasts and a parasite-encoded protein on the surface of infected erythrocytes, believed to be VAR2CSA. VAR2CSA is a polymorphic protein of approximately 3,000 amino acids forming six Duffy-binding-like (DBL) domains. For vaccine development it is important to define the antigenic targets for protective antibodies and to characterize the consequences of sequence variation. In this study, we used a combination of in silico tools, peptide arrays, and structural modeling to show that sequence variation mainly occurs in regions under strong diversifying selection, predicted to form flexible loops. These regions are the main targets of naturally acquired immunoglobulin gamma and accessible for antibodies reacting with native VAR2CSA on infected erythrocytes. Interestingly, surface reactive anti-VAR2CSA antibodies also target a conserved DBL3X region predicted to form an α-helix. Finally, we could identify DBL3X sequence motifs that were more likely to occur in parasites isolated from primi- and multigravidae, respectively. These findings strengthen the vaccine candidacy of VAR2CSA and will be important for choosing epitopes and variants of DBL3X to be included in a vaccine protecting women against pregnancy-associated malaria. Synopsis Pregnancy-associated malaria caused by Plasmodium falciparum is characterized by the accumulation of parasite-infected red blood cells in the placenta and is a major health problem in Africa. VAR2CSA is a parasite protein expressed on the surface of malaria-infected red blood cells and mediates the binding to the placental receptor, chondroitin sulphate A. It is believed that a vaccine based on VAR2CSA will protect pregnant women against the adverse effects of pregnancy-associated malaria. However, due to the size and polymorphism of VAR2CSA it is required to define smaller regions that can be included in a vaccine and to analyze the degree and consequences of sequence variation to ensure a broadly protective immune response. The authors have characterized the chondroitin sulphate A-binding DBL3X domain of VAR2CSA with regard to epitopes targeted by naturally acquired antibodies and the influence of sequence variation by bioinformatics and experimental data based on a VAR2CSA peptide array. They identify both variable and conserved surface-exposed epitopes that are targets of naturally acquired immunoglobulin gamma in pregnant women with placental malaria. These findings will be imperative for choosing epitopes and variants of DBL3X to be included in a vaccine protecting pregnant women against malaria. Introduction Individuals living in areas with high Plasmodium falciparum transmission acquire immunity to malaria over time and adults have markedly reduced risk of getting severe disease [1]. Pregnant women constitute an important exception to this rule, and this has severe consequences for both mother and child [2]. Pregnancy-associated malaria (PAM) is characterized by selective accumulation of P. falciparum in the intervillous blood spaces of the placenta [3,4]. The main pathophysiological consequences of PAM are delivery of low birth weight babies and maternal anaemia [5]. In areas of high parasite transmission PAM affects mainly primigravidae as immunity is acquired as a function of parity [2]. Parasite sequestration in the placenta is mediated by an interaction between chondroitin sulphate A (CSA) on the syncytiotrophoblasts and proteins expressed on the surface of infected erythrocytes [6]. VAR2CSA, a single and uniquely structured molecule belonging to the Plasmodium falciparum erythrocyte membrane protein 1 (PfEMP1) family, is currently believed to be the main parasite ligand for placental binding [7]. Var2csa is markedly up-regulated in P. falciparum selected in vitro to bind to CSA [7] and in parasites isolated from the placenta [8]. Antibodies to the surface-expressed VAR2CSA are acquired by women exposed to malaria during pregnancy [9,10], and high levels of anti-VAR2CSA antibodies at delivery are associated with protection from low birth weight [9]. Furthermore, it has been demonstrated that targeted disruption of var2csa results in the loss of [11], or a marked reduction [12] in the ability of parasites to adhere to CSA. Based on these findings, VAR2CSA is recognized as the leading PAM vaccine candidate; however, var2csa is a polymorphic gene and the sequence variation between genes from different parasites ranges from 10%–30% at the nucleic acid level [7,13]. It is thus a major challenge for vaccine development to characterize the importance of the sequence variation and to define smaller epitopes that can be used in a vaccine to protect women against PAM. This study had two objectives. Firstly, to characterize the epitopes of the CSA-binding Duffy-binding-like (DBL) 3X domain of VAR2CSA, which are recognized by naturally acquired antibodies. Secondly, to analyze the degree and consequences of sequence variation and selection pressure within the var2csa subfamily, using the var2csa cDNA sequences of a large number of fresh placental parasite isolates. These studies would also test the validity of B cell epitope predictions and structural modeling of the DBL3X domain. Results/Discussion Recombinant VAR2CSA DBL3X Binds CSA and the Affinity Depends on the Primary Amino Acid Sequence It has previously been shown that DBL3X expressed on the surface of Chinese hamster ovary cells binds CSA in vitro [14]. However, it is important to test the CSA-binding properties of secreted VAR2CSA proteins produced in expression systems that could allow for larger-scale production of a vaccine. For this study, recombinant HIS-tagged proteins were produced in Baculovirus-transfected insect cells and binding to CSA was determined in an enzyme-linked immunosorbent assay (ELISA) system. It has previously been suggested that the FCR3 DBL3X domain binds CSA, whereas the 3D7 DBL3X domain does not [14]. We found that both variants had affinity to CSA and that the 3D7 variant exhibited the strongest binding. Binding of both FCR3 and 3D7 DBL3X was concentration-dependent (Figure 1
The Structure of the VAR2CSA DBL3X Domain Can Be Modeled on the Basis of the Structure of the DBL Domains of Erythrocyte-Binding Protein-175 Recently, the structures of the two DBL domains (F1 and F2) of erythrocyte-binding protein (EBA)-175 and a DBL domain in Plasmodium knowlesi (Pk)α-DBL were solved [15,16]. Using the crystal structure of EBA-175 F1 (Protein Data Bank code 1ZRO) as template, a three-dimensional model of DBL3X VAR2CSA was constructed by comparative modeling on the basis of the 3D7 sequence (Figure 1 Sequence Variation within the var2csa Family Is Present as Small Hypervariable Blocks and Parasite Diversity Is Similar on a Local and Global Scale Var2csa is a relatively conserved var gene carried by all P. falciparum genomes; however, sequence polymorphisms are present in the gene. In a previous study, it was established that var2csa is transcribed at high levels by placental parasites isolated at delivery from Senegalese women [8,20]. Using cDNA from 24 placentas from that study, the region encoding DBL3X of var2csa was cloned and sequenced. A multiple alignment of 43 sequences showed that the average nucleotide diversity was low (π = 8.48% ± 0.37%, see Materials and Methods for details) reflecting a limited inter-parasite diversity in these isolates collected from a geographically small and well-defined area of West Africa. To test whether the Senegalese placental DBL3X sequences represented a monophyletic group compared to non-Senegalese DBL3X sequences, a phylogenetic tree, which included VAR2CSA DBL3X sequences available in GenBank, was constructed (Figure 2
VAR2CSA Sequence Motifs Can Be Linked to the Parity of the Infected Women We have previously shown that the expression of certain var genes is associated with severe malaria in young children [22] and suggested that var gene expression is hierarchically structured. This could occur because the progeny of parasites expressing var gene products that mediate the most efficient sequestration outgrow the progeny of parasites expressing a molecule mediating less efficient binding [22–24]. A similar process could shape the expression of molecules mediating binding in the placenta. If that was the case, it would be predicted that a systematic difference between molecules mediating binding in primigravidae and multigravidae women could be detected in areas of high P. falciparum transmission where women are exposed to several parasite clones during pregnancy [25]. This was addressed by calculating the Kullback-Leibler distance between 21 VAR2CSA sequences originating from primigravidae and 21 sequences from multigravidae. The overall cumulated Kullback-Leibler distance (DKL) between sequences from the primigravidae and the multigravidae was higher than for randomly chosen groupings of the sequences (p = 0.0075). The DKL was also calculated for each position in the alignment to determine which polymorphisms contributed to the difference between the sequence of parasites from primi- and multigravidae women and visualized in a Kullback-Leibler sequence logo (Figure 4
VAR2CSA Is Under both Positive and Purifying Selection A recent study has suggested that sequence polymorphisms in a region of VAR2CSA upstream to DBL3X largely are due to positive natural selection pressure [13]. To further investigate the nature of sequence diversity in var2csa, the dN/dS ratios (dN, rate of non-synonymous mutations per non-synonymous site and dS, the rate of synonymous mutations per synonymous site) for DBL3X were calculated on the basis of the sequenced DBL3X domains. A dN/dS <1 indicates that the position is under negative or purifying selection pressure leading to conservation of the residue, while dN/dS >1 suggests positive or diversifying selection pressure, and suggests that amino acid changes are evolutionarily advantageous at the position. Purifying selection was mainly found in the conserved regions C1–4 and it was especially pronounced in the C2 and C4 regions, although single sites were observed to be under diversifying selection (Figure 5
Recombination Is a Factor in the Generation of var2csa Sequence Variation Previous studies have reported that frequent recombination events generate sequence diversity in the PfEMP1 family [26–28]. To determine the role of recombination for DBL3X sequence variation, we estimated the population recombination rate ρ defined for partially inbreeding haploid species by the compound parameter 2Ner(1− f), where Ne represents the effective population size, r is the per-generation cross-over recombination rate per base pair (bp), and f is the inbreeding coefficient. Variations in ρ across DBL3X correspond to variations in the recombination rate r, as Ne and f are constant for the dataset. Two recombinational hotspots were observed at bp positions 138–178 and 704–730 (Figure 5 VAR2CSA DBL3X B Cell Epitope Prediction VAR2CSA epitopes exposed on the surface of the protein and accessible for immunoglobulin gamma (IgG) binding could be under diversifying selection resulting in escape mutations and high dN/dS ratios, whereas residues involved in protein folding, stability, and anchoring could be under purifying selection with low dN/dS ratios. To predict linear B cell epitopes, the 3D7 sequence was submitted to the BepiPred server [29] and seven epitopes were predicted within the DBL3X sequence (Figure 5 Fine Epitope Mapping of VAR2CSA Antibodies Acquired during Pregnancy To verify the above bioinformatical predictions, we evaluated the fine specificity of naturally acquired human antibodies to VAR2CSA DBL3X in a peptide array consisting of 442 overlapping 31mer peptides covering exon 1 of VAR2CSA of the 3D7 sequence. Antibody reactivity to individual amino acids was assigned on the basis of an algorithm based on the observation that a major part of antibody binding motifs in a set of conformational epitopes are from two to seven amino acids long, containing either two or three defined residues spaced by undefined amino acids [31]. The VAR2CSA of 3D7 contains 31,149 such motifs and each was assigned an average PepScan value by adding the measured reactivity from the 31mers in which the motif was present, and dividing with the number of times the motif occurred. The method, described in Materials and Methods, was validated by testing serum from rabbits immunized with a VAR2CSA construct, which showed that both the measurements based on individual peptide readings and the algorithm described above, mapped antibody responses to regions present in the antigen used for immunization (Figure S2). Furthermore, there was a high concurrence between antibody peaks defined by the reactivity of the individual peptides and the peaks defined by the algorithm defining single amino acid scores (Figure S2). Plasma from individuals not exposed to malaria did not react with any of the peptides in the array (unpublished data). The IgG reactivity in the plasma of eight Ghanaian women with a known history of a placental malaria infection was analyzed and the peptide array data was visualized on the DBL3X model (Figure 6
PepScan Analysis of Affinity-Purified Antibodies During infection, VAR2CSA will be degraded and antibodies will be acquired against epitopes that are not accessible for antibodies when the protein is in its natural conformation. It is therefore possible that some of the antibody reactivities measured in the peptide array were directed against such epitopes. To address this question, analysis was performed on plasma, which had been affinity purified on recombinant DBL3X or antibody-depleted by incubation with erythrocytes infected with VAR2CSA-expressing parasites. Plasma from a rabbit immunized with recombinant DBL3X and plasma from women who had suffered a placental infection were affinity purified on the recombinant CSA-binding DBL3X protein and analyzed by the peptide array. Before affinity purification, the plasma pool from Ghanaian women showed reactivity corresponding to eight peaks distributed throughout the domain (Figure 7
When exchanging the F1 domain of one of the EBA-175 molecules in the EBA-175 dimer with VAR2CSA DBL3X and mapping SE1–SE3 to the model, it is apparent that the surface-exposed regions are on the opposite side of the central cavity of the dimer (Figure 7 Our results demonstrate that both conserved and polymorphic surface-exposed regions are targets for VAR2CSA DBL3X antibodies acquired during pregnancy by malaria-infected women. This opens for vaccine, strategies similar to those being pursued for the polymorphic merozoite surface protein (MSP) 1 [32]. The proteolytic processing of MSP1 is a prerequisite for successful parasite invasion of erythrocytes, and one vaccine strategy is based on the induction of antibodies against the conserved C-terminal part of the molecule that inhibits processing [33]. Another MSP1 vaccine strategy employs chimeric vaccine constructs designed to induce antibodies targeting the polymorphic types present in the N-terminal part of the molecule [34]. In a similar fashion, VAR2CSA vaccine constructs could target conserved epitopes like those identified in SE3, or alternatively, constructs should induce antibodies targeting different serological variants like those predicted to be generated by the sequence polymorphisms present in regions SE1 and SE2. The human antibody pools used in this study to identify surface-exposed antigenic targets inhibit parasite binding to CSA in vitro (unpublished data). However, the molecular targets of the inhibitory antibodies have not been identified and knowledge is required about the antigenic targets for antibodies on the other VAR2CSA DBL domains. The high similarity between the DBL structures of EBA-175 [15], Pkα-DBL [16], and the VAR2CSA DBL3X model suggests that the DBL structures in Plasmodium are relatively conserved and that the antigenic characteristics of the DBL3X might be comparable to those of the remaining VAR2CSA DBL domains. However, a more comprehensive analysis of sequence variation, antibody epitopes, and structure of the VAR2CSA DBL domains not belonging to DBL3X is needed to establish the extent of the structural conservation between the domains. Development of PAM vaccines requires a much better understanding about the molecular interaction between placental parasites and the ligand on the syncytiotrophoblasts, as well as knowledge about the fine specificity of the targets for antibodies inhibiting binding. Native PfEMP1 molecules are difficult to isolate and with current technologies it is difficult to produce correctly folded recombinant material in quantities allowing structure elucidation by crystallography. In this paper, we have combined in silico methods such as model building and sequence analysis and the analysis of antibody reactivity to obtain new information and generate hypotheses about the structure and functional relationship of VAR2CSA. Materials and Methods Cloning and expression of VAR2CSA domains. DBL3X and DBL4 of VAR2CSA were amplified from FCR3 and 3D7 genomic DNA with the following primers: FCR3 DBL3X – 5′ CG GAA TTC ACC AAT ATT AAT AAA AGT GAA and 3′ ATT TGC GGC CGC CAG CAT TAT TAT ATT TGT A, 3D7 DBL3X – 5′CG GAA TTC AAG ATG AAG TCC TCC GAG and 3′ATT TGC GGC CGC CAA AAC AGC CAA GCT GGA, 3D7 DBL4 - 5′CG GAA TTC CAG GTG AAG TAC TAC GAA and 3′CTG TTC CTC CAC GTG CTC CAG. PCR products were digested with EcoRI and NotI for cloning into the Baculovirus vector, pAcGP67-A (BD Biosciences, http://www.bdbiosciences.com), which was modified to contain the V5 epitope upstream of a histidine tag in the C-terminal end of the constructs. Linearised Bakpak6 Baculovirus DNA (BD Biosciences) was co-transfected with pAcGP67-A into Sf9 insect cells for generation of recombinant virus particles. Recombinant protein was purified on Co2+ metal-chelate agarose columns as secreted histidine-tagged proteins from the supernatant of infected High-Five insect cells.CSA binding assay. Binding to CSA (C9819, Sigma-Aldrich, http://www.sigmaaldrich.com) was determined in an ELISA system. ELISA plates (Falcon 351172) were coated overnight with CSA (50 μg/ml) in PBS at 4 °C. Coating with 1% BSA in PBS (blocking buffer) was used as negative control. Plates were incubated with blocking buffer for 1 h at room temperature (RT) to inhibit non-specific adsorption to the plate. The VAR2CSA proteins were diluted in blocking buffer (1–10 μg/ml protein), added to the wells, and incubated for 1 h at RT. For the inhibition assays, proteins (7 μg/ml) were pre-incubated with different concentrations of soluble CSA for 30 min. Plates were washed four times in PBS between the different steps. Specific binding was visualized by adding an HRP-conjugated antibody (R960–25, Invitrogen, http://www.invitrogen.com) targeting the V5 epitope of the constructs. Plates were incubated for 1 h with the anti-V5 antibody diluted 1:3000 in blocking buffer. The color reactions were developed for 15 min by the addition of o-phenylenediamine substrate and stopped by adding 2.5 M H2SO4. The optical density was measured at 490 nm. Cloning and sequencing of placental var2csa genes. All DBL3X sequences were obtained from cDNA, whereas DBL2X and the overlapping region of DBL4 and DBL5 of var2csa were cloned from genomic DNA of placental parasites. In brief, parasites were dissolved in Trizol LS (Invitrogen) and RNA was prepared according to the manufacturer's instruction. RNA pellets were dissolved in 10 μl of RNase-free water and treated with DNaseI (Sigma-Aldrich) for 25 min at RT, followed by 10 min heat inactivation at 65 °C. All RNA samples were subsequently tested in real-time PCR for contamination with genomic DNA using a primer set for the housekeeping gene, seryl-tRNA synthetase. DNA-free RNA samples were used for synthesis of cDNA by reverse transcriptase (Superscript II, Invitrogen) and random hexamer primers as described by the manufacturer. Following primer sets were used for cloning DBL3X from cDNA into either the Baculovirus vector, pAcGP67-A (BD Biosciences) or the pCR2.1-TOPO vector (Invitrogen): p509 5′ CG GAA TTC GAT ACA AAT GGT GCC TGT and p510 3′ ATT TGC GGC CGC ATA TAC TGC TAT AAT CTC C, p508 5′ CG GAA TTC ACA CAA AAT TTA TGT GTT and p510, p503 5′ GAG ATA CAA ATG GTG CCT GT and p505 3′ AAA TTT GCT GAT ATA CAT TCA G. PCR products aimed for the Baculovirus vector were digested with EcoRI and NotI before ligation. Three to six colonies of each cloning and corresponding plasmids were sequenced by Macrogen (http://www.macrogen.com).Genomic DNA was extracted from filter paper using a chelex-based method [35]. Briefly, filter spots were dissolved in 0.5% saponin in PBS using a microtiter plate and incubated overnight on a shaker at RT. After washing the filter spots twice in PBS, a solution of 50 μl of H2O and 100 μl of 10% chelex mixture was added to each well. The plate was boiled for 8 min and subsequently cooled down for 10 min at RT. A PCR reaction was run with primers for the seryl-tRNA synthetase gene to control for the DNA content. Around 1–3 μl of DNA was used for the PCR reactions amplifying the different var2csa regions. All PCR products were cloned into the pCR2.1-TOPO vector and the inserts sequenced on a 3100-Avant Genetic Analyzer (Applied Biosystems, http://www.appliedbiosystems.com). The origin of parasites are described in [20]. Phylogenetic reconstruction. The alignment of 43 placental and four database VAR2CSA DBL3X sequences, covering the 3D7 amino acid positions 1256–1549, was constructed using the software RevTrans [36], and subsequently manually corrected for errors. To cover more of the DBL3X domain, an alignment of 17 database sequences was constructed in the same way, covering the 3D7 positions 1217–1255. For the phylogenetic tree, 21 Malawian sequences [21] were aligned with the sequences mentioned above, again using RevTrans and manual correction, resulting in a 609-bp alignment. The program MrModeltest version 2.2 [37] was used to find the most appropriate nucleotide substitution model based on the Akaike information criterion [38]. Phylogenetic trees based on the above alignments were constructed by Bayesian inference using the program MrBayes version 3.1.1 [39]. In all cases Markov chain Monte Carlo (MCMC) sampling was performed for 10,000,000 generations with eight chains. Convergence was confirmed by comparing the results of two independent runs. Burn-in was determined using Tracer [40] and 50% majority rule consensus trees were constructed. Model fitting and Akaike weighted dN/dS average. The program codeml from the PAML package version 3.14 [41] was used to fit a range of codon-based evolutionary models to the VAR2CSA DBL3X region using the alignments and Bayesian trees mentioned above. The 11 codon-based models were tested using codeml: M0, M1a, M2a, M3 (with either 3, 4, 5, 6, or 7 site categories), M5, M7, and M8 [42–44]. All 11 models were fitted using the F3x4 (different nucleotide frequencies for each codon position) approach for estimating codon frequencies. Convergence was confirmed by comparing the results of several independent runs started with different parameter vectors. The Akaike information criterion (AIC) was used to assess model fits [38,45,46]. Briefly, AIC estimates the expected relative Kullback-Leibler distance (i.e., AIC is an estimate of the amount of information that is lost when a given model is used to approximate the full truth). AIC is a function of the maximized log-likelihood (lnL) and the number of estimated parameters (K) for a model. Specifically, AIC = −2lnL + 2K where lower AIC values indicate better models. From AIC it was possible to compute Akaike weights, which can be interpreted as the conditional probability of the model given the data and the set of initial models [38,46]. On this basis, dN/dS ratios for codon positions were calculated as an average of the dN/dS ratios estimated from each of the 11 models, weighted by the Akaike weights for the corresponding model. Recombination and mutation rates, diversity, and sequence logo creation. The population recombination rate ρ was estimated for the VAR2CSA DBL3X domain in LDhat version 2.0 [47], which based on population genetics uses coalescent methods specially adapted to account for the possibility of recurrent or back mutation and for an AT-rich genome such as that of P. falciparum [48]. As argued by Mu et al. [28], the coalescent recombination estimate can, for partially inbreeding haploid species such as P. falciparum, be interpreted as the compound parameter ρ = 2Ner(1 − f), where Ne represents the effective population size of the DBL3X population, r denotes the rate of recombination cross-over events per generation per bp, and f is the inbreeding coefficient. The effective population size should be thought of as the size of an ideal population [48,49] with the same magnitude of random genetic drift as the actual population with size N [50]. To test if the placental DBL3X sequence data showed evidence for deviation from the neutral model of evolution assumed by the coalescent method, we calculated Tajima's D statistic [51] and Fu and Li's D* and F* statistic [52]. All three statistics were insignificantly different from zero (p > 0.1 in all cases), indicating that the coalescent approach could be applied. The hypothesis of no recombination was rejected using the likelihood permutation test [48] with 1,000 permutations of segregating sites, of which none produced a higher maximum composite likelihood than for the DBL3X data. The hypothesis of a constant recombination rate across the analyzed region was also rejected (p = 0.048 with 10,000 simulations) using the method described in [47], indicating significance in the recombination rate variations over DBL3X. For calculation of the population recombination rate ρ, we used the Bayesian reversible-jump Markov chain Monte Carlo (RJMCMC) method with a block penalty of 10, running for 10,000,000 iterations with 2,000 iterations per sample and a burn in of 50 samples. The overall region estimate was converted to bp units using the average length of the analyzed sequences. Recombination hotspots were defined as intervals containing SNPs where the population recombination rate mean was above the upper limit of the 95% confidence interval for the overall region estimate of 2Ner(1 − f). Fu and Li's D* and F* and Tajima's D were calculated using DnaSP version 4.10.6 [53]. The average nucleotide diversity and its variance were calculated according to Nei [54] equation 10.5 and 10.7, respectively (gapped columns were included). The Kullback-Leibler sequence logo was created by calculating the distance between the two groups of sequences for each amino acid position in the alignment using the symmetric Kullback-Leibler distance:
where p and p′ are the frequencies of an amino acid type in each of the two groups, and AA indicates that the sum is over all the amino acid types. The cumulated Kullback-Leibler distance was calculated as the sum of Dkl for all positions in the alignment. Gaps in the alignment were in this analysis assigned the letter “O” and treated as an amino acid class. To test if the mentioned grouping according to parity gave two significantly different sequence groups, we created 10,000 random groupings and for each of these summed the DKL over all amino acid positions. Similarly for the individual positions in the logo, the distribution of DKL for 10,000 random shuffles of sequence grouping was noted specifically for each position, and the p-values were based on these distributions. PepScan motif analysis. 442 overlapping 31mer peptides covering the exon 1 of 3D7 VAR2CSA were synthesized as solid phase peptide synthesis (SPPS) with a stepwise addition of the different amino acids attached to a solid resin. The long peptides were synthesized with a cysteine at amino acid (aa) position 15 allowing some secondary structure. This approach allows identification of antigenic sites that cannot be mapped using short, linear peptides (PepScan Systems, the Netherlands). The raw data from a PepScan experiment consists of figures measuring the amount of IgG bound to each of the overlapping 31mer peptides. We used the overlap in primary sequence to determine more specifically what the antibodies have affinity for. The motif analysis is based on the concept that polyclonal IgG response consists of subpopulations of monoclonal antibodies each binding a certain set of amino acid sequence motifs. We then made a list of all possible binding motifs and transferred the information from the peptide array to these, giving each motif a score indicating the IgG affinity for the motif. The PepScan assay is designed primarily to determine linear epitopes, and thus we are mainly interested in short binding motifs and gaps. On this basis we performed the PepScan motif analysis, using motifs containing either two or three defined residues spaced by undefined amino acids up to a certain maximal length of 5, 7, 10, 15, 20, or 25 residues. We found that the different maximal motif lengths gave similar results, but with different detail resolutions (long motifs had a smoothing effect), and a binding motif with a maximal length of seven residues was selected as being most informative. Thus, the presented results are based on the assumption that motifs are two to seven amino acids long, and contain either two or three defined residues spaced by undefined amino acids, e.g., a possible motif could be “WXXXDXE” or simply “KN.” The method was validated by comparison to the raw data, where rabbit serum from a rabbit immunized with a DBL5 construct was used in the PepScan assay (Figure S2). The figure shows that the motif analysis produces peaks approximately at the same positions as in the raw data, and that the analysis does not introduce bias in the other regions of the protein. As control for the human IgG used in Figure 6 Affinity purification of antibodies and depletion of serum on parasites. Affinity purification of antibodies was done according to manufacturer's instructions. In brief, 1 mg of recombinant protein was dialyzed against 0.2 M NaHCO3, 0.5 M NaCl (pH 8.3), and applied to a NHS-activated HiTrap 1-ml column (GE Healthcare, http://www.gehealthcare.com) that had been equilibrated with 3 × 2 ml 4 °C 1 mM HCl. After coupling, the columns were washed with 0.5 M ethanolamine, 0.5 M NaCl (pH 8), 0.1 M acetate, 0.5 M NaCl (pH 4), and a final wash with PBS (pH 7.4). 1 ml of a plasma pool (28 women from Ghana) was then applied to the column. After washing in 10 ml PBS, affinity-bound antibodies were eluted by CH3COONH4 (pH 3) and neutralized in 1 M Tris (pH 7.5). The specificity of the purified antibodies was tested in ELISA against (1) the domain used for affinity purification, (2) other VAR2CSA domains, and (3) glutamine rich protein (GLURP) [55]. Affinity-purified antibodies used for PepScan analysis were all negative in ELISA against control proteins and positive against the homologous domain (unpublished data). Surface reactive antibodies in a pool of pregnancy plasma (from 15 pregnant women from Korogwe, Tanzania) were depleted using a parasite line selected for VAR2CSA expression using VAR2CSA specific antibodies [9]. In brief, 40 μl of the plasma pool were incubated with 2.0 × 108 MACS purified intact late stage trophozoite- and schizont-infected erythrocytes for 20 min at 4 °C. Hereafter, the cells were centrifuged at 800 g for 8 min, and the supernatant used to suspend a new pellet of 2.0 × 108 infected erythrocytes. This procedure was repeated four times. The depletion of surface reactive antibodies from the plasma pool was confirmed using a flow cytometry assay [9] after the final depletion. 3-D modeling of the 3D7 DBL3X domain. The 3-D structure of the 3D7 sequence (PFL0030c aa 1217–1559) was modeled using the HHpred server with default settings [56]. Briefly, the HHpred method is specialized in remote homology detection using hidden Markow models (HMMs) built from PSI-BLAST profiles and secondary structure. The crystal structure of EBA-175 F1 (Protein Data Bank code 1ZRO chain A, [15]) was used as template and had the highest sequence and secondary structure alignment scores. The HHpred alignment was corrected in a short template loop sequence (Figure S1, positions 215–219) positioned next to a gap. The correction shifted the position of the gap and allowed for the modeling of a disulfide bridge in DBL3X, which was conserved in the EBA-175 F1, F2, and Pkα-DBL domains. HHpred HMMs for DBL3X and the template continued to match. Finally the corrected alignment was used to generate a 3-D model using MODELLER [57] with the protocol setup in the HHpred server toolkit. A superimposition of the EBA-175 F1 structure and the DBL3X model was obtained by the HHpred toolkit. Naccess version 2.1.1 [58] was used to calculate relative surface-exposed areas (RSAs) in single chains of EBA-175 F1, F2, and the Pkα-DBL domain [16]. The MAMMOTH-mult alignment server [59] was used to make a multiple structure superimposition of DBL3X model on the EBA-175 F1 and F2 DBL domains [15] and the Pkα-DBL domain [16]. The resulting alignment was inspected to identify conserved positions of cysteines and buried hydrophobic residues (RSA < 30%). Structural visualizations were made using PyMol [60]. Figure S1: Multiple Structural Alignment of VAR2CSA DBL3X, EBA-175 F1, EBA-175 F2, and Pkα-DBL Cysteines in conserved disulfide bonds are highlighted in blue, conserved tryptophans buried in the structures are shown in red, and other conserved, buried, and hydrophobic positions are shown in green. (51 KB PTT) Click here for additional data file.(51K, ppt) Figure S2: PepScan Analysis of Rabbit Serum Immunized with a DBL5 Recombinant Construct (A) Plasma from a rabbit immunized with a DBL5 construct was tested in 1:1000 on the VAR2CSA PepScan array. The upper diagram is the raw PepScan values with the PepScan score (y-axis) for each of the 420 peptides (x-axis). (B) The same PepScan scores as (A) but calculated using a motif algorithm assigning a value to each amino acid. (160 KB PPT) Click here for additional data file.(161K, ppt) Accession Numbers All sequence data are available at GenBank (http://www.ncbi.nlm.nih.gov/Genbank) under the accession numbers DQ995590–DQ995632. Acknowledgments We acknowledge the women who donated serum and parasites for this study. We thank Gilean McVean for advice regarding recombination analysis, Kristoffer Rapacki for help with logo generation, and Anne Corfitz for excellent technical assistance. PepScan systems (the Netherlands) are thanked for technical assistance on the data work on the serum PepScan data. Abbreviations
Footnotes Competing interests. The authors have declared that no competing interests exist. Author contributions. MD, TSR, AGP, TGT, and AS conceived and designed the experiments. MD, TSR, MAN, MR, and AS performed the experiments. MD, TSR, PHA, OL, TGT, and AS analyzed the data. MD, PHA, NTN, LT, PD, and LH contributed reagents/materials/analysis tools. MD, TSR, PHA, and AS wrote the paper. Funding. This study was primarily funded by the European Malaria Vaccine Initiative grant 0012–2004. This study also received financial support from the Danish Medical Research Council (SSVF) (grants 22-02-0220 and 22-03-0333), the Danish Research Council for Development Research (RUF) (grant 104.Dan.8.L.306 and 8.L.306), and the Danish National Research Foundation (Danish Platform for Integrative Biology). AS is supported by a postdoctoral grant from SSVF. MD is supported by a PhD studentship from RUF. MAN is supported by a postdoctoral grant from Hovedstadens Sygehusfaellesskab. NTN is supported by a postdoctoral grant from the Fondation Recherche Médicale. References
|
PubMed related articles
Your browsing activity is empty. Activity recording is turned off. |
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Bull World Health Organ. 1983; 61(6):1005-16.
[Bull World Health Organ. 1983]Am J Pathol. 1982 Dec; 109(3):330-42.
[Am J Pathol. 1982]Trans R Soc Trop Med Hyg. 1979; 73(4):427-31.
[Trans R Soc Trop Med Hyg. 1979]Trans R Soc Trop Med Hyg. 1983; 77(2):232-44.
[Trans R Soc Trop Med Hyg. 1983]Science. 1996 Jun 7; 272(5267):1502-4.
[Science. 1996]J Infect Dis. 2005 Mar 15; 191(6):1010-3.
[J Infect Dis. 2005]Cell. 2005 Jul 29; 122(2):183-93.
[Cell. 2005]Nature. 2006 Feb 9; 439(7077):741-4.
[Nature. 2006]Mol Biochem Parasitol. 2006 Jul; 148(1):1-9.
[Mol Biochem Parasitol. 2006]Nature. 1993 Apr 15; 362(6421):603-9.
[Nature. 1993]Mol Biochem Parasitol. 1996 Oct 30; 81(2):119-26.
[Mol Biochem Parasitol. 1996]J Infect Dis. 2005 Jul 15; 192(2):331-5.
[J Infect Dis. 2005]J Infect Dis. 2004 Dec 1; 190(11):2001-9.
[J Infect Dis. 2004]Infect Immun. 2006 Aug; 74(8):4875-83.
[Infect Immun. 2006]J Exp Med. 2004 May 3; 199(9):1179-90.
[J Exp Med. 2004]Malar J. 2005 Apr 27; 4(1):21.
[Malar J. 2005]Microbes Infect. 2006 Jun; 8(7):1773-82.
[Microbes Infect. 2006]Mol Biochem Parasitol. 2006 Aug; 148(2):169-80.
[Mol Biochem Parasitol. 2006]Cell. 2005 Jul 29; 122(2):183-93.
[Cell. 2005]Mol Biochem Parasitol. 2000 Oct; 110(2):391-7.
[Mol Biochem Parasitol. 2000]Mol Biochem Parasitol. 2006 Aug; 148(2):169-80.
[Mol Biochem Parasitol. 2006]Immunome Res. 2006 Apr 24; 2():2.
[Immunome Res. 2006]J Mol Evol. 2003 Aug; 57(2):212-21.
[J Mol Evol. 2003]Parassitologia. 1999 Sep; 41(1-3):409-14.
[Parassitologia. 1999]J Exp Med. 1994 Jul 1; 180(1):389-93.
[J Exp Med. 1994]Infect Immun. 2005 Sep; 73(9):5928-35.
[Infect Immun. 2005]Cell. 2005 Jul 29; 122(2):183-93.
[Cell. 2005]Nature. 2006 Feb 9; 439(7077):741-4.
[Nature. 2006]Antimicrob Agents Chemother. 2003 Apr; 47(4):1347-54.
[Antimicrob Agents Chemother. 2003]J Infect Dis. 2004 Dec 1; 190(11):2001-9.
[J Infect Dis. 2004]Nucleic Acids Res. 2003 Jul 1; 31(13):3537-9.
[Nucleic Acids Res. 2003]Infect Immun. 2006 Aug; 74(8):4875-83.
[Infect Immun. 2006]Syst Biol. 2004 Oct; 53(5):793-808.
[Syst Biol. 2004]Bioinformatics. 2003 Aug 12; 19(12):1572-4.
[Bioinformatics. 2003]Comput Appl Biosci. 1997 Oct; 13(5):555-6.
[Comput Appl Biosci. 1997]Mol Biol Evol. 1994 Sep; 11(5):725-36.
[Mol Biol Evol. 1994]Mol Biol Evol. 1998 Dec; 15(12):1600-11.
[Mol Biol Evol. 1998]Syst Biol. 2004 Oct; 53(5):793-808.
[Syst Biol. 2004]Science. 2004 Apr 23; 304(5670):581-4.
[Science. 2004]Genetics. 2002 Mar; 160(3):1231-41.
[Genetics. 2002]Genetics. 2001 Dec; 159(4):1805-17.
[Genetics. 2001]Genetics. 1989 Nov; 123(3):585-95.
[Genetics. 1989]Genetics. 1993 Mar; 133(3):693-709.
[Genetics. 1993]Genetics. 2002 Mar; 160(3):1231-41.
[Genetics. 2002]Science. 2004 Apr 23; 304(5670):581-4.
[Science. 2004]Bioinformatics. 2003 Dec 12; 19(18):2496-7.
[Bioinformatics. 2003]J Infect Dis. 2000 Mar; 181(3):1202-5.
[J Infect Dis. 2000]J Exp Med. 2004 Nov 1; 200(9):1197-203.
[J Exp Med. 2004]Nucleic Acids Res. 2005 Jul 1; 33(Web Server issue):W244-8.
[Nucleic Acids Res. 2005]Cell. 2005 Jul 29; 122(2):183-93.
[Cell. 2005]Proteins. 1995 Nov; 23(3):318-26.
[Proteins. 1995]Nature. 2006 Feb 9; 439(7077):741-4.
[Nature. 2006]Bioinformatics. 2005 Aug 1; 21(15):3255-63.
[Bioinformatics. 2005]