# PCE: web tools to compute protein continuum electrostatics

^{1}INSERM U726, EBGM, University Paris 7, France

^{*}To whom correspondence should be addressed at INSERM U648, Faculté des Sciences Pharmaceutiques et Biologiques, 4 avenue de l'Observatoire, 75006 Paris, France. Tel: +33 1 53 73 95 73; Fax: +33 1 53 73 95 74; Email: rf.5sirap-vinu@avetim.airam

The online version of this article has been published under an open access model. Users are entitled to use, reproduce, disseminate, or display the open access version of this article for non-commercial purposes provided that: the original authorship is properly and fully attributed; the Journal and Oxford University Press are attributed as the original place of publication with the correct citation details given; if an article is subsequently reproduced or disseminated not in its entirety but only in part or as a derivative work this must be clearly indicated. For commercial re-use, please contact gro.slanruojpuo@snoissimrep.slanruoj

## Abstract

PCE (protein continuum electrostatics) is an online service for protein electrostatic computations presently based on the MEAD (macroscopic electrostatics with atomic detail) package initially developed by D. Bashford [(2004) *Front Biosci*., 9, 1082–1099]. This computer method uses a macroscopic electrostatic model for the calculation of protein electrostatic properties, such as p*K*_{a} values of titratable groups and electrostatic potentials. The MEAD package generates electrostatic energies via finite difference solution to the Poisson–Boltzmann equation. Users submit a PDB file and PCE returns potentials and p*K*_{a} values as well as color (static or animated) figures displaying electrostatic potentials mapped on the molecular surface. This service is intended to facilitate electrostatics analyses of proteins and thereby broaden the accessibility to continuum electrostatics to the biological community. PCE can be accessed at http://bioserv.rpbs.jussieu.fr/PCE.

## INTRODUCTION

Electrostatic interactions play a central role in protein structure and function (1–3). However, in spite of their importance, it is still difficult to treat electrostatic properties theoretically. One reason is that Coulomb's law is valid for charges immersed in an infinite medium of a uniform dielectric (4). Yet, many reactions take place at the interface between regions having different dielectric values. Analytical solution of the Poisson–Boltzmann equation can resolve this problem only in several particular cases. Thus, the numerical solution of the Poisson–Boltzmann equation, like the Finite Difference Poisson–Boltzmann method (FDPB) (5), appears to be a good model for describing macromolecules in water solutions (6,7). Some of these methods are implemented at: http://agave.wustl.edu/pdb2pqr/ (8), http://agave.wustl.edu/apbs/ (9) and http://honiglab.cpmc.columbia.edu/ (10).

The basic electrostatic property that can be calculated via PCE (protein continuum electrostatics) is the electrostatic potential, yet continuum electrostatic models have also proven to be useful for the calculation of p*K*_{a} in proteins. We have implemented on the PCE server (http://bioserv.rpbs.jussieu.fr/PCE) the MEAD (macroscopic electrostatics with atomic detail) package based on the FDPB method for the computation of electrostatic potential values and pH titration behavior of titratable groups in proteins (11). In this approach, the protein is modeled as a low dielectric material (dielectric usually between 2.0 and 4.0) with embedded partial charges surrounded by a high dielectric medium (the solvent is regarded as a continuum with a dielectric value around 80.0), the atomic details of the protein structure are used to define the boundary and charge placement and the FDPB method is used to solve the resulting partial differential equations for the electrostatic potential.

## METHODS AND IMPLEMENTATION

The PCE web service is driven by a modular, Python-written collection of routines, which provides non-interactive, high-throughput usage of the package MEAD. MEAD is based on the FDPB method to compute electrostatic potential values and pH titration behavior of titratable groups in proteins (11).

In the MEAD package, the electrostatic interactions are calculated with a continuum dielectric model where the protein and the water phase are represented as homogenous materials with a low (by default ε_{in} = 4) and high (by default ε_{out} = 80) dielectric constant ε, respectively. The dielectric boundary between protein and solvent is the molecular surface, which depends on the atomic radii and the radius of a solvent-sized spherical probe. The Poisson–Boltzmann equation is solved by means of a finite difference algorithm (5,11–13). The user can use the default values for ε or, for example, use ε_{in} = ε_{out} (in this case the Poisson–Boltzmann equation will be resolved in a homogenous dielectric medium) or evaluate the effect of changing ε_{in} from 2 to 10 or 20 to simulate protein flexibility (14).

### Electrostatic potentials calculations

In our implementation, the electrostatic potentials are computed by the program ‘potential’ of MEAD starting from the coordinate file. Our implementation adds hydrogen atoms (15) and assigns the Parse parameters (radii and charges) (16). In addition, it is possible to upload PQR files instead of PDB files, such format parameterized files can be obtained, e.g. at http://agave.wustl.edu/pdb2pqr/ (8) (at this site, users can also perform hydrogen-bond network optimization). The finite difference computation uses a cubic box with 1 Å lattice spacing with number of grid points depending on the protein size and centered on the protein.

### Calculations of p*K*_{a}s of titratable groups

The overall methodology for calculating p*K*_{a}, defined as the pH for which the average protonation of the group is 1/2, has been described elsewhere (12). This approach implies the calculation of the intrinsic p*K*_{a} values of all titratable groups (defined as the p*K*_{a} of one titratable group if all other titratable groups in the protein were held in their neutral form) as well as the pair-wise electrostatic interactions between the titratable groups:

- The intrinsic p
*K*_{a}is calculated by p*K*_{int}= p*K*_{mod}− (ΔΔ*G*_{Born}+ ΔΔ*G*_{back})/2.3*RT*, where the p*K*_{mod}is the p*K*_{a}of model compound of amino acid in water determined experimentally. In the MEAD implementation, the p*K*_{mod}values are as follows: Asp, 4.0; Glu, 4.4; His, 6.6 for the Nε2 or 7.0 for Nδ1; Tyr, 9.6; Lys, 10.4; Arg, 12.0; C-terminus, 4.0; and N-terminus, 7.5. The correction ΔΔ*G*_{Born}takes into account the Born self-energy owing to the transfer of the amino acid group from water (high dielectric medium) to the protein (low dielectric medium). The correction ΔΔ*G*_{back}is the background interaction term owing to the interaction of the titratable group with non-titratable (permanent) charges in the protein. In our implementation, all permanent partial charges on all non-titratable groups are taken into consideration. - The pair-wise charge interactions between the titratable groups can be represented as a matrix,
*W*, where_{ij}*i*and*j*label the interacting groups. - As a result, the p
*K*_{a}of one titratable group*i*is represented by $\text{p}{K}_{{\text{a}}_{i}}=\text{p}{K}_{\text{int},\text{i}}+{\scriptstyle \frac{1}{2}}\sum {q}_{i}{q}_{j}{W}_{ij}$, where*q*and_{i}*q*are the pH-dependent charges of the titratable groups_{j}*i*and*j*, respectively. All electrostatic energies (*W*, ΔΔ_{ij}*G*_{Born}and ΔΔ*G*_{back}) can be calculated by solving the Poisson–Boltzmann equation for the electrostatic potential generated by the protonated and deprotonated forms of each titratable group placed in protein and in water. For these computations, our implementation applies the MEAD program ‘multiflex’ to solve the electrostatic potentials by the FDPB method using two successive lattices with increasing grid resolutions, 1 and 0.25 Å, respectively. - As seen in (ii), the charges
*q*and_{i}*q*of the titratable groups at any given pH have to be additionally computed. Proteins can contain several tens to hundreds of titratable groups, thus leading to the problem of multiple site titration for the computation of p_{j}*K*_{a}values. The exact calculation of a multiple site titration curve, given the intrinsic p*K*_{a}s and the pair-wise charge interactions, could be carried out at any given pH by a Boltzmann-weighted average over all of the possible protonation states of the protein (12). However, such calculations grow exponentially with the number of titratable groups (2combinations for^{N}*N*titratable groups). In the MEAD package, this problem is overcome by introducing the Reduced Site approximation method (17). This approach assumes that a titratable group can be considered as non-titratable at a pH far away from its p*K*_{mod}. In this case, this group can be assumed to be non-titratable for this pH, and the number of possible combinations to compute is reduced to 2^{N}^{−1}. Thus, our service proposes at present p*K*_{a}calculations for <50 titratable groups. The computed titration curves, in our implementation, give a pH value where the protonation state of the titratable groups is 0.5, which is assigned to be the p*K*_{1/2}and can be interpreted as the p*K*_{a}of this group.

## IMPLEMENTATION

Potential and p*K*_{a} calculations are presented as two distinct services. Both of them start from a coordinate file in the PDB Format (18). Users can either specify a PDB identifier or upload their own file (with or without hydrogens). The protein internal dielectric, solvent dielectric as well as ionic strength values can be adjusted interactively.

For the electrostatic potential calculations, it is in addition possible to specify coordinates at which the value of the potentials will be returned. Finally, the electrostatic potential values can then be used to color-code the molecular surface of the selected protein via DINO (Visualizing Structural Biology, http://www.dino3d.org). Some options allow to adjust the color gradients for positive and negative potential values or to specify a site in the protein that is of importance such that this residue becomes positioned at the center of the picture. Returned images are either static (four orthogonal views are proposed) or animated (rotation around the *y*-axis) in the GIF.

For the p*K*_{a} calculations, it is presently only possible to activate Cys and Tyr as titratable, Asp, Glu, Lys, Arg and His being always considered as titratable. This is likely to evolve in future versions of the interface but was presently retained so as to lower the risk of getting improper results.

## RESULTS

### Electrostatic potentials

The users can download as results, color figures (in different formats, different orientations and resolutions together with animated GIF files) of 3D electrostatic potentials distribution on the protein surface. The users can select the energy range for the color-coding as well as decide on which region to focus the figure. In addition, the PQR file can be obtained. If the users specify a set of *x*, *y*, *z* coordinates, the server can return electrostatic potential values at these points.

As an example, if the users select lysozyme (PDB code 7lyz) as input file, they can obtain figures as follows (Figure 1). Such computations can be used to compare electrostatic potentials for wild-type proteins and mutants, in such cases the users need to upload their mutant PDB files.

### p*K*_{a}s of titratable groups

The PCE server returns p*K*_{int}, p*K*_{1/2}, titration curves for all titratable groups and pair-wise charge interactions between titratable groups as well as the isoelectric point (pI). We have tested the implementation on several proteins taken from the PDB (18) and report the results for lysozyme (PDB entry 7lyz) and ribonuclease A (PDB entry 3rn3). Our computed p*K*_{a} values (ε_{in} = 4) are compared with p*K*_{a} values previously calculated or experimentally determined (Tables 1 and and2)2) (12,14). Small deviations between our data and the previously calculated p*K*_{1/2} are owing to the differences in some parameters in the finite difference calculations. Overall, a good correspondence between our p*K*_{1/2} and the experimental p*K*_{a} values can be seen, except for few titratable groups. This seems to be due, in part, to the fact that proteins should have considerable mobility for the solvent-exposed side chains (19). These fluctuations can be simulated by using a higher ε_{in} as proposed in (14) or by computing p*K*_{a} values on conformations selected from molecular dynamic trajectories (20). Other methods have been developed to account for protein flexibility during p*K*_{a} calculations (21–23).

## CONCLUSION AND FUTURE DIRECTIONS

PCE is presently based on MEAD and allows users to compute protein electrostatic potentials via a FDPB solver starting with a PDB file as query input. The electrostatic potential values can then be used to color-code the molecular surface of the molecules presently generated within DINO. Static images in different formats and with different resolutions, centered on specific residues, as well as animated GIF files can be downloaded. The PARSE parameters are used for the computations, but we plan to implement other forcefield parameters in a near future. Currently, non-protein or nucleic acid atoms are excluded from the calculations, but work is in progress to circumvent this limitation. Additional optimizations are in progress as we plan to define automatically the charge assignments depending on the calculated p*K*_{a} values and then compute electrostatic potentials taking into account such information.

Users can also compute p*K*_{int}, p*K*_{1/2}, titration curves for all titratable groups and pair-wise charge interactions between titratable groups as well as isoelectric point of the protein. At present, no more than 50 titratable sites can be considered. However, we are working on the implementation of a Monte Carlo method to allow p*K*_{a} computations for larger proteins.

## Acknowledgments

This work was supported by the Inserm Institute via a ‘poste-vert’ to M.M. and the Avenir grant to B.V. RPBS was supported by ‘Action Conjointe CNRS-INSERM, Bioinformatique 2002’. Funding to pay the Open Access publication charges for this article was provided by INSERM.

*Conflict of interest statement*. None declared.

## REFERENCES

*K*

_{a}values of ionizable groups in bacteriorhodopsin. J. Mol. Biol. 1992;224:473–486. [PubMed]

*K*

_{a}'s of ionizable groups in proteins: atomic detail from a continuum electrostatic model. Biochemistry. 1990;29:10219–10225. [PubMed]

*K*

_{a}s in proteins. Biochemistry. 1996;35:7819–7833. [PubMed]

^{1}H NMR spin–spin coupling constants. Biochemistry. 1991;30:986–996. [PubMed]

*K*(a)s in proteins. Biophys. J. 2002;83:1731–1748. [PMC free article] [PubMed]

*K*(a)s. Proteins. 2003;50:94–103. [PubMed]

**Oxford University Press**

## Formats:

- Article |
- PubReader |
- ePub (beta) |
- PDF (148K) |
- Citation

- Protein electrostatics: a review of the equations and methods used to model electrostatic equations in biomolecules--applications in biotechnology.[Biotechnol Annu Rev. 2003]
*Neves-Petersen MT, Petersen SB.**Biotechnol Annu Rev. 2003; 9:315-95.* - Structure-based pKa calculations using continuum electrostatics methods.[Curr Protoc Bioinformatics. 2007]
*Fitch CA, García-Moreno E B.**Curr Protoc Bioinformatics. 2007 Jan; Chapter 8:Unit 8.11.* - Patch Finder Plus (PFplus): a web server for extracting and displaying positive electrostatic patches on protein surfaces.[Nucleic Acids Res. 2007]
*Shazman S, Celniker G, Haber O, Glaser F, Mandel-Gutfreund Y.**Nucleic Acids Res. 2007 Jul; 35(Web Server issue):W526-30. Epub 2007 May 30.* - A parameterized, continuum electrostatic model for predicting protein pKa values.[Proteins. 2011]
*Burger SK, Ayers PW.**Proteins. 2011 Jul; 79(7):2044-52. Epub 2011 May 9.* - Investigating the mechanisms of photosynthetic proteins using continuum electrostatics.[Photosynth Res. 2008]
*Ullmann GM, Kloppmann E, Essigke T, Krammer EM, Klingen AR, Becker T, Bombarda E.**Photosynth Res. 2008 Jul; 97(1):33-53. Epub 2008 May 14.*

- DelPhi Web Server: A comprehensive online suite for electrostatic calculations of biological macromolecules and their complexes[Communications in computational physics. 20...]
*Sarkar S, Witham S, Zhang J, Zhenirovskyy M, Rocchia W, Alexov E.**Communications in computational physics. 2013 Jan; 13(1)269-284* - In Silico Mechanistic Profiling to Probe Small Molecule Binding to Sulfotransferases[PLoS ONE. ]
*Martiny VY, Carbonell P, Lagorce D, Villoutreix BO, Moroy G, Miteva MA.**PLoS ONE. 8(9)e73587* - Exploring NMR ensembles of calcium binding proteins: Perspectives to design inhibitors of protein-protein interactions[BMC Structural Biology. ]
*Isvoran A, Badel A, Craescu CT, Miron S, Miteva MA.**BMC Structural Biology. 1124* - Analysis of Binding Sites on Complement Factor I That Are Required for Its Activity[The Journal of Biological Chemistry. 2010]
*Nilsson SC, Nita I, Månsson L, Groeneveld TW, Trouw LA, Villoutreix BO, Blom AM.**The Journal of Biological Chemistry. 2010 Feb 26; 285(9)6235-6245* - Physical Properties of Intact Proteins May Predict Allergenicity or Lack Thereof[PLoS ONE. ]
*Singh S, Taneja B, Salvi SS, Agrawal A.**PLoS ONE. 4(7)e6273*

- PCE: web tools to compute protein continuum electrostaticsPCE: web tools to compute protein continuum electrostaticsNucleic Acids Research. Jul 1, 2005; 33(Web Server issue)W372

Your browsing activity is empty.

Activity recording is turned off.

See more...