Format

Send to

Choose Destination
R Soc Open Sci. 2015 Aug 26;2(8):150238. doi: 10.1098/rsos.150238. eCollection 2015 Aug.

Efficient conformational space exploration in ab initio protein folding simulation.

Author information

1
AℓEDA Group, Department of CSE , BUET , ECE Building, Dhaka 1205, Bangladesh ; Department of CSE , Independent University , Bangladesh, Dhaka 1229, Bangladesh.
2
AℓEDA Group, Department of CSE , BUET , ECE Building, Dhaka 1205, Bangladesh.
3
AℓEDA Group, Department of CSE , BUET , ECE Building, Dhaka 1205, Bangladesh ; Department of CSE , United International University , Dhanmondi, Dhaka 1209, Bangladesh.
4
Barts Cancer Institute , Queen Mary University of London , London, UK.

Abstract

Ab initio protein folding simulation largely depends on knowledge-based energy functions that are derived from known protein structures using statistical methods. These knowledge-based energy functions provide us with a good approximation of real protein energetics. However, these energy functions are not very informative for search algorithms and fail to distinguish the types of amino acid interactions that contribute largely to the energy function from those that do not. As a result, search algorithms frequently get trapped into the local minima. On the other hand, the hydrophobic-polar (HP) model considers hydrophobic interactions only. The simplified nature of HP energy function makes it limited only to a low-resolution model. In this paper, we present a strategy to derive a non-uniform scaled version of the real 20×20 pairwise energy function. The non-uniform scaling helps tackle the difficulty faced by a real energy function, whereas the integration of 20×20 pairwise information overcomes the limitations faced by the HP energy function. Here, we have applied a derived energy function with a genetic algorithm on discrete lattices. On a standard set of benchmark protein sequences, our approach significantly outperforms the state-of-the-art methods for similar models. Our approach has been able to explore regions of the conformational space which all the previous methods have failed to explore. Effectiveness of the derived energy function is presented by showing qualitative differences and similarities of the sampled structures to the native structures. Number of objective function evaluation in a single run of the algorithm is used as a comparison metric to demonstrate efficiency.

KEYWORDS:

discrete lattices; energy function; genetic algorithms; optimization; protein folding simulation; protein structure prediction

Supplemental Content

Full text links

Icon for PubMed Central
Loading ...
Support Center