Evolutionary non-linear modelling for selecting vaccines against antigenically variable viruses

Bioinformatics. 2015 Mar 15;31(6):834-40. doi: 10.1093/bioinformatics/btu768. Epub 2014 Nov 19.

Abstract

Motivation: In vitro and in vivo selection of vaccines is time consuming, expensive and the selected vaccines may not be able to provide protection against broad-spectrum viruses because of emerging antigenically novel disease strains. A powerful computational model that incorporates these protein/DNA or RNA level fluctuations can effectively predict antigenically variant strains, and can minimize the amount of resources spent on exclusive serological testing of vaccines and make wide spectrum vaccines possible for many diseases. However, in silico vaccine prediction remains a grand challenge. To address the challenge, we investigate the use of linear and non-linear regression models to predict the antigenic similarity in foot-and-mouth disease virus strains and in influenza strains, where the structure and parameters of the non-linear model are optimized using an evolutionary algorithm (EA). In addition, we examine two different scoring methods for weighting the type of amino acid substitutions in the linear and non-linear models. We also test our models with some unseen data.

Results: We achieved the best prediction results on three datasets of SAT2 (Foot-and-Mouth disease), two datasets of serotype A (Foot-and-Mouth disease) and two datasets of influenza when the scoring method based on biochemical properties of amino acids is employed in combination with a non-linear regression model. Models based on substitutions in the antigenic areas performed better than those that took the entire exposed viral capsid proteins. A majority of the non-linear regression models optimi Z: ed with the EA: performed better than the linear and non-linear models whose parameters are estimated using the least-squares method. In addition, for the best models, optimi Z: ed non-linear regression models consist of more terms than their linear counterparts, implying a non-linear nature of influences of amino acid substitutions. Our models were also tested on five recently generated FMDV datasets and the best model was able to achieve an 80% agreement rate.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Algorithms
  • Animals
  • Antibodies, Neutralizing / immunology
  • Antigenic Variation / genetics*
  • Antigens, Viral
  • Biological Evolution*
  • Capsid Proteins / immunology
  • Computational Biology
  • Computer Simulation
  • Foot-and-Mouth Disease / immunology*
  • Foot-and-Mouth Disease / virology
  • Foot-and-Mouth Disease Virus / genetics*
  • Foot-and-Mouth Disease Virus / immunology
  • Influenza Vaccines / immunology*
  • Nonlinear Dynamics*

Substances

  • Antibodies, Neutralizing
  • Antigens, Viral
  • Capsid Proteins
  • Influenza Vaccines