An efficient method of exploring simulation models by assimilating literature and biological observational data

Biosystems. 2014 Jul:121:54-66. doi: 10.1016/j.biosystems.2014.06.001. Epub 2014 Jun 5.

Abstract

Recently, several biological simulation models of, e.g., gene regulatory networks and metabolic pathways, have been constructed based on existing knowledge of biomolecular reactions, e.g., DNA-protein and protein-protein interactions. However, since these do not always contain all necessary molecules and reactions, their simulation results can be inconsistent with observational data. Therefore, improvements in such simulation models are urgently required. A previously reported method created multiple candidate simulation models by partially modifying existing models. However, this approach was computationally costly and could not handle a large number of candidates that are required to find models whose simulation results are highly consistent with the data. In order to overcome the problem, we focused on the fact that the qualitative dynamics of simulation models are highly similar if they share a certain amount of regulatory structures. This indicates that better fitting candidates tend to share the basic regulatory structure of the best fitting candidate, which can best predict the data among candidates. Thus, instead of evaluating all candidates, we propose an efficient explorative method that can selectively and sequentially evaluate candidates based on the similarity of their regulatory structures. Furthermore, in estimating the parameter values of a candidate, e.g., synthesis and degradation rates of mRNA, for the data, those of the previously evaluated candidates can be utilized. The method is applied here to the pharmacogenomic pathways for corticosteroids in rats, using time-series microarray expression data. In the performance test, we succeeded in obtaining more than 80% of consistent solutions within 15% of the computational time as compared to the comprehensive evaluation. Then, we applied this approach to 142 literature-recorded simulation models of corticosteroid-induced genes, and consequently selected 134 newly constructed better models. The method described here was found to be capable of efficiently exploring candidate simulation models and obtaining better models within a short span of time. Furthermore, the results suggest that there may be room for improvement in literature recorded pathways and that they can be systematically updated using biological observational data.

Keywords: Gene expression; Pharmacogenome; Simulation; Systems biology.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Adrenal Cortex Hormones / metabolism*
  • Adrenal Cortex Hormones / pharmacology
  • Animals
  • Chick Embryo
  • Computer Simulation / statistics & numerical data
  • Gene Expression Regulation / drug effects
  • Gene Expression Regulation / physiology*
  • High-Throughput Screening Assays / methods
  • Metabolic Networks and Pathways / physiology*
  • Models, Biological*
  • Pharmacogenetics / methods*
  • Protein Array Analysis / methods
  • Rats
  • Research Design*

Substances

  • Adrenal Cortex Hormones