Format

Send to

Choose Destination
BMC Syst Biol. 2016 Nov 29;10(1):129.

The quality of metabolic pathway resources depends on initial enzymatic function assignments: a case for maize.

Author information

1
Bioinformatics and Computational Biology Program, Iowa State University, Ames IA, USA.
2
Electrical and Computer Engineering Department, Iowa State University, Ames IA, USA.
3
USDA-ARS Corn Insects and Crop Genetics Research Unit, Iowa State University, Ames IA, USA.
4
USDA-ARS Plant Genetics Research Unit and Division of Plant Sciences, University of Missouri, Columbia MO, USA.
5
Department of Plant Biology, Carnegie Institution for Science, Stanford CA, USA.
6
Bioinformatics and Computational Biology Program, Iowa State University, Ames IA, USA. taner.sen@ars.usda.gov.
7
USDA-ARS Corn Insects and Crop Genetics Research Unit, Iowa State University, Ames IA, USA. taner.sen@ars.usda.gov.
8
Department of Genetics, Development and Cell Biology, Iowa State University, Ames IA, USA. taner.sen@ars.usda.gov.
9
USDA-ARS Crop Improvement and Genetics Research Unit, Albany CA, USA. taner.sen@ars.usda.gov.

Abstract

BACKGROUND:

As metabolic pathway resources become more commonly available, researchers have unprecedented access to information about their organism of interest. Despite efforts to ensure consistency between various resources, information content and quality can vary widely. Two maize metabolic pathway resources for the B73 inbred line, CornCyc 4.0 and MaizeCyc 2.2, are based on the same gene model set and were developed using Pathway Tools software. These resources differ in their initial enzymatic function assignments and in the extent of manual curation. We present an in-depth comparison between CornCyc and MaizeCyc to demonstrate the effect of initial computational enzymatic function assignments on the quality and content of metabolic pathway resources.

RESULTS:

These two resources are different in their content. MaizeCyc contains GO annotations for over 21,000 genes that CornCyc is missing. CornCyc contains on average 1.6 transcripts per gene, while MaizeCyc contains almost no alternate splicing. MaizeCyc also does not match CornCyc's breadth in representing the metabolic domain; MaizeCyc has fewer compounds, reactions, and pathways than CornCyc. CornCyc's computational predictions are more accurate than those in MaizeCyc when compared to experimentally determined function assignments, demonstrating the relative strength of the enzymatic function assignment pipeline used to generate CornCyc.

CONCLUSIONS:

Our results show that the quality of initial enzymatic function assignments primarily determines the quality of the final metabolic pathway resource. Therefore, biologists should pay close attention to the methods and information sources used to develop a metabolic pathway resource to gauge the utility of using such functional assignments to construct hypotheses for experimental studies.

KEYWORDS:

BioCyc; CornCyc; Database comparison; JavaCycO; MaizeCyc; Metabolic pathway databases

PMID:
27899149
PMCID:
PMC5129634
DOI:
10.1186/s12918-016-0369-x
[Indexed for MEDLINE]
Free PMC Article

Supplemental Content

Full text links

Icon for BioMed Central Icon for PubMed Central
Loading ...
Support Center