Black sheep, dark horses, and colorful dogs: a review on the current state of the Gene Ontology with respect to iron homeostasis in Arabidopsis thaliana

Front Plant Sci. 2023 Jul 24:14:1204723. doi: 10.3389/fpls.2023.1204723. eCollection 2023.

Abstract

Cellular homeostasis of the micronutrient iron is highly regulated in plants and responsive to nutrition, stress, and developmental signals. Genes for iron management encode metal and other transporters, enzymes synthesizing chelators and reducing substances, transcription factors, and several types of regulators. In transcriptome or proteome datasets, such iron homeostasis-related genes are frequently found to be differentially regulated. A common method to detect whether a specific cellular pathway is affected in the transcriptome data set is to perform Gene Ontology (GO) enrichment analysis. Hence, the GO database is a widely used resource for annotating genes and identifying enriched biological pathways in Arabidopsis thaliana. However, iron homeostasis-related GO terms do not consistently reflect gene associations and levels of evidence in iron homeostasis. Some genes in the existing iron homeostasis GO terms lack direct evidence of involvement in iron homeostasis. In other aspects, the existing GO terms for iron homeostasis are incomplete and do not reflect the known biological functions associated with iron homeostasis. This can lead to potential errors in the automatic annotation and interpretation of GO term enrichment analyses. We suggest that applicable evidence codes be used to add missing genes and their respective ortholog/paralog groups to make the iron homeostasis-related GO terms more complete and reliable. There is a high likelihood of finding new iron homeostasis-relevant members in gene groups and families like the ZIP, ZIF, ZIFL, MTP, OPT, MATE, ABCG, PDR, HMA, and HMP. Hence, we compiled comprehensive lists of genes involved in iron homeostasis that can be used for custom enrichment analysis in transcriptomic or proteomic studies, including genes with direct experimental evidence, those regulated by central transcription factors, and missing members of small gene families or ortholog/paralog groups. As we provide gene annotation and literature alongside, the gene lists can serve multiple computational approaches. In summary, these gene lists provide a valuable resource for researchers studying iron homeostasis in A. thaliana, while they also emphasize the importance of improving the accuracy and comprehensiveness of the Gene Ontology.

Keywords: Arabidopsis thaliana; GO term; biological process; enrichment analysis; gene ontology; gene set; iron homeostasis.

Publication types

  • Review

Grants and funding

This work was funded by the Deutsche Forschungsgemeinschaft (DFG, German Research Foundation) – Project-ID 456082119 – TRR 341/1, and by Deutsche Forschungsgemeinschaft grants International Research Training group 2466, F0205120 (NextPlant, project ID 391465903/GRK 2466) (DB is a member of the Next Plant graduate school). Support was also provided by Germany´s Excellence Strategy – EXC-2048/1 – project ID 390686111.