Signatures of optimal codon usage in metabolic genes inform budding yeast ecology

PLoS Biol. 2021 Apr 19;19(4):e3001185. doi: 10.1371/journal.pbio.3001185. eCollection 2021 Apr.

Abstract

Reverse ecology is the inference of ecological information from patterns of genomic variation. One rich, heretofore underutilized, source of ecologically relevant genomic information is codon optimality or adaptation. Bias toward codons that match the tRNA pool is robustly associated with high gene expression in diverse organisms, suggesting that codon optimization could be used in a reverse ecology framework to identify highly expressed, ecologically relevant genes. To test this hypothesis, we examined the relationship between optimal codon usage in the classic galactose metabolism (GAL) pathway and known ecological niches for 329 species of budding yeasts, a diverse subphylum of fungi. We find that optimal codon usage in the GAL pathway is positively correlated with quantitative growth on galactose, suggesting that GAL codon optimization reflects increased capacity to grow on galactose. Optimal codon usage in the GAL pathway is also positively correlated with human-associated ecological niches in yeasts of the CUG-Ser1 clade and with dairy-associated ecological niches in the family Saccharomycetaceae. For example, optimal codon usage of GAL genes is greater than 85% of all genes in the genome of the major human pathogen Candida albicans (CUG-Ser1 clade) and greater than 75% of genes in the genome of the dairy yeast Kluyveromyces lactis (family Saccharomycetaceae). We further find a correlation between optimization in the GALactose pathway genes and several genes associated with nutrient sensing and metabolism. This work suggests that codon optimization harbors information about the metabolic ecology of microbial eukaryotes. This information may be particularly useful for studying fungal dark matter-species that have yet to be cultured in the lab or have only been identified by genomic material.

Publication types

  • Research Support, Non-U.S. Gov't
  • Research Support, U.S. Gov't, Non-P.H.S.

MeSH terms

  • Carbohydrate Metabolism / genetics
  • Codon
  • Codon Usage / physiology*
  • Ecosystem*
  • Galactose / metabolism
  • Gene-Environment Interaction
  • Genes, Fungal / physiology
  • Genetic Association Studies
  • Metabolic Networks and Pathways / genetics*
  • Organisms, Genetically Modified
  • Saccharomyces cerevisiae / genetics
  • Saccharomyces cerevisiae / growth & development
  • Saccharomyces cerevisiae / metabolism
  • Saccharomycetales* / classification
  • Saccharomycetales* / genetics
  • Saccharomycetales* / metabolism

Substances

  • Codon
  • Galactose

Associated data

  • figshare/10.6084/m9.figshare.c.4498292
  • figshare/10.6084/m9.figshare.c.5067962

Grants and funding

This work was supported by the National Science Foundation under Grant Nos. DEB-1442113 (to A.R.) and DEB-1442148 (to C.T.H.), in part by the DOE Great Lakes Bioenergy Research Center (DOE BER Office of Science DE-SC0018409), USDA National Institute of Food and Agriculture (Hatch Project 1020204 to C.T.H.), and a Guggenheim fellowship (to A.R). C.T.H. is a Pew Scholar in the Biomedical Sciences and a H. I. Romnes Faculty Fellow, respectively supported by the Pew Charitable Trusts and the Office of the Vice Chancellor for Research and Graduate Education with funding from the Wisconsin Alumni Research Foundation. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.