Prediction of protein architectures involved in the signaling-pathway initiating sporulation in Firmicutes

BMC Res Notes. 2019 Oct 23;12(1):686. doi: 10.1186/s13104-019-4712-3.

Abstract

Objectives: Like many other proteins, those belonging to the signal transduction cascade initiating sporulation (Spo0 pathway) have conserved protein domains (Capra and Laub in Annu Rev Microbiol 66:325-47, 2012). Improvements in bioinformatics applications to discover proteins involved in the initiation of the sporulating cascade in newly sequenced genomes is an important task that requires rigorous comparative genomic methods and manual curation to identify endospore-forming bacteria. This note aims to present a collection of predicted proteins involved in the Spo0 pathway found in the proteomes of fully sequenced and manually curated endospore-forming Firmicutes species. This collection may serve as a guide to conduct future experiments in endospore formers in genomic and metagenomic projects.

Data description: Similar to the report of Davidson et al. (PLoS Genet 14:1-33, 2018), we used Pfam profiles (El-Gebali et al. in Nucleic Acids Res 47:D427-32, 2019) defining each protein and the genomic context surrounding the query gene to predict probable orthologs of the Spo0 pathway in Firmicutes. We present in this note a collection of 325 Firmicutes species organized by phylogenetic class and classified as spore formers, non-spore formers or unknown spore phenotype based on published literature, for which we predicted probable orthologs defining the signal transduction pathway initiating sporulation.

Keywords: Firmicutes; Orthologs; Protein architecture; Sporulation.

MeSH terms

  • Bacterial Proteins / genetics*
  • Bacterial Proteins / metabolism
  • Computational Biology / methods
  • Computational Biology / statistics & numerical data
  • Firmicutes / classification
  • Firmicutes / genetics*
  • Firmicutes / metabolism
  • Gene Expression Regulation, Bacterial*
  • Genome, Bacterial / genetics
  • Genomics / methods
  • Genomics / statistics & numerical data
  • Phylogeny
  • Proteomics / methods
  • Proteomics / statistics & numerical data
  • Signal Transduction / genetics*
  • Species Specificity
  • Spores, Bacterial / genetics*

Substances

  • Bacterial Proteins

Grants and funding