A comparative study of sequence- and structure-based features of small RNAs and other RNAs of bacteria

RNA Biol. 2018 Jan 2;15(1):95-103. doi: 10.1080/15476286.2017.1387709. Epub 2017 Nov 13.

Abstract

Small RNAs (sRNAs) in bacteria have emerged as key players in transcriptional and post-transcriptional regulation of gene expression. Here, we present a statistical analysis of different sequence- and structure-related features of bacterial sRNAs to identify the descriptors that could discriminate sRNAs from other bacterial RNAs. We investigated a comprehensive and heterogeneous collection of 816 sRNAs, identified by northern blotting across 33 bacterial species and compared their various features with other classes of bacterial RNAs, such as tRNAs, rRNAs and mRNAs. We observed that sRNAs differed significantly from the rest with respect to G+C composition, normalized minimum free energy of folding, motif frequency and several RNA-folding parameters like base-pairing propensity, Shannon entropy and base-pair distance. Based on the selected features, we developed a predictive model using Random Forests (RF) method to classify the above four classes of RNAs. Our model displayed an overall predictive accuracy of 89.5%. These findings would help to differentiate bacterial sRNAs from other RNAs and further promote prediction of novel sRNAs in different bacterial species.

Keywords: Base-pairing propensity; base-pair distance; minimum free energy of folding; motifs; random forests; shannon entropy; small RNAs.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Bacteria / genetics
  • Base Composition / genetics
  • Base Pairing
  • Gene Expression Regulation, Bacterial
  • RNA, Bacterial / classification
  • RNA, Bacterial / genetics
  • RNA, Messenger / classification
  • RNA, Messenger / genetics*
  • RNA, Ribosomal / classification
  • RNA, Ribosomal / genetics*
  • RNA, Small Untranslated / classification
  • RNA, Small Untranslated / genetics*
  • RNA, Transfer / classification
  • RNA, Transfer / genetics*

Substances

  • RNA, Bacterial
  • RNA, Messenger
  • RNA, Ribosomal
  • RNA, Small Untranslated
  • RNA, Transfer

Grants and funding

This project was supported by Indian Council of Medical Research [extramural project (IRIS ID: 2013–1551G)].