Format

Send to

Choose Destination
Sci Rep. 2015 Jan 23;5:8004. doi: 10.1038/srep08004.

MBSTAR: multiple instance learning for predicting specific functional binding sites in microRNA targets.

Author information

1
Machine Intelligence Unit, Indian Statistical Institute, Kolkata, India.
2
Department of Biomedical Informatics, Vanderbilt University School of Medicine, Nashville, TN 37203, USA.
3
1] Department of Biomedical Informatics, Vanderbilt University School of Medicine, Nashville, TN 37203, USA [2] Department of Cancer Biology, Vanderbilt University School of Medicine, Nashville, TN 37232, USA [3] Department of Psychiatry, Vanderbilt University School of Medicine, Nashville, TN 37212, USA [4] Center for Quantitative Sciences, Vanderbilt University, Nashville, TN, 37232, USA.

Abstract

MicroRNA (miRNA) regulates gene expression by binding to specific sites in the 3'untranslated regions of its target genes. Machine learning based miRNA target prediction algorithms first extract a set of features from potential binding sites (PBSs) in the mRNA and then train a classifier to distinguish targets from non-targets. However, they do not consider whether the PBSs are functional or not, and consequently result in high false positive rates. This substantially affects the follow up functional validation by experiments. We present a novel machine learning based approach, MBSTAR (Multiple instance learning of Binding Sites of miRNA TARgets), for accurate prediction of true or functional miRNA binding sites. Multiple instance learning framework is adopted to handle the lack of information about the actual binding sites in the target mRNAs. Biologically validated 9531 interacting and 973 non-interacting miRNA-mRNA pairs are identified from Tarbase 6.0 and confirmed with PAR-CLIP dataset. It is found that MBSTAR achieves the highest number of binding sites overlapping with PAR-CLIP with maximum F-Score of 0.337. Compared to the other methods, MBSTAR also predicts target mRNAs with highest accuracy. The tool and genome wide predictions are available at http://www.isical.ac.in/~bioinfo_miu/MBStar30.htm.

PMID:
25614300
PMCID:
PMC4648438
DOI:
10.1038/srep08004
[Indexed for MEDLINE]
Free PMC Article

Supplemental Content

Full text links

Icon for Nature Publishing Group Icon for PubMed Central
Loading ...
Support Center