Send to

Choose Destination
See comment in PubMed Commons below
PLoS One. 2013;8(3):e53253. doi: 10.1371/journal.pone.0053253. Epub 2013 Mar 26.

Class prediction and feature selection with linear optimization for metagenomic count data.

Author information

  • 1University of Maryland Greenebaum Cancer Center, Baltimore, Maryland, USA.

Erratum in

  • PLoS One. 2014;9(5):e97958.


The amount of metagenomic data is growing rapidly while the computational methods for metagenome analysis are still in their infancy. It is important to develop novel statistical learning tools for the prediction of associations between bacterial communities and disease phenotypes and for the detection of differentially abundant features. In this study, we presented a novel statistical learning method for simultaneous association prediction and feature selection with metagenomic samples from two or multiple treatment populations on the basis of count data. We developed a linear programming based support vector machine with L(1) and joint L(1,∞) penalties for binary and multiclass classifications with metagenomic count data (metalinprog). We evaluated the performance of our method on several real and simulation datasets. The proposed method can simultaneously identify features and predict classes with the metagenomic count data.

[PubMed - indexed for MEDLINE]
Free PMC Article
PubMed Commons home

PubMed Commons

How to join PubMed Commons

    Supplemental Content

    Full text links

    Icon for Public Library of Science Icon for PubMed Central
    Loading ...
    Support Center