Display Settings:

Format

Send to:

Choose Destination

    Bioinformatics. 2007 Feb 1;23(3):277-80. Epub 2006 Nov 22.

    A predictive model for identifying proteins by a single peptide match.

    Higdon R, Kolker E.

    The BIATECH Institute, Bothell, WA 98011, USA.

    MOTIVATION: Tandem mass-spectrometry of trypsin digests, followed by database searching, is one of the most popular approaches in high-throughput proteomics studies. Peptides are considered identified if they pass certain scoring thresholds. To avoid false positive protein identification, > or = 2 unique peptides identified within a single protein are generally recommended. Still, in a typical high-throughput experiment, hundreds of proteins are identified only by a single peptide. We introduce here a method for distinguishing between true and false identifications among single-hit proteins. The approach is based on randomized database searching and usage of logistic regression models with cross-validation. This approach is implemented to analyze three bacterial samples enabling recovery 68-98% of the correct single-hit proteins with an error rate of < 2%. This results in a 22-65% increase in number of identified proteins. Identifying true single-hit proteins will lead to discovering many crucial regulators, biomarkers and other low abundance proteins. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.

    PMID: 17121779 [PubMed - indexed for MEDLINE]

    Supplemental Content

    Click here to read