Format

Send to

Choose Destination
Mol Cell Proteomics. 2014 Sep;13(9):2467-79. doi: 10.1074/mcp.O113.036327. Epub 2014 Jun 2.

Computing exact p-values for a cross-correlation shotgun proteomics score function.

Author information

1
From the ‡Department of Genome Sciences, University of Washington, Seattle, Washington;
2
From the ‡Department of Genome Sciences, University of Washington, Seattle, Washington; §Department of Computer Science and Engineering, University of Washington, Seattle, Washington william-noble@uw.edu.

Abstract

The core of every protein mass spectrometry analysis pipeline is a function that assesses the quality of a match between an observed spectrum and a candidate peptide. We describe a procedure for computing exact p-values for the oldest and still widely used score function, SEQUEST XCorr. The procedure uses dynamic programming to enumerate efficiently the full distribution of scores for all possible peptides whose masses are close to that of the spectrum precursor mass. Ranking identified spectra by p-value rather than XCorr significantly reduces variance because of spectrum-specific effects on the score. In combination with the Percolator postprocessor, the XCorr p-value yields more spectrum and peptide identifications at a fixed false discovery rate than Mascot, X!Tandem, Comet, and MS-GF+ across a variety of data sets.

PMID:
24895379
PMCID:
PMC4159662
DOI:
10.1074/mcp.O113.036327
[Indexed for MEDLINE]
Free PMC Article

Supplemental Content

Full text links

Icon for HighWire Icon for PubMed Central
Loading ...
Support Center