Format

Send to

Choose Destination
J Am Soc Mass Spectrom. 2015 Nov;26(11):1885-94. doi: 10.1007/s13361-015-1204-0. Epub 2015 Jun 30.

Novor: real-time peptide de novo sequencing software.

Author information

1
School of Computer Science, University of Waterloo, 200 University Ave. W., Waterloo, ON, N2L3G1, Canada. binma@uwaterloo.ca.

Abstract

De novo sequencing software has been widely used in proteomics to sequence new peptides from tandem mass spectrometry data. This study presents a new software tool, Novor, to greatly improve both the speed and accuracy of today's peptide de novo sequencing analyses. To improve the accuracy, Novor's scoring functions are based on two large decision trees built from a peptide spectral library with more than 300,000 spectra with machine learning. Important knowledge about peptide fragmentation is extracted automatically from the library and incorporated into the scoring functions. The decision tree model also enables efficient score calculation and contributes to the speed improvement. To further improve the speed, a two-stage algorithmic approach, namely dynamic programming and refinement, is used. The software program was also carefully optimized. On the testing datasets, Novor sequenced 7%-37% more correct residues than the state-of-the-art de novo sequencing tool, PEAKS, while being an order of magnitude faster. Novor can de novo sequence more than 300 MS/MS spectra per second on a laptop computer. The speed surpasses the acquisition speed of today's mass spectrometer and, therefore, opens a new possibility to de novo sequence in real time while the spectrometer is acquiring the spectral data. Graphical Abstract ᅟ.

KEYWORDS:

Decision tree; Peptide de novo sequencing; Real time; Software; Tandem mass spectrometry

PMID:
26122521
PMCID:
PMC4604512
DOI:
10.1007/s13361-015-1204-0
[Indexed for MEDLINE]
Free PMC Article

Supplemental Content

Full text links

Icon for PubMed Central
Loading ...
Support Center