Proteomics. 2019 Feb;19(4):e1800357. doi: 10.1002/pmic.201800357. Epub 2019 Jan 18.

MS-Rescue: A Computational Pipeline to Increase the Quality and Yield of Immunopeptidomics Experiments.

Author information

1 Instituto de Investigaciones Biotecnológicas, Universidad Nacional de San Martín, Av. 25 de Mayo y Francia CP(1650), San Martín, Argentina.
2 Nuffield Department of Medicine, University of Oxford, Oxford, OX3 7BN, UK.
3 Oxford NIHR Biomedical Research Centre, Oxford, OX4 2PG, UK.
4 The Jenner Institute, University of Oxford, Oxford, OX3 7DQ, UK.
5 Department of Bio and Health Informatics, Technical University of Denmark, 2800 Kgs. Lyngby, Denmark.

Abstract

LC-MS/MS has become the standard platform for characterizing immunopeptidomes, the collections of peptides naturally presented at the cell surface by major histocompatibility complex (MHC) molecules. The protocols and algorithms used for immunopeptidomics data analysis are based on tools developed for traditional bottom-up proteomics, which address the identification of peptides generated by tryptic digestion. Such algorithms are generally not tailored to the specific requirements of MHC ligand identification and, as a consequence, immunopeptidomics datasets suffer from the dismissal of informative spectral data and from high false discovery rates. Here, a new pipeline for the refinement of peptide-spectrum matches (PSMs) is proposed, based on the assumption that immunopeptidomes contain a limited number of recurring peptide motifs, corresponding to MHC specificities. Sequence motifs are learned directly from the individual peptidome by training a prediction model on high-confidence PSMs. The model is then applied to PSM candidates with lower confidence, and sequences that score significantly higher than random peptides are rescued as likely true ligands. The pipeline is applied to MHC class I immunopeptidomes from three different species and is shown to increase the number of identified ligands by up to 20-30%, while effectively removing false positives and co-precipitation products. Spectral validation using synthetic peptides confirms the identity of a large proportion of rescued ligands in the experimental peptidome.
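
The rescue logic described above (learn a motif from high-confidence PSMs, score lower-confidence candidates, keep those that score significantly higher than random peptides) can be illustrated with a minimal sketch. This is not the MS-Rescue implementation: the abstract does not specify the prediction model, so a simple log-odds position-specific scoring matrix (PSSM) is swapped in here. The function names, the toy 9-mer peptides, the uniform amino acid background, the single-motif and fixed-length simplifications, and the 1% significance level are all assumptions made for illustration.

```python
import math
import random

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"

# Hypothetical high-confidence 9-mers sharing an L at position 2 and a V at
# position 9 (toy data invented for this sketch, not taken from the paper).
HIGH_CONFIDENCE = [
    "ALAKDGTSV", "KLDEFQHIV", "SLMNPCRTV", "GLWYAIDEV",
    "TLFGHRKMV", "YLNPQASKV", "ILVWYHCDV", "QLEFGTINV",
]


def train_pssm(peptides, pseudocount=1.0):
    """Build a log-odds PSSM from equal-length peptides (uniform background assumed)."""
    length = len(peptides[0])
    background = 1.0 / len(AMINO_ACIDS)
    pssm = []
    for pos in range(length):
        # Pseudocounts keep unseen residues from producing log(0).
        counts = {aa: pseudocount for aa in AMINO_ACIDS}
        for pep in peptides:
            counts[pep[pos]] += 1
        total = sum(counts.values())
        pssm.append(
            {aa: math.log2(counts[aa] / total / background) for aa in AMINO_ACIDS}
        )
    return pssm


def score(pssm, peptide):
    """Sum the per-position log-odds for a peptide of the same length as the PSSM."""
    return sum(column[aa] for column, aa in zip(pssm, peptide))


def rescue(pssm, candidates, n_random=10000, alpha=0.01, seed=0):
    """Keep candidates scoring above the (1 - alpha) quantile of random-peptide scores."""
    rng = random.Random(seed)
    length = len(pssm)
    null_scores = sorted(
        score(pssm, "".join(rng.choices(AMINO_ACIDS, k=length)))
        for _ in range(n_random)
    )
    cutoff_index = min(int((1 - alpha) * n_random), n_random - 1)
    threshold = null_scores[cutoff_index]
    # Candidates of a different length are skipped in this simplified sketch.
    return [p for p in candidates if len(p) == length and score(pssm, p) > threshold]


pssm = train_pssm(HIGH_CONFIDENCE)
low_confidence = ["KLMYHASNV", "HPRLLLWPR"]  # hypothetical lower-confidence PSM sequences
rescued = rescue(pssm, low_confidence)
```

In this toy setting the first candidate matches the learned anchors and the second does not, so only the first should pass the random-peptide threshold. A real peptidome typically contains several motifs and peptide lengths, which a single fixed-length PSSM does not capture.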

KEYWORDS: MHC; machine learning; mass spectrometry; peptidome; sequence motifs

PMID: 30578603
DOI: 10.1002/pmic.201800357
