Format

Send to

Choose Destination
PLoS Comput Biol. 2014 Sep 4;10(9):e1003822. doi: 10.1371/journal.pcbi.1003822. eCollection 2014 Sep.

Pep2Path: automated mass spectrometry-guided genome mining of peptidic natural products.

Author information

1
Department of Microbial Physiology, University of Groningen, Groningen, The Netherlands; Groningen Bioinformatics Centre, University of Groningen, Groningen, The Netherlands.
2
Skaggs School of Pharmacy and Pharmaceutical Sciences and Departments of Pharmacology, Chemistry and Biochemistry, University of California, San Diego, La Jolla, California, United States of America.
3
Department of Microbial Physiology, University of Groningen, Groningen, The Netherlands; Manchester Institute of Biotechnology, Faculty of Life Sciences, University of Manchester, Manchester, United Kingdom.
4
Groningen Bioinformatics Centre, University of Groningen, Groningen, The Netherlands; Manchester Institute of Biotechnology, Faculty of Life Sciences, University of Manchester, Manchester, United Kingdom.

Abstract

Nonribosomally and ribosomally synthesized bioactive peptides constitute a source of molecules of great biomedical importance, including antibiotics such as penicillin, immunosuppressants such as cyclosporine, and cytostatics such as bleomycin. Recently, an innovative mass-spectrometry-based strategy, peptidogenomics, has been pioneered to effectively mine microbial strains for novel peptidic metabolites. Even though mass-spectrometric peptide detection can be performed quite fast, true high-throughput natural product discovery approaches have still been limited by the inability to rapidly match the identified tandem mass spectra to the gene clusters responsible for the biosynthesis of the corresponding compounds. With Pep2Path, we introduce a software package to fully automate the peptidogenomics approach through the rapid Bayesian probabilistic matching of mass spectra to their corresponding biosynthetic gene clusters. Detailed benchmarking of the method shows that the approach is powerful enough to correctly identify gene clusters even in data sets that consist of hundreds of genomes, which also makes it possible to match compounds from unsequenced organisms to closely related biosynthetic gene clusters in other genomes. Applying Pep2Path to a data set of compounds without known biosynthesis routes, we were able to identify candidate gene clusters for the biosynthesis of five important compounds. Notably, one of these clusters was detected in a genome from a different subphylum of Proteobacteria than that in which the molecule had first been identified. All in all, our approach paves the way towards high-throughput discovery of novel peptidic natural products. Pep2Path is freely available from http://pep2path.sourceforge.net/, implemented in Python, licensed under the GNU General Public License v3 and supported on MS Windows, Linux and Mac OS X.

PMID:
25188327
PMCID:
PMC4154637
DOI:
10.1371/journal.pcbi.1003822
[Indexed for MEDLINE]
Free PMC Article

Supplemental Content

Full text links

Icon for Public Library of Science Icon for PubMed Central
Loading ...
Support Center