Format

Send to

Choose Destination
J Chromatogr B Analyt Technol Biomed Life Sci. 2017 Dec 15;1071:19-28. doi: 10.1016/j.jchromb.2017.06.002. Epub 2017 Jun 4.

Clustering of MS2 spectra using unsupervised methods to aid the identification of secondary metabolites from Pseudomonas aeruginosa.

Author information

1
Department of Chemical Biology, Helmholtz Centre for Infection Research, Inhoffenstraße 7, D-38124 Braunschweig, Germany.
2
Department of Chemical Biology, Helmholtz Centre for Infection Research, Inhoffenstraße 7, D-38124 Braunschweig, Germany. Electronic address: Raimo.Franke@helmholtz-hzi.de.
3
Department of Chemical Biology, Helmholtz Centre for Infection Research, Inhoffenstraße 7, D-38124 Braunschweig, Germany; German Centre for Infection Research (DZIF), Inhoffenstraße 7, D-38124 Braunschweig, Germany. Electronic address: Mark.Broenstrup@helmholtz-hzi.de.

Abstract

Pseudomonas aeruginosa is an important opportunistic pathogen that produces a large arsenal of small molecule virulence factors and quorum sensing signal molecules. The annotation of these secondary metabolites in untargeted, mass spectrometry-based metabolomics is difficult, as many of them cannot be found in common metabolite databases, and as manual annotation is tedious. We therefore developed an algorithm named CluMSID that uses cosine similarities of product ion spectra and neutral loss patterns in combination with unsupervised clustering methods such as multidimensional scaling, density based clustering and hierarchical clustering to group structurally similar compounds and hence facilitate their annotation. The use of this tool allowed us to find clusters for several classes of primary and secondary metabolites, and helped identifying spectral similarities that would have gone unnoticed in standard untargeted metabolomics data analysis workflows. CluMSID enabled the annotation of 27 previously undescribed members of the canonical classes of alkyl quinolone quorum sensing signal molecules and provided evidence for the postulation of a new putative alkyl quinolone class. The CluMSID script written in R is open source and can be used by anyone in the metabolomics and natural product research community.

KEYWORDS:

Alkyl quinolones; Clustering; Data mining; LC–MS/MS; Metabolite identification; Metabolomics; Pseudomonas aeruginosa; Quorum sensing

PMID:
28642031
DOI:
10.1016/j.jchromb.2017.06.002
[Indexed for MEDLINE]

Supplemental Content

Full text links

Icon for Elsevier Science
Loading ...
Support Center