School of Molecular and Microbial Biosciences, University of Sydney, Sydney, New South Wales 2006, Australia.
A new algorithm has been designed and tested to identify protein, or any other macromolecular, complexes that have been widely reported in mass spectral data. The program takes advantage of the appearance of multiply charged ions that are common to both electrospray ionization and, to a lesser extent, matrix-assisted laser desorption/ionization (MALDI) mass spectra. The algorithm, known as COMPLX for the COMposition of Protein-Ligand compleXes, is capable of identifying complexes for any protein or macromolecule with a binding partner of molecular mass up to 100 000 Da. It does so by identifying ion pairs present in a mass spectrum that, when they share a common charge, have an m/z value difference that is an integer fraction of a ligand or binding partner molecular mass. Several additional criteria must be met in order for the result to be ranked in the output file including that all m/z values for ions of the protein or complex have progressively lower values as their assigned charge increases, the difference between the m/z values for adjacent charge states (z, z + 1) decrease as the assigned charge state increases, and the ratio of any two m/z values assigned to a protein or complex is equal to the inverse ratio of their charge. The entries that satisfy these criteria are then ranked according to the appearance of ions in the mass spectrum associated with the binding partner, the length of a continuous series of charges across any set of ions for a protein and complex and the lowest error recorded for the molecular mass of the ligand or binding partner. A diverse range of hypothetical and experimental mass spectral data were used to implement and test the program, including those recorded for antibody-peptide, protein-peptide and protein-heme complexes. Spectra of increasing complexity, in terms of the number of ions input, were also successfully analysed in which the number of input m/z values far exceeds the few associated with a macromolecular complex. Thus the program will be of value in a future goal of proteomics, where mass spectrometry already plays a central role, for the direct analysis of protein and other associations within biological extracts. Copyright 2003 John Wiley & Sons, Ltd.