Logo of bioinfoLink to Publisher's site
Bioinformatics. 2008 Jul 1; 24(13): 1498–1502.
Published online 2008 May 3. doi:  10.1093/bioinformatics/btn216
PMCID: PMC3133936

High-performance hardware implementation of a parallel database search engine for real-time peptide mass fingerprinting


Motivation: Peptide mass fingerprinting (PMF) is a method for protein identification in which a protein is fragmented by a defined cleavage protocol (usually proteolysis with trypsin), and the masses of these products constitute a ‘fingerprint’ that can be searched against theoretical fingerprints of all known proteins. In the first stage of PMF, the raw mass spectrometric data are processed to generate a peptide mass list. In the second stage this protein fingerprint is used to search a database of known proteins for the best protein match. Although current software solutions can typically deliver a match in a relatively short time, a system that can find a match in real time could change the way in which PMF is deployed and presented. In a paper published earlier we presented a hardware design of a raw mass spectra processor that, when implemented in Field Programmable Gate Array (FPGA) hardware, achieves almost 170-fold speed gain relative to a conventional software implementation running on a dual processor server. In this article we present a complementary hardware realization of a parallel database search engine that, when running on a Xilinx Virtex 2 FPGA at 100 MHz, delivers 1800-fold speed-up compared with an equivalent C software routine, running on a 3.06 GHz Xeon workstation. The inherent scalability of the design means that processing speed can be multiplied by deploying the design on multiple FPGAs. The database search processor and the mass spectra processor, running on a reconfigurable computing platform, provide a complete real-time PMF protein identification solution.

Contact: ku.ca.dleiffehs@acoc.d


  • Anish TA, et al. Hardware-accelerated protein identification for mass spectrometry. Rapid Commun. Mass Spectrom. 2005;19:833–837. [PubMed]
  • Bogdan I, et al. Hardware acceleration of processing of mass spectrometric data for proteomics. Bioinform. Gene Express. 2007;23:724–731. [PubMed]
  • Fagin B, et al. A special-purpose processor for gene sequence analysis. Comput. Appl. BioSci. 1993;9:221–226. [PubMed]
  • Gattiker A, et al. FindPept, a tool to identify unmatched masses in peptide mass fingerprinting protein identification. Proteomix. 2002;2:1435–1444. [PubMed]
  • Guccione AS, Keller E. Gene matching using Jbits. Lecture Notes in Computer Science; Proceedings of the Reconfigurable Computing is Going Mainstream, 12th International Conference on Field-Programmable Logic and Applications; London: Springer-Verlag; 2002. pp. 1168–1171.
  • Guerdoux-Jamet P, Lavenier D. SAMBA: hardware accelerator for biological sequence comparison. Comput. Appl. BioSci. 1997;13:609–615. [PubMed]
  • Hughey R. Parallel hardware for sequence comparison and alignment. Comput. Appl. BioSci. 1996;12:473–479. [PubMed]
  • Krogh A, et al. Hidden Markov models in computational biology: applications to protein modeling. J. Mol. Biol. 1994;235:1501–1531. [PubMed]
  • Lavenier D. Speeding up genome computations with systolic accelerator. SIAM News. 1998;31:1–8.
  • Marongiu A, et al. Designing hardware for protein sequence analysis. Bioinformatics. 2003;19:1739–1740. [PubMed]
  • Oliver T, et al. Using reconfigurable hardware to accelerate multiple sequence alignment with ClustaIW. Bioinformatics. 2005;21:3431–3432. [PubMed]
  • Panitz F, et al. SNP mining porcine ESTs with MAVIANT, a novel tool for SNP evaluation and annotation. Bioinformatics. 2007;23:i387–i391. [PubMed]
  • Pappin DJC, et al. Rapid identification of proteins by peptide-mass fingerprinting. Curr. Biol. 1993;3:327–332. [PubMed]
  • Perkins DN, et al. Probability-based protein identification by searching sequence databases using mass spectrometry data. Electrophoresis. 1999;20:3551–3567. [PubMed]
  • Samuelsson J, et al. Modular, scriptable and automated analysis tools for high-throughput peptide mass fingerprinting. Bioinformatics. 2004;20:3628–3635. [PubMed]
  • Schechter I, Berger A. On the size of the active site in proteases. Biochem. Biophys. Res. Com. 1967;27:157–162. [PubMed]
  • Simmler H, et al. Real-Time Primer Design for DNA Chips. Intersci. Concurr. Comput. Pract. Exper. 2004;16:855–872.
  • Sun Y, Buhler J. Designing patterns for profile HMM search. Bioinformatics. 2007;23:e36–e43. [PubMed]
  • Wozniak A. Using video-oriented instructions to speed up sequence comparison. Comput. Appl. BioSci. 1997;13:145–150. [PubMed]
  • Zhang W, Chait BT. ProFound: an expert system for protein identification using mass spectrometric peptide mapping information. Anal. Chem. 2000;72:2482–2489. [PubMed]

Articles from Bioinformatics are provided here courtesy of Oxford University Press

Related citations in PubMed

See reviews...See all...

Cited by other articles in PMC

See all...


  • MedGen
    Related information in MedGen
  • PubMed
    PubMed citations for these articles

Recent Activity

Your browsing activity is empty.

Activity recording is turned off.

Turn recording back on

See more...