BMC Bioinformatics. 2013;14 Suppl 2:S24. doi: 10.1186/1471-2105-14-S2-S24. Epub 2013 Jan 21.

Mass spectrometry-based protein identification by integrating de novo sequencing with database searching.

Author information

1. Prince of Wales Clinical School, University of New South Wales, Australia. penghao.wang@unsw.edu.au

Abstract

BACKGROUND:

Mass spectrometry-based protein identification is a challenging task. The main identification approaches are de novo sequencing and database searching. Both have shortcomings, so an integrative approach has been developed. This approach first infers partial peptide sequences, known as tags, directly from tandem spectra through de novo sequencing, and then uses these tags in a database search to find a closely matching peptide. However, current implementations of this integrative approach have several limitations. First, they apply simplistic de novo sequencing and use only very short sequence tags. Second, most integrative methods use a BLAST-like algorithm to search for exact sequence matches and do not accommodate sequence errors well. Third, in these methods the integrated de novo sequencing contributes little to the scoring model, which is still largely based on database searching.
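To illustrate why exact tag matching handles sequence errors poorly, here is a minimal sketch of an error-tolerant tag lookup. It is not the method described in this paper. The function names, the mismatch threshold, and the toy peptide list are all hypothetical, and a simple Hamming distance stands in for whatever scoring a real tool would use.

```python
# Illustrative sketch only: error-tolerant lookup of a de novo sequence tag
# in a small peptide list. All names and data here are hypothetical.

def hamming(a, b):
    """Count positions at which two equal-length strings differ."""
    return sum(x != y for x, y in zip(a, b))


def find_tag_matches(tag, peptides, max_mismatches=1):
    """Return (peptide, offset, mismatches) for every window of each peptide
    that differs from the tag in at most max_mismatches positions,
    ordered from fewest to most mismatches."""
    hits = []
    for pep in peptides:
        # Slide a tag-sized window along the peptide.
        for i in range(len(pep) - len(tag) + 1):
            d = hamming(tag, pep[i:i + len(tag)])
            if d <= max_mismatches:
                hits.append((pep, i, d))
    return sorted(hits, key=lambda h: h[2])


# Toy "database" of peptide sequences (made up for this example).
peptides = ["LVNELTEFAK", "AEFVEVTK", "YLYEIAR"]

# A tag with one assumed sequencing error (S in place of T).
tolerant_hits = find_tag_matches("NELSEF", peptides, max_mismatches=1)
exact_hits = find_tag_matches("NELSEF", peptides, max_mismatches=0)
```

With the tolerance set to zero the erroneous tag finds nothing, while allowing one mismatch recovers the intended peptide. Real tags would also need to account for mass-based ambiguities, which this sketch ignores.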

RESULTS:

We have developed a new integrative protein identification method which can integrate de novo sequencing more efficiently into database searching. Evaluated on large real datasets, our method outperforms popular identification methods.

PMID: 23369017
PMCID: PMC3549845
DOI: 10.1186/1471-2105-14-S2-S24
[Indexed for MEDLINE]
Free PMC Article