Format

Send to

Choose Destination
Methods Mol Biol. 2016;1410:77-89. doi: 10.1007/978-1-4939-3524-6_5.

Proteogenomic Methods to Improve Genome Annotation.

Author information

1
Institute of Bioinformatics, International Technology Park, Bangalore, 560066, India.
2
School of Biotechnology, KIIT University, Bhubaneswar, 751024, Odisha, India.
3
Centre for Bioinformatics, School of Life Sciences, Pondicherry University, Puducherry, 605014, India.
4
Institute of Bioinformatics, International Technology Park, Bangalore, 560066, India. harsha@ibioinformatics.org.
5
School of Biotechnology, KIIT University, Bhubaneswar, 751024, Odisha, India. harsha@ibioinformatics.org.

Abstract

Annotation of protein coding genes in sequenced genomes has been routinely carried out using gene prediction programs guided by available transcript data. The advent of mass spectrometry has enabled the identification of proteins in a high-throughput manner. In addition to searching proteins annotated in public databases, mass spectrometry data can also be searched against conceptually translated genome as well as transcriptome to identify novel protein coding regions. This proteogenomics approach has resulted in the identification of novel protein coding regions in both prokaryotic and eukaryotic genomes. These studies have also revealed that some of the annotated noncoding RNAs and pseudogenes code for proteins. This approach is likely to become a part of most genome annotation workflows in the future. Here we describe a general methodology and approach that can be used for proteogenomics.

KEYWORDS:

Mass spectrometry; Noncoding RNAs; Novel proteins; Proteogenomics; Pseudogenes

PMID:
26867739
DOI:
10.1007/978-1-4939-3524-6_5
[Indexed for MEDLINE]

Supplemental Content

Full text links

Icon for Springer
Loading ...
Support Center