![]() | ![]() |
Formats:
|
||||||||||||
Copyright © The Author 2006. Published by Oxford University Press. All rights reserved MitoP2: the mitochondrial proteome database—now including mouse data 1Institute of Human Genetics, Technical University of Munich, Munich, Germany 2Institute of Human Genetics, GSF National Research Center for Environment and Health, Neuherberg, Germany 3Institute for Bioinformatics, GSF National Research Center for Environment and Health, Neuherberg, Germany 4Department of Biochemistry and Stanford Genome Technology Center, 855 California Avenue, Palo Alto, CA 94304, USA *To whom correspondence should be addressed. Tel: +49 89 3187 2890; Fax: +49 89 3187 3297; Email: prokisch/at/gsf.de Received September 15, 2005; Revised October 18, 2005; Accepted October 18, 2005. The online version of this article has been published under an open access model. Users are entitled to use, reproduce, disseminate, or display the open access version of this article for non-commercial purposes provided that: the original authorship is properly and fully attributed; the Journal and Oxford University Press are attributed as the original place of publication with the correct citation details given; if an article is subsequently reproduced or disseminated not in its entirety but only in part or as a derivative work this must be clearly indicated. For commercial re-use, please contact journals.permissions/at/oxfordjournals.org This article has been cited by other articles in PMC.Abstract The MitoP2 database (http://www.mitop.de) integrates information on mitochondrial proteins, their molecular functions and associated diseases. The central database features are manually annotated reference proteins localized or functionally associated with mitochondria supplied for yeast, human and mouse. MitoP2 enables (i) the identification of putative orthologous proteins between these species to study evolutionarily conserved functions and pathways; (ii) the integration of data from systematic genome-wide studies such as proteomics and deletion phenotype screening; (iii) the prediction of novel mitochondrial proteins using data integration and the assignment of evidence scores; and (iv) systematic searches that aim to find the genes that underlie common and rare mitochondrial diseases. The data and analysis files are referenced to data sources in PubMed and other online databases and can be easily downloaded. MitoP2 users can explore the relationship between mitochondrial dysfunctions and disease and utilize this information to conduct systems biology approaches on mitochondria. INTRODUCTION The application of genomics to biology and medicine requires an understanding how specific gene variants contribute to phenotypes, in combination with a comprehensive knowledge of the ‘parts list’ of a cellular system and how these components are assembled into functional units (1). Mitochondria are ubiquitous and defined substructures of nucleated cells and lend themselves to systems biology approaches. However, in generic databases the annotation of mitochondrial proteins is often incomplete and does not always distinguish between proteins which have a confirmed mitochondrial subcellular localization and those which are only candidates according to preliminary experimental results or in silico predictions. For the human species, about half of the estimated 1500 proteins localized or functionally associated with mitochondria are known (2). Since the mitochondrial organelle is an evolutionarily conserved entity, systematic studies in model organisms are powerful to identify mitochondrial proteins in other organisms (3). The MitoP2 database was created to consolidate and structure public information on mitochondrial proteins, their functions and associated human diseases (4,5). MitoP2 provides a wide variety of search functions to explore and download information and to access references in PubMed and other public databases. We have further expanded the manually annotated reference sets of mitochondrial proteins in yeast (522 proteins) and human (624 proteins), and have now added the section MitoP2-Mouse (615 proteins). For these three species, we integrated data from genome-wide approaches applied to the study of mitochondria, and assigned an evidence score of a candidate protein being mitochondrial (3). With the help of MitoP2, proteins involved in mitochondrial biogenesis and function have been identified and characterized (6,7). In addition, MitoP2 has enabled the identification of disease genes using positional candidate approaches (8–10). MitoP2-YEAST A wealth of information has been collected over the past several years from single gene and genome-wide studies of Saccharomyces cerevisiae (11). The list of yeast ORFs and protein annotations in MitoP2-Yeast are based on information in the Saccharomyces Genome Database (SGD; http://www.yeastgenome.org) (12). This MitoP2-Yeast update now provides annotated information for 522 mitochondrial reference proteins, which are based on experimental validation of each of these proteins. Recently, systematic cellular sublocalization studies estimated a total of 800 mitochondrial proteins presenting ~12% of the currently known yeast genes (13,14). Therefore, ~250–300 mitochondrial proteins are still missing. In order to identify these missing genes, we have validated and integrated genome-wide approaches applied to the study of mitochondria (3). MitoP2-Yeast datasets in Table 1 show 20 systematic approaches used for this purpose: phenotypes of single gene deletion mutant phenotypes (15,16); systematic subcellular localization studies (13,14); transcriptome datasets of differentially expressed genes including fermentable and non-fermentable growth conditions, the response to diauxic shift, and Hap4 transcription factor screening (3,17,18); proteome analyses of purified mitochondrial organelles (3,19,20); protein abundance measurements (21); and data from protein–protein interaction studies that include interactions to mitochondrial proteins (22). In addition to experimental datasets, mitochondrial proteins can be predicted in silico based on the presence of mitochondrial targeting sequences (23–26), and by sequence similarity to a known mitochondrial protein from other species (defined as bidirectional best BLAST hit or best BLAST hit with a score <1 × 10−10) (27). Data from each of these systematic studies can be searched and downloaded.
Using the MitoP2-Yeast reference proteins, it is possible to analyze the specificity and sensitivity of the data from genome-wide studies (Figure 1
MitoP2-HUMAN AND MitoP2-MOUSE We manually annotated mitochondrial reference proteins for human (624) and mouse (615) that now cover about half of the estimated mitochondrial proteins in these two species. These reference proteins present a subset of all the protein entries in the database: MitoP2-Human contains 36 504 proteins and MitoP2-Mouse contains 32 422 proteins. These datasets have been downloaded from the Swiss-Prot database (http://www.expasy.org/sprot/) (28). To identify putative orthologue proteins between human and mouse we calculated a bidirectional best BLAST hit or a best BLAST hit with score <1 × 10−10 between the two datasets. For each MitoP2 protein, we extracted descriptions, chromosomal positions, subcellular localization and literature references from Swiss-Prot. In addition, functional annotations such as biological processes and functional categories were extracted from the Gene Ontology database (GO; http://www.geneontology.org/). For MitoP2-Mouse, we annotated functional descriptions according to the MIPS functional catalogue (29), and provided access to DNA and protein sequence information. Each of these protein annotations is accompanied by its PubMed reference link. Phenotypic information on available mouse models are provided by the Mouse Genome Informatics database (MGI; http://www.informatics.jax.org/) (30). To date, more than 50 mouse models carrying mutations or deletions of mitochondrial genes have been investigated. For researchers interested in studying these models, MitoP2 provides links to the International Gene Trap Consortium (IGTC; http://www.genetrap.org/) to access the related mouse cell lines.MitoP2-Mouse and MitoP2-Human provide similar search options that allow single or combined searches for individual database components (Figure 2
Each entry in MitoP2-Human and MitoP2-Mouse corresponds to a Swiss-Prot identifier with protein descriptions, annotated subcellular localization and sequence map positions according to UCSC genome browser (http://genome.ucsc.edu/). In addition, the single protein entry summarizes the information from in silico predictions, high-throughput experiments, the availability of mouse gene trap clones and the predictive MitoP2 score. An example for a single protein entry in MitoP2-Mouse, the adenine nucleotide (ADP/ATP) translocator 2, is shown in Figure 3
For genes implicated in a hereditary disease, MitoP2 provides a link to the corresponding entry in the Online Mendelian Inheritance in Man database (OMIM; http://www.ncbi.nlm.nih.gov) (36). To date, more than 120 of the 624 human mitochondrial proteins are known to be involved in a hereditary disease. Mitochondrial disorders have a diversity of debilitating phenotypes and include a wide variety of neurodegenerative processes, cardiovascular disorders, diabetes mellitus and several cancer types. Many of these disease genes function in the metabolism of amino acids, nucleic acid, fatty acids and lipids, and energy production. The MitoP2 database enables the systematic identification of candidate genes to study mitochondrial diseases (5). Elpeleg et al. (8), for example, mapped a locus for hereditary mtDNA depletions associated with mitochondrial encephalomyopathy to a 21 Mb interval on chromosome 13. The mapping coordinates (i.e. 13:40878920 and 13:61359487) were used as a selection criteria to prioritize MitoP2 candidate genes among the 113 genes predicted in this region. In combination with a MitoP2 score >60, three proteins were identified as disease candidate genes. One of these genes (SUCLA2), a mitochondrial reference protein identified in two proteome experiments, was found to be mutated in affected members of the linkage family. This study demonstrates that human disease genes can be identified using information provided by MitoP2. SUPPLEMENTARY DATA Supplementary Data are available at NAR Online. [Supplementary Material]
Acknowledgments The MitoP2 project is funded by the German National Genome Network (German Ministry for Education and Research, grant 01GR0411), BFAM (Bioinformatics for the Functional Analysis of Mammalian Genomes) and the MitEURO consortium. Funding to pay the Open Access publication charges for this article was provided by the GSF Research Centre. Conflict of interest statement. None declared. REFERENCES 1. Collins F.S., Green E.D., Guttmacher A.E., Guyer M.S. A vision for the future of genomics research. Nature. 2003;422:835–847. [PubMed] 2. Taylor S.W., Fahy E., Ghosh S.S. Global organellar proteomics. Trends Biotechnol. 2003;21:82–88. [PubMed] 3. Prokisch H., Scharfe C., Camp D.G., II, Xiao W., David L., Andreoli C., Monroe M.E., Moore R.J., Gritsenko M.A., Kozany C., et al. Integrative analysis of the mitochondrial proteome in yeast. PLoS Biol. 2004;2:e160. [PubMed] 4. Andreoli C., Prokisch H., Hortnagel K., Mueller J.C., Munsterkotter M., Scharfe C., Meitinger T. MitoP2, an integrated database on mitochondrial proteins in yeast and man. Nucleic Acids Res. 2004;32:D459–D462. [PubMed] 5. Scharfe C., Zaccaria P., Hoertnagel K., Jaksch M., Klopstock T., Dembowski M., Lill R., Prokisch H., Gerbitz K.D., Neupert W., et al. MITOP, the mitochondrial proteome database: 2000 update. Nucleic Acids Res. 2000;28:155–158. [PubMed] 6. Szklarz L., Guiard B., Rissler M., Wiedemann N., Kozjak V., van der Laan M., Lohaus C., Marcus K., Meyer H., Chacinska A., et al. Inactivation of the mitochondrial heat shock protein Zim17 leads to aggregation of matrix Hsp70s followed by plelotropic effects on morphology and protein biogenesis. J. Mol. Biol. 2005;351:206–218. [PubMed] 7. van der Laan M., Chacinska A., Lind M., Perschil I., Sickmann A., Meyer H., Guiard B., Meisinger C., Pfanner N., Rehling P. Pam17 is required for architecture and translocation activity of the mitochondrial protein import motor. Mol. Cell. Biol. 2005;25:7449–7458. [PubMed] 8. Elpeleg O., Miller C., Hershkovitz E., Bitner-Glindzicz M., Bondi-Rubinstein G., Rahman S., Pagnamenta A., Eshhar S., Saada A. Deficiency of the ADP-forming succinyl-CoA synthase activity is associated with encephalomyopathy and mitochondrial DNA depletion. Am. J. Hum. Genet. 2005;76:1081–1086. [PubMed] 9. Mootha V.K., Lepage P., Miller K., Bunkenborg J., Reich M., Hjerrild M., Delmonte T., Villeneuve A., Sladek R., Xu F., et al. Identification of a gene causing human cytochrome c oxidase deficiency by integrative genomics. Proc. Natl Acad. Sci. USA. 2003;100:605–610. [PubMed] 10. Tiranti V., Hoertnagel K., Carrozzo R., Galimberti C., Munaro M., Granatiero M., Zelante L., Gasparini P., Marzella R., Rocchi M., et al. Mutations of SURF-1 in Leigh disease associated with cytochrome c oxidase deficiency. Am. J. Hum. Genet. 1998;63:1609–1621. [PubMed] 11. Reichert A.S., Neupert W. Mitochondriomics or what makes us breathe. Trends Genet. 2004;20:555–562. [PubMed] 12. Cherry J.M., Ball C., Weng S., Juvik G., Schmidt R., Adler C., Dunn B., Dwight S., Riles L., Mortimer R.K., et al. Genetic and physical maps of Saccharomyces cerevisiae. Nature. 1997;387:67–73. [PubMed] 13. Huh W.K., Falvo J.V., Gerke L.C., Carroll A.S., Howson R.W., Weissman J.S., O'Shea E.K. Global analysis of protein localization in budding yeast. Nature. 2003;425:686–691. [PubMed] 14. Kumar A., Cheung K.H., Tosches N., Masiar P., Liu Y., Miller P., Snyder M. The TRIPLES database: a community resource for yeast molecular biology. Nucleic Acids Res. 2002;30:73–75. [PubMed] 15. Dimmer K.S., Fritz S., Fuchs F., Messerschmitt M., Weinbach N., Neupert W., Westermann B. Genetic basis of mitochondrial function and morphology in Saccharomyces cerevisiae. Mol. Biol. Cell. 2002;13:847–853. [PubMed] 16. Steinmetz L.M., Scharfe C., Deutschbauer A.M., Mokranjac D., Herman Z.S., Jones T., Chu A.M., Giaever G., Prokisch H., Oefner P.J., et al. Systematic screen for human disease genes in yeast. Nature Genet. 2002;31:400–404. [PubMed] 17. DeRisi J.L., Iyer V.R., Brown P.O. Exploring the metabolic and genetic control of gene expression on a genomic scale. Science. 1997;278:680–686. [PubMed] 18. Lascaris R., Bussemaker H.J., Boorsma A., Piper M., van der Spek H., Grivell L., Blom J. Hap4p overexpression in glucose-grown Saccharomyces cerevisiae induces cells to enter a novel metabolic state. Genome Biol. 2003;4:R3. [PubMed] 19. Pflieger D., Le Caer J.P., Lemaire C., Bernard B.A., Dujardin G., Rossier J. Systematic identification of mitochondrial proteins by LC-MS/MS. Anal. Chem. 2002;74:2400–2406. [PubMed] 20. Sickmann A., Reinders J., Wagner Y., Joppich C., Zahedi R., Meyer H.E., Schonfisch B., Perschil I., Chacinska A., Guiard B., et al. The proteome of Saccharomyces cerevisiae mitochondria. Proc. Natl Acad. Sci. USA. 2003;100:13207–13212. [PubMed] 21. Ghaemmaghami S., Huh W.K., Bower K., Howson R.W., Belle A., Dephoure N., O'Shea E.K., Weissman J.S. Global analysis of protein expression in yeast. Nature. 2003;425:737–741. [PubMed] 22. von Mering C., Krause R., Snel B., Cornell M., Oliver S.G., Fields S., Bork P. Comparative assessment of large-scale data sets of protein–protein interactions. Nature. 2002;417:399–403. [PubMed] 23. Claros M.G. MitoProt, a Macintosh application for studying mitochondrial proteins. Comput. Appl. Biosci. 1995;11:441–447. [PubMed] 24. Guda C., Fahy E., Subramaniam S. MITOPRED: a genome-scale method for prediction of nucleus-encoded mitochondrial proteins. Bioinformatics. 2004;20:1785–1794. [PubMed] 25. Nakai K., Horton P. PSORT: a program for detecting sorting signals in proteins and predicting their subcellular localization. Trends Biochem. Sci. 1999;24:34–36. [PubMed] 26. Small I., Peeters N., Legeai F., Lurin C. Predotar: a tool for rapidly screening proteomes for N-terminal targeting sequences. Proteomics. 2004;4:1581–1590. [PubMed] 27. Altschul S.F., Madden T.L., Schaffer A.A., Zhang J., Zhang Z., Miller W., Lipman D.J. Gapped BLAST and PSI-BLAST: a new generation of protein database search programs. Nucleic Acids Res. 1997;25:3389–3402. [PubMed] 28. Boeckmann B., Bairoch A., Apweiler R., Blatter M.C., Estreicher A., Gasteiger E., Martin M.J., Michoud K., O'Donovan C., Phan I., et al. The Swiss-Prot protein knowledgebase and its supplement TrEMBL in 2003. Nucleic Acids Res. 2003;31:365–370. [PubMed] 29. Ruepp A., Zollner A., Maier D., Albermann K., Hani J., Mokrejs M., Tetko I., Guldener U., Mannhaupt G., Munsterkotter M., et al. The FunCat, a functional annotation scheme for systematic classification of proteins from whole genomes. Nucleic Acids Res. 2004;32:5539–5545. [PubMed] 30. Bult C.J., Blake J.A., Richardson J.E., Kadin J.A., Eppig J.T., Baldarelli R.M., Barsanti K., Baya M., Beal J.S., Boddy W.J., et al. The Mouse Genome Database (MGD): integrating biology with the genome. Nucleic Acids Res. 2004;32:D476–D481. [PubMed] 31. Gaucher S.P., Taylor S.W., Fahy E., Zhang B., Warnock D.E., Ghosh S.S., Gibson B.W. Expanded coverage of the human heart mitochondrial proteome using multidimensional liquid chromatography coupled with tandem mass spectrometry. J. Proteome. Res. 2004;3:495–505. [PubMed] 32. Taylor S.W., Fahy E., Zhang B., Glenn G.M., Warnock D.E., Wiley S., Murphy A.N., Gaucher S.P., Capaldi R.A., Gibson B.W., et al. Characterization of the human heart mitochondrial proteome. Nat. Biotechnol. 2003;21:281–286. [PubMed] 33. Da Cruz S., Xenarios I., Langridge J., Vilbois F., Parone P.A., Martinou J.C. Proteomic analysis of the mouse liver mitochondrial inner membrane. J. Biol. Chem. 2003;278:41566–41571. [PubMed] 34. Mootha V.K., Bunkenborg J., Olsen J.V., Hjerrild M., Wisniewski J.R., Stahl E., Bolouri M.S., Ray H.N., Sihag S., Kamal M., et al. Integrated analysis of protein composition, tissue diversity, and gene regulation in mouse mitochondria. Cell. 2003;115:629–640. [PubMed] 35. Ozawa T., Sako Y., Sato M., Kitamura T., Umezawa Y. A genetic approach to identifying mitochondrial proteins. Nat. Biotechnol. 2003;21:287–293. [PubMed] 36. Hamosh A., Scott A.F., Amberger J., Bocchini C., Valle D., McKusick V.A. Online Mendelian Inheritance in Man (OMIM), a knowledgebase of human genes and genetic disorders. Nucleic Acids Res. 2002;30:52–55. [PubMed] 37. Drawid A., Gerstein M. A Bayesian system integrating expression data with sequence patterns for localizing proteins: comprehensive application to the yeast genome. J. Mol. Biol. 2000;301:1059–1075. [PubMed] 38. Marc P., Margeot A., Devaux F., Blugeon C., Corral-Debrinski M., Jacq C. Genome-wide analysis of mRNAs targeted to yeast mitochondria. EMBO Rep. 2002;3:159–164. [PubMed] 39. Ohlmeier S., Kastaniotis A.J., Hiltunen J.K., Bergmann U. The yeast mitochondrial proteome, a study of fermentative and respiratory growth. J. Biol. Chem. 2004;279:3956–3979. [PubMed] |
PubMed related articles
Your browsing activity is empty. Activity recording is turned off. |
|||||||||||
Nature. 2003 Apr 24; 422(6934):835-47.
[Nature. 2003]Trends Biotechnol. 2003 Feb; 21(2):82-8.
[Trends Biotechnol. 2003]PLoS Biol. 2004 Jun; 2(6):e160.
[PLoS Biol. 2004]Nucleic Acids Res. 2004 Jan 1; 32(Database issue):D459-62.
[Nucleic Acids Res. 2004]Nucleic Acids Res. 2000 Jan 1; 28(1):155-8.
[Nucleic Acids Res. 2000]PLoS Biol. 2004 Jun; 2(6):e160.
[PLoS Biol. 2004]J Mol Biol. 2005 Aug 5; 351(1):206-18.
[J Mol Biol. 2005]Mol Cell Biol. 2005 Sep; 25(17):7449-58.
[Mol Cell Biol. 2005]Trends Genet. 2004 Nov; 20(11):555-62.
[Trends Genet. 2004]Nature. 1997 May 29; 387(6632 Suppl):67-73.
[Nature. 1997]Nature. 2003 Oct 16; 425(6959):686-91.
[Nature. 2003]Nucleic Acids Res. 2002 Jan 1; 30(1):73-5.
[Nucleic Acids Res. 2002]PLoS Biol. 2004 Jun; 2(6):e160.
[PLoS Biol. 2004]PLoS Biol. 2004 Jun; 2(6):e160.
[PLoS Biol. 2004]Nucleic Acids Res. 2003 Jan 1; 31(1):365-70.
[Nucleic Acids Res. 2003]Nucleic Acids Res. 2004; 32(18):5539-45.
[Nucleic Acids Res. 2004]Nucleic Acids Res. 2004 Jan 1; 32(Database issue):D476-81.
[Nucleic Acids Res. 2004]J Proteome Res. 2004 May-Jun; 3(3):495-505.
[J Proteome Res. 2004]Nat Biotechnol. 2003 Mar; 21(3):281-6.
[Nat Biotechnol. 2003]J Biol Chem. 2003 Oct 17; 278(42):41566-71.
[J Biol Chem. 2003]Cell. 2003 Nov 26; 115(5):629-40.
[Cell. 2003]Nat Biotechnol. 2003 Mar; 21(3):287-93.
[Nat Biotechnol. 2003]Nucleic Acids Res. 2002 Jan 1; 30(1):52-5.
[Nucleic Acids Res. 2002]Nucleic Acids Res. 2000 Jan 1; 28(1):155-8.
[Nucleic Acids Res. 2000]Am J Hum Genet. 2005 Jun; 76(6):1081-6.
[Am J Hum Genet. 2005]Trends Biochem Sci. 1999 Jan; 24(1):34-6.
[Trends Biochem Sci. 1999]Comput Appl Biosci. 1995 Aug; 11(4):441-7.
[Comput Appl Biosci. 1995]J Mol Biol. 2000 Aug 25; 301(4):1059-75.
[J Mol Biol. 2000]Proteomics. 2004 Jun; 4(6):1581-90.
[Proteomics. 2004]Genome Biol. 2003; 4(1):R3.
[Genome Biol. 2003]