Format

Send to

Choose Destination
BMC Bioinformatics. 2018 Apr 11;19(1):134. doi: 10.1186/s12859-018-2118-1.

Encompassing new use cases - level 3.0 of the HUPO-PSI format for molecular interactions.

Author information

1
European Bioinformatics Institute (EMBL-EBI), European Molecular Biology Laboratory, Wellcome Genome Campus, Hinxton, CB10 1SD, UK.
2
Cancer Research Center (CiC-IBMCC, CSIC/USAL/IBSAL), Consejo Superior de Investigaciones Científicas (CSIC) and Universidad de Salamanca (USAL), 37007, Salamanca, Spain.
3
School of Animal and Comparative Biomedical Sciences, University of Arizona, Tucson, USA.
4
Target Sciences, GSK, Stevenage, UK.
5
Institute of Cardiovascular Science, University College London, Rayne Building, 5 University Street, London, WC1E 6JF, UK.
6
Center for Genomic Science of IIT@SEMM, Fondazione Istituto Italiano di Tecnologia (IIT), Via Adamello 16, I-20139, Milan, Italy.
7
Department of Biology, University of Rome Tor Vergata, Via della Ricerca Scientifica, Rome, Italy.
8
Wellcome Trust Centre for Cell Biology, School of Biological Sciences, University of Edinburgh, Edinburgh, EH9 3BF, UK.
9
Cambridge Systems Biology Centre, University of Cambridge, Cambridge, UK.
10
Department of Genetics, University of Cambridge, Cambridge, UK.
11
State Key Laboratory of Proteomics, Beijing Proteome Research Center, Beijing Institute of Radiation Medicine, National Center for Protein Sciences (The PHOENIX Center, Beijing), Beijing, China.
12
Krembil Research Institute, University Health Network, Toronto, ON, M5T 2S8, Canada.
13
Departments of Medical Biophysics and Computer Science, University of Toronto, Toronto, ON, Canada.
14
EMBL Australia Group, South Australian Health and Medical Research Institute, Adelaide, Australia.
15
School of Medicine, Flinders University, Bedford Park, Adelaide, Australia.
16
Department of Biology, Ecology and Earth Sciences, Università della Calabria, Rende, Italy.
17
Univ Lyon, University Claude Bernard Lyon 1, INSA Lyon, CPE, Institute of Molecular and Supramolecular Chemistry and Biochemistry (ICBMS), UMR 5246, F-69622, Villeurbanne, France.
18
SIB Swiss Institute of Bioinformatics, Centre Medical Universitaire, 1 rue Michel Servet, 1211, Geneva 4, Switzerland.
19
UCLA-DOE Institute for Genomics and Proteomics, Los Angeles, USA.
20
TIMC-IMAG, CNRS, Univ. Grenoble Alpes, F-38000, Grenoble, France.
21
Structural and Computational Biology Unit, European Molecular Biology Laboratory (EMBL), Meyerhofstrasse 1, D-69117, Heidelberg, Germany.
22
European Bioinformatics Institute (EMBL-EBI), European Molecular Biology Laboratory, Wellcome Genome Campus, Hinxton, CB10 1SD, UK. orchard@ebi.ac.uk.

Abstract

BACKGROUND:

Systems biologists study interaction data to understand the behaviour of whole cell systems, and their environment, at a molecular level. In order to effectively achieve this goal, it is critical that researchers have high quality interaction datasets available to them, in a standard data format, and also a suite of tools with which to analyse such data and form experimentally testable hypotheses from them. The PSI-MI XML standard interchange format was initially published in 2004, and expanded in 2007 to enable the download and interchange of molecular interaction data. PSI-XML2.5 was designed to describe experimental data and to date has fulfilled this basic requirement. However, new use cases have arisen that the format cannot properly accommodate. These include data abstracted from more than one publication such as allosteric/cooperative interactions and protein complexes, dynamic interactions and the need to link kinetic and affinity data to specific mutational changes.

RESULTS:

The Molecular Interaction workgroup of the HUPO-PSI has extended the existing, well-used XML interchange format for molecular interaction data to meet new use cases and enable the capture of new data types, following extensive community consultation. PSI-MI XML3.0 expands the capabilities of the format beyond simple experimental data, with a concomitant update of the tool suite which serves this format. The format has been implemented by key data producers such as the International Molecular Exchange (IMEx) Consortium of protein interaction databases and the Complex Portal.

CONCLUSIONS:

PSI-MI XML3.0 has been developed by the data producers, data users, tool developers and database providers who constitute the PSI-MI workgroup. This group now actively supports PSI-MI XML2.5 as the main interchange format for experimental data, PSI-MI XML3.0 which additionally handles more complex data types, and the simpler, tab-delimited MITAB2.5, 2.6 and 2.7 for rapid parsing and download.

KEYWORDS:

Data standards; HUPO-PSI; Molecular interactions; PSI-MI; Protein complexes; Protein-protein interaction; XML

PMID:
29642841
PMCID:
PMC5896046
DOI:
10.1186/s12859-018-2118-1
[Indexed for MEDLINE]
Free PMC Article

Supplemental Content

Full text links

Icon for BioMed Central Icon for PubMed Central
Loading ...
Support Center