Send to

Choose Destination
PLoS One. 2018 Jul 24;13(7):e0200926. doi: 10.1371/journal.pone.0200926. eCollection 2018.

Fostering population-based cohort data discovery: The Maelstrom Research cataloguing toolkit.

Author information

Research Institute of the McGill University Health Centre, Montreal, Quebec, Canada.
Swiss Tropical and Public Health Institute, Basel, Switzerland.
University of Basel, Basel, Switzerland.
Research Center of the Sainte-Justine University Hospital, Montreal, Quebec, Canada.



The lack of accessible and structured documentation creates major barriers for investigators interested in understanding, properly interpreting and analyzing cohort data and biological samples. Providing the scientific community with open information is essential to optimize usage of these resources. A cataloguing toolkit is proposed by Maelstrom Research to answer these needs and support the creation of comprehensive and user-friendly study- and network-specific web-based metadata catalogues.


Development of the Maelstrom Research cataloguing toolkit was initiated in 2004. It was supported by the exploration of existing catalogues and standards, and guided by input from partner initiatives having used or pilot tested incremental versions of the toolkit.


The cataloguing toolkit is built upon two main components: a metadata model and a suite of open-source software applications. The model sets out specific fields to describe study profiles; characteristics of the subpopulations of participants; timing and design of data collection events; and datasets/variables collected at each data collection event. It also includes the possibility to annotate variables with different classification schemes. When combined, the model and software support implementation of study and variable catalogues and provide a powerful search engine to facilitate data discovery.


The Maelstrom Research cataloguing toolkit already serves several national and international initiatives and the suite of software is available to new initiatives through the Maelstrom Research website. With the support of new and existing partners, we hope to ensure regular improvements of the toolkit.

Conflict of interest statement

I have read the journal's policy and the authors of this manuscript have the following competing interests: YM owns Epigeny, a company that offers services based on the Opal and Mica software described in this article. This does not alter our adherence to PLOS ONE policies on sharing data and materials.

Supplemental Content

Full text links

Icon for Public Library of Science Icon for PubMed Central
Loading ...
Support Center