Darwin Core: an evolving community-developed biodiversity data standard

PLoS One. 2012;7(1):e29715. doi: 10.1371/journal.pone.0029715. Epub 2012 Jan 6.

Abstract

Biodiversity data derive from myriad sources stored in various formats on many distinct hardware and software platforms. An essential step towards understanding global patterns of biodiversity is to provide a standardized view of these heterogeneous data sources to improve interoperability. Fundamental to this advance are definitions of common terms. This paper describes the evolution and development of Darwin Core, a data standard for publishing and integrating biodiversity information. We focus on the categories of terms that define the standard, differences between simple and relational Darwin Core, how the standard has been implemented, and the community processes that are essential for maintenance and growth of the standard. We present case-study extensions of the Darwin Core into new research communities, including metagenomics and genetic resources. We close by showing how Darwin Core records are integrated to create new knowledge products documenting species distributions and changes due to environmental perturbations.

Publication types

  • Research Support, Non-U.S. Gov't
  • Research Support, U.S. Gov't, Non-P.H.S.

MeSH terms

  • Animals
  • Biodiversity*
  • Biota*
  • Data Collection / standards
  • Databases, Factual / standards*
  • Environment
  • Evolution, Molecular
  • Genetic Fitness / genetics
  • Genetic Fitness / physiology*
  • Guidelines as Topic
  • Humans
  • Information Storage and Retrieval / standards
  • Models, Biological
  • Selection, Genetic*
  • Validation Studies as Topic