Format

Send to

Choose Destination
Database (Oxford). 2016 Mar 15;2016. pii: baw001. doi: 10.1093/database/baw001. Print 2016.

Principles of metadata organization at the ENCODE data coordination center.

Author information

1
Department of Genetics, Stanford University School of Medicine Department of Genetics, Stanford, CA, USA.
2
Center for Biomolecular Science and Engineering Santa Cruz, University of California, Santa Cruz, CA, USA.
3
Department of Genetics, Stanford University School of Medicine Department of Genetics, Stanford, CA, USA cherry@stanford.edu.

Abstract

The Encyclopedia of DNA Elements (ENCODE) Data Coordinating Center (DCC) is responsible for organizing, describing and providing access to the diverse data generated by the ENCODE project. The description of these data, known as metadata, includes the biological sample used as input, the protocols and assays performed on these samples, the data files generated from the results and the computational methods used to analyze the data. Here, we outline the principles and philosophy used to define the ENCODE metadata in order to create a metadata standard that can be applied to diverse assays and multiple genomic projects. In addition, we present how the data are validated and used by the ENCODE DCC in creating the ENCODE Portal (https://www.encodeproject.org/). Database URL: www.encodeproject.org.

PMID:
26980513
PMCID:
PMC4792520
DOI:
10.1093/database/baw001
[Indexed for MEDLINE]
Free PMC Article

Supplemental Content

Full text links

Icon for Silverchair Information Systems Icon for PubMed Central
Loading ...
Support Center