Format

Send to

Choose Destination
mSystems. 2016 Oct 18;1(5). pii: e00062-16. eCollection 2016 Sep-Oct.

mockrobiota: a Public Resource for Microbiome Bioinformatics Benchmarking.

Author information

1
Center for Microbial Genetics and Genomics, Northern Arizona University, Flagstaff, Arizona, USA.
2
Department of Biology, Tufts University, Medford, Massachusetts, USA.
3
Department of Microbiology & Immunology Department, Microbiome and Disease Tolerance Centre, McGill University, Montreal, Quebec, Canada.
4
Division of Biological Sciences, University of California San Diego, La Jolla, California, USA.
5
Department of Microbiology and Immunology, GW Hooper Foundation, University of California, San Francisco, San Francisco, California, USA.
6
Department of Computer Science and Engineering, University of California San Diego, La Jolla, California, USA; Department of Pediatrics, University of California San Diego, La Jolla, California, USA; Center for Microbiome Innovation, University of California San Diego, La Jolla, California, USA.
7
Center for Microbial Genetics and Genomics, Northern Arizona University, Flagstaff, Arizona, USA; Department of Biological Sciences, Northern Arizona University, Flagstaff, Arizona, USA.

Abstract

Mock communities are an important tool for validating, optimizing, and comparing bioinformatics methods for microbial community analysis. We present mockrobiota, a public resource for sharing, validating, and documenting mock community data resources, available at http://caporaso-lab.github.io/mockrobiota/. The materials contained in mockrobiota include data set and sample metadata, expected composition data (taxonomy or gene annotations or reference sequences for mock community members), and links to raw data (e.g., raw sequence data) for each mock community data set. mockrobiota does not supply physical sample materials directly, but the data set metadata included for each mock community indicate whether physical sample materials are available. At the time of this writing, mockrobiota contains 11 mock community data sets with known species compositions, including bacterial, archaeal, and eukaryotic mock communities, analyzed by high-throughput marker gene sequencing. IMPORTANCE The availability of standard and public mock community data will facilitate ongoing method optimizations, comparisons across studies that share source data, and greater transparency and access and eliminate redundancy. These are also valuable resources for bioinformatics teaching and training. This dynamic resource is intended to expand and evolve to meet the changing needs of the omics community.

Supplemental Content

Full text links

Icon for American Society for Microbiology Icon for PubMed Central
Loading ...
Support Center