Format

Send to

Choose Destination
See comment in PubMed Commons below
BMC Med Genomics. 2016 Oct 13;9(1):63.

Protecting genomic data analytics in the cloud: state of the art and opportunities.

Author information

  • 1School of Informatics and Computing, Indiana University, Bloomington, IN, USA. hatang@indiana.edu.
  • 2Department of Biomedical Informatics, University of California San Diego, La Jolla, CA, USA.
  • 3School of Informatics and Computing, Indiana University, Bloomington, IN, USA.
  • 4National Human Genome Research Institute, Rockville, MD, USA.
  • 5School of Law, University of San Diego, San Diego, CA, USA.
  • 6Microsoft Research, Redmond, WA, USA.
  • 7Department of Biomedical Informatics, School of Medicine, Vanderbilt University, Nashville, TN, USA.
  • 8The J. Craig Venter Institute, La Jolla, CA, USA.
  • 9Department of Mathematics and Computer Science, Emory University, Atlanta, GA, USA.

Abstract

The outsourcing of genomic data into public cloud computing settings raises concerns over privacy and security. Significant advancements in secure computation methods have emerged over the past several years, but such techniques need to be rigorously evaluated for their ability to support the analysis of human genomic data in an efficient and cost-effective manner. With respect to public cloud environments, there are concerns about the inadvertent exposure of human genomic data to unauthorized users. In analyses involving multiple institutions, there is additional concern about data being used beyond agreed research scope and being prcoessed in untrused computational environments, which may not satisfy institutional policies. To systematically investigate these issues, the NIH-funded National Center for Biomedical Computing iDASH (integrating Data for Analysis, 'anonymization' and SHaring) hosted the second Critical Assessment of Data Privacy and Protection competition to assess the capacity of cryptographic technologies for protecting computation over human genomes in the cloud and promoting cross-institutional collaboration. Data scientists were challenged to design and engineer practical algorithms for secure outsourcing of genome computation tasks in working software, whereby analyses are performed only on encrypted data. They were also challenged to develop approaches to enable secure collaboration on data from genomic studies generated by multiple organizations (e.g., medical centers) to jointly compute aggregate statistics without sharing individual-level records. The results of the competition indicated that secure computation techniques can enable comparative analysis of human genomes, but greater efficiency (in terms of compute time and memory utilization) are needed before they are sufficiently practical for real world environments.

PMID:
27733153
PMCID:
PMC5062944
DOI:
10.1186/s12920-016-0224-3
[PubMed - in process]
Free PMC Article
PubMed Commons home

PubMed Commons

0 comments
How to join PubMed Commons

    Supplemental Content

    Full text links

    Icon for PubMed Central
    Loading ...
    Support Center