PLoS One. 2013;8(2):e55760. doi: 10.1371/journal.pone.0055760. Epub 2013 Feb 5.

Yahtzee: an anonymized group level matching procedure.

Author information

Political Science Department and Medical Genetics Division, University of California San Diego, La Jolla, California, United States of America.

Abstract

Researchers often face the problem of needing to protect the privacy of subjects while also needing to integrate data that contains personal information from diverse data sources. The advent of computational social science and the enormous amount of data about people that is being collected makes protecting the privacy of research subjects ever more important. However, strict privacy procedures can hinder the process of joining diverse sources of data that contain information about specific individual behaviors. In this paper we present a procedure to keep information about specific individuals from being "leaked" or shared in either direction between two sources of data without need of a trusted third party. To achieve this goal, we randomly assign individuals to anonymous groups before combining the anonymized information between the two sources of data. We refer to this method as the Yahtzee procedure, and show that it performs as predicted by theoretical analysis when we apply it to data from Facebook and public voter records.
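The general idea of group-level matching described in the abstract can be illustrated with a small sketch. This is not the authors' Yahtzee procedure, whose details are not given here. It is a simplified stand-in that assumes both data holders share a secret key and derive each person's group from a keyed hash (HMAC) of a common identifier, then exchange only per-group counts and totals. The function names, the shared key, the identifiers, and the sample values are all hypothetical.

```python
import hashlib
import hmac
from collections import defaultdict
from typing import Dict, Tuple


def assign_group(identifier: str, secret: bytes, n_groups: int) -> int:
    """Map an identifier to a pseudo-random group number in [0, n_groups).

    Assumption: a keyed hash stands in for the random group assignment;
    the paper's actual assignment mechanism may differ.
    """
    digest = hmac.new(secret, identifier.encode("utf-8"), hashlib.sha256).digest()
    return int.from_bytes(digest[:8], "big") % n_groups


def group_totals(
    records: Dict[str, float], secret: bytes, n_groups: int
) -> Dict[int, Tuple[int, float]]:
    """Aggregate one source's values by group; returns {group: (count, sum)}."""
    totals = defaultdict(lambda: [0, 0.0])
    for identifier, value in records.items():
        group = assign_group(identifier, secret, n_groups)
        totals[group][0] += 1
        totals[group][1] += value
    return {group: (count, total) for group, (count, total) in totals.items()}


def join_groups(
    a: Dict[int, Tuple[int, float]], b: Dict[int, Tuple[int, float]]
) -> Dict[int, Tuple[Tuple[int, float], Tuple[int, float]]]:
    """Combine two sources at the group level; no individual rows are exchanged."""
    return {group: (a[group], b[group]) for group in a.keys() & b.keys()}


# Hypothetical data: each source holds a different attribute for the same people.
secret = b"shared-secret-agreed-out-of-band"
source_a = {"alice": 1.0, "bob": 0.0, "carol": 1.0, "dave": 1.0}
source_b = {"alice": 3.0, "bob": 5.0, "carol": 2.0, "dave": 4.0}

combined = join_groups(
    group_totals(source_a, secret, n_groups=2),
    group_totals(source_b, secret, n_groups=2),
)
```

In this sketch the analyst sees only group counts and sums from each side. How much privacy that provides depends on group size and on how the assignment is generated, which is the kind of trade-off the paper analyzes.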

PMID: 23441156
PMCID: PMC3564933
DOI: 10.1371/journal.pone.0055760
[Indexed for MEDLINE]
Free PMC Article