Format

Send to

Choose Destination
Hum Mutat. 2015 Oct;36(10):989-97. doi: 10.1002/humu.22848. Epub 2015 Aug 20.

The genomic birthday paradox: how much is enough?

Author information

1
Institute for Medical and Human Genetics, Charité-Universitätsmedizin Berlin, Berlin 13353, Germany.
2
Berlin Center for Regenerative Therapies (BCRT), Charité-Universitätsmedizin Berlin, Berlin 13353, Germany.
3
Department of Computer Science, University of Toronto, Toronto, Ontario, M5S 3G4, Canada.
4
Genetics and Genome Biology, The Hospital for Sick Children, Toronto, Ontario, M5G 0A4, Canada.
5
Max Planck Institute for Molecular Genetics, Berlin 14195, Germany.
6
Institute for Bioinformatics, Department of Mathematics and Computer Science, Freie Universität Berlin, Berlin 14195, Germany.

Abstract

Genomic matchmaking databases (GMDs) allow participants to submit genomic and phenotypic data with the goal of identifying previously uncharacterized disease-associated genes by "matching" to other comparable cases. Current estimates suggest that there are at least 3,000 Mendelian disease-associated genes that have not yet been characterized as such, but the true number may be substantially higher. Therefore, GMDs are addressing a pressing medical need, and it is important to ask how they should be designed and how much data they should strive to contain in order to identify a certain number of these genes. In this work, we argue that genomic matchmaking has similarities to the so-called "birthday paradox," which refers to the observation that within a group of just 23 persons, two people will have the same birthday with probability greater than 50%. We develop a series of simulations to provide a rough estimate of the number of cases required and to explore the influence of parameters such as genetic heterogeneity, mode of inheritance, background variation, precision of phenotypic descriptions, disease prevalence, and the accuracy of bioinformatics pathogenicity prediction programs on the performance of genomic matchmaking.

KEYWORDS:

database; exome; genomic matchmaking; matchmaker exchange; phenotype

PMID:
26239817
DOI:
10.1002/humu.22848
[Indexed for MEDLINE]

Supplemental Content

Full text links

Icon for Wiley
Loading ...
Support Center