Format

Send to

Choose Destination
Bioinformatics. 2016 Jun 15;32(12):i101-i110. doi: 10.1093/bioinformatics/btw282.

Comparative analyses of population-scale phenomic data in electronic medical records reveal race-specific disease networks.

Author information

1
Department of Genetics and Genomic Sciences Icahn Institute for Genomics and Multiscale Biology Harris Center for Precision Wellness, Icahn School of Medicine at Mount Sinai, New York City, NY 10029, USA.
2
Department of Genetics and Genomic Sciences Icahn Institute for Genomics and Multiscale Biology.
3
Department of Biomedical Informatics, Harvard Medical School, Boston, 02115 MA, USA.
4
Department of Genetics and Genomic Sciences Icahn Institute for Genomics and Multiscale Biology Harris Center for Precision Wellness, Icahn School of Medicine at Mount Sinai, New York City, NY 10029, USA Department of Population Health Science and Policy, New York City, NY 10029, USA.

Abstract

MOTIVATION:

Underrepresentation of racial groups represents an important challenge and major gap in phenomics research. Most of the current human phenomics research is based primarily on European populations; hence it is an important challenge to expand it to consider other population groups. One approach is to utilize data from EMR databases that contain patient data from diverse demographics and ancestries. The implications of this racial underrepresentation of data can be profound regarding effects on the healthcare delivery and actionability. To the best of our knowledge, our work is the first attempt to perform comparative, population-scale analyses of disease networks across three different populations, namely Caucasian (EA), African American (AA) and Hispanic/Latino (HL).

RESULTS:

We compared susceptibility profiles and temporal connectivity patterns for 1988 diseases and 37 282 disease pairs represented in a clinical population of 1 025 573 patients. Accordingly, we revealed appreciable differences in disease susceptibility, temporal patterns, network structure and underlying disease connections between EA, AA and HL populations. We found 2158 significantly comorbid diseases for the EA cohort, 3265 for AA and 672 for HL. We further outlined key disease pair associations unique to each population as well as categorical enrichments of these pairs. Finally, we identified 51 key 'hub' diseases that are the focal points in the race-centric networks and of particular clinical importance. Incorporating race-specific disease comorbidity patterns will produce a more accurate and complete picture of the disease landscape overall and could support more precise understanding of disease relationships and patient management towards improved clinical outcomes.

CONTACTS:

rong.chen@mssm.edu or joel.dudley@mssm.edu

SUPPLEMENTARY INFORMATION:

Supplementary data are available at Bioinformatics online.

PMID:
27307606
PMCID:
PMC4908366
DOI:
10.1093/bioinformatics/btw282
[Indexed for MEDLINE]
Free PMC Article

Supplemental Content

Full text links

Icon for Silverchair Information Systems Icon for PubMed Central
Loading ...
Support Center