Format

Send to

Choose Destination
BMC Bioinformatics. 2016 Jun 21;17:247. doi: 10.1186/s12859-016-1122-6.

An integrative imputation method based on multi-omics datasets.

Lin D1,2, Zhang J2,3, Li J1,2, Xu C2,3, Deng HW2,3, Wang YP4,5,6.

Author information

1
Department of Biomedical Engineering, Tulane University, New Orleans, LA, 70118, USA.
2
Center for Bioinformatics and Genomics, Tulane University, New Orleans, LA, 70112, USA.
3
Department of Biostatistics and Bioinformatics, Tulane University, New Orleans, LA, 70112, USA.
4
Department of Biomedical Engineering, Tulane University, New Orleans, LA, 70118, USA. wyp@tulane.edu.
5
Center for Bioinformatics and Genomics, Tulane University, New Orleans, LA, 70112, USA. wyp@tulane.edu.
6
Department of Biostatistics and Bioinformatics, Tulane University, New Orleans, LA, 70112, USA. wyp@tulane.edu.

Abstract

BACKGROUND:

Integrative analysis of multi-omics data is becoming increasingly important to unravel functional mechanisms of complex diseases. However, the currently available multi-omics datasets inevitably suffer from missing values due to technical limitations and various constrains in experiments. These missing values severely hinder integrative analysis of multi-omics data. Current imputation methods mainly focus on using single omics data while ignoring biological interconnections and information imbedded in multi-omics data sets.

RESULTS:

In this study, a novel multi-omics imputation method was proposed to integrate multiple correlated omics datasets for improving the imputation accuracy. Our method was designed to: 1) combine the estimates of missing value from individual omics data itself as well as from other omics, and 2) simultaneously impute multiple missing omics datasets by an iterative algorithm. We compared our method with five imputation methods using single omics data at different noise levels, sample sizes and data missing rates. The results demonstrated the advantage and efficiency of our method, consistently in terms of the imputation error and the recovery of mRNA-miRNA network structure.

CONCLUSIONS:

We concluded that our proposed imputation method can utilize more biological information to minimize the imputation error and thus can improve the performance of downstream analysis such as genetic regulatory network construction.

KEYWORDS:

Ensemble learning; Imputation; Integrative analysis; Multi-omics data

PMID:
27329642
PMCID:
PMC4915152
DOI:
10.1186/s12859-016-1122-6
[Indexed for MEDLINE]
Free PMC Article

Supplemental Content

Full text links

Icon for BioMed Central Icon for PubMed Central
Loading ...
Support Center