Format

Send to

Choose Destination
Anal Chim Acta. 2016 May 12;920:18-28. doi: 10.1016/j.aca.2016.03.042. Epub 2016 Mar 29.

Exploring Omics data from designed experiments using analysis of variance multiblock Orthogonal Partial Least Squares.

Author information

1
School of Pharmaceutical Sciences, University of Geneva, University of Lausanne, Geneva, Switzerland. Electronic address: julien.boccard@unige.ch.
2
School of Pharmaceutical Sciences, University of Geneva, University of Lausanne, Geneva, Switzerland.

Abstract

Many experimental factors may have an impact on chemical or biological systems. A thorough investigation of the potential effects and interactions between the factors is made possible by rationally planning the trials using systematic procedures, i.e. design of experiments. However, assessing factors' influences remains often a challenging task when dealing with hundreds to thousands of correlated variables, whereas only a limited number of samples is available. In that context, most of the existing strategies involve the ANOVA-based partitioning of sources of variation and the separate analysis of ANOVA submatrices using multivariate methods, to account for both the intrinsic characteristics of the data and the study design. However, these approaches lack the ability to summarise the data using a single model and remain somewhat limited for detecting and interpreting subtle perturbations hidden in complex Omics datasets. In the present work, a supervised multiblock algorithm based on the Orthogonal Partial Least Squares (OPLS) framework, is proposed for the joint analysis of ANOVA submatrices. This strategy has several advantages: (i) the evaluation of a unique multiblock model accounting for all sources of variation; (ii) the computation of a robust estimator (goodness of fit) for assessing the ANOVA decomposition reliability; (iii) the investigation of an effect-to-residuals ratio to quickly evaluate the relative importance of each effect and (iv) an easy interpretation of the model with appropriate outputs. Case studies from metabolomics and transcriptomics, highlighting the ability of the method to handle Omics data obtained from fixed-effects full factorial designs, are proposed for illustration purposes. Signal variations are easily related to main effects or interaction terms, while relevant biochemical information can be derived from the models.

KEYWORDS:

Analysis of variance; Chemometrics; Design of experiments; Multiblock analysis; Omics; Orthogonal Partial Least Squares

PMID:
27114219
DOI:
10.1016/j.aca.2016.03.042
[Indexed for MEDLINE]

Supplemental Content

Full text links

Icon for Elsevier Science
Loading ...
Support Center