A Systems Biology Interpretation of Array Comparative Genomic Hybridization (aCGH) Data through Phylogenetics

OMICS. 2016 Mar;20(3):169-79. doi: 10.1089/omi.2015.0184.

Abstract

Array Comparative Genomic Hybridization (aCGH) is a rapid screening technique for detecting gene deletions and duplications. It provides an overview of chromosomal aberrations across the entire genome of a tumor without the need for cell culturing. However, the heterogeneity of aCGH data confounds existing methods of data analysis. Analysis of aCGH data from a systems biology perspective, or in the context of total aberrations, is largely absent from the published literature. We present here a novel alternative for the functional analysis of aCGH data using the phylogenetic paradigm, which is well suited to high-dimensional datasets of a heterogeneous nature but has not been widely adopted for aCGH data. Maximum parsimony phylogenetic analysis sorts genetic data by finding the simplest representation of the data on a cladogram, a graphical evolutionary tree, thus providing a powerful and efficient method for aCGH data analysis. For example, the cladogram models the multiphasic changes in the cancer genome and identifies shared early mutations in disease progression, providing a simple means of interpreting aCGH data. As such, applying maximum parsimony phylogenetic analysis to aCGH results allows driver gene aberrations to be differentiated from passenger gene aberrations in cancer specimens. In addition to offering a novel methodology for analyzing aCGH results, we present a software suite that we wrote to carry out the analysis. In a broader context, we wish to underscore that phylogenetic analysis of aCGH data is a non-parametric method that circumvents the pitfalls and frustrations of standard analytical techniques that rely on parametric statistics. Organizing the data in a cladogram, as explained in this research article, provides insight into the aberrations common to the disease, as well as the disease subtypes and the shared aberrations (synapomorphies) that define each subtype. Hence, we report the method and make the software suite publicly and freely available at http://software.phylomcs.com so that researchers can test alternative and innovative approaches to the analysis of aCGH data.
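
The parsimony criterion described in the abstract can be illustrated with a minimal sketch. This is not the authors' software suite. It assumes that aCGH calls have already been reduced to binary states per genomic region (1 = aberrant, 0 = normal), that a normal specimen serves as the reference, and that candidate cladograms are supplied by hand as nested pairs. The specimen names, the toy matrix, and the function names are all hypothetical. The sketch scores each candidate tree with Fitch's small-parsimony algorithm (fewer state changes means a simpler explanation of the data) and lists aberrations shared by every member of a clade and absent elsewhere, a deliberately strict stand-in for synapomorphies that ignores reversals.

```python
from typing import Dict, List, Set, Tuple, Union

# A cladogram is either a specimen name (leaf) or a pair of subtrees.
Tree = Union[str, Tuple["Tree", "Tree"]]

# Hypothetical toy data: one row per specimen, one column per genomic region.
# 1 = aberrant (gain or loss), 0 = normal. Not taken from the paper.
PROFILES: Dict[str, List[int]] = {
    "normal": [0, 0, 0, 0],
    "tumorA": [1, 1, 0, 0],
    "tumorB": [1, 1, 0, 1],
    "tumorC": [1, 0, 1, 0],
}
N_REGIONS = len(PROFILES["normal"])


def fitch(tree: Tree, region: int) -> Tuple[Set[int], int]:
    """Return (possible ancestral states, minimum changes) for one region."""
    if isinstance(tree, str):
        return {PROFILES[tree][region]}, 0
    left_states, left_cost = fitch(tree[0], region)
    right_states, right_cost = fitch(tree[1], region)
    shared = left_states & right_states
    if shared:
        # Children agree on at least one state: no extra change needed here.
        return shared, left_cost + right_cost
    # Children disagree: one state change is required on this branch pair.
    return left_states | right_states, left_cost + right_cost + 1


def parsimony_score(tree: Tree) -> int:
    """Total minimum number of state changes over all regions."""
    return sum(fitch(tree, r)[1] for r in range(N_REGIONS))


def clade_synapomorphies(members: Set[str]) -> List[int]:
    """Regions aberrant in every clade member and normal in all others."""
    others = set(PROFILES) - members
    return [
        r for r in range(N_REGIONS)
        if all(PROFILES[m][r] == 1 for m in members)
        and all(PROFILES[o][r] == 0 for o in others)
    ]


# Two hand-written candidate cladograms over the same specimens.
TREE_1: Tree = ("normal", ("tumorC", ("tumorA", "tumorB")))
TREE_2: Tree = ("normal", ("tumorA", ("tumorB", "tumorC")))

print("TREE_1 changes:", parsimony_score(TREE_1))
print("TREE_2 changes:", parsimony_score(TREE_2))
print("shared by all tumors:", clade_synapomorphies({"tumorA", "tumorB", "tumorC"}))
print("shared by A+B only:", clade_synapomorphies({"tumorA", "tumorB"}))
```

On this toy matrix, TREE_1 needs fewer changes than TREE_2, so it is the more parsimonious of the two. Region 0 is aberrant in all tumors, the kind of shared early aberration the abstract describes, while region 1 marks the tumorA/tumorB subgroup. A real analysis would search over many tree topologies rather than compare two by hand.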

Publication types

  • Research Support, N.I.H., Intramural

MeSH terms

  • Algorithms
  • Chromosome Aberrations
  • Comparative Genomic Hybridization / statistics & numerical data*
  • Gene Expression Regulation, Neoplastic
  • Genome, Human*
  • Humans
  • Mutation
  • Neoplasm Proteins / genetics
  • Neoplasms / genetics*
  • Neoplasms / pathology
  • Oligonucleotide Array Sequence Analysis
  • Phylogeny*
  • Software*
  • Systems Biology / methods*

Substances

  • Neoplasm Proteins