Control-free calling of copy number alterations in deep-sequencing data using GC-content normalization

Bioinformatics. 2011 Jan 15;27(2):268-9. doi: 10.1093/bioinformatics/btq635. Epub 2010 Nov 15.

Abstract

Summary: We present a tool for control-free copy number alteration (CNA) detection using deep-sequencing data, particularly useful for cancer studies. The tool deals with two frequent problems in the analysis of cancer deep-sequencing data: absence of control sample and possible polyploidy of cancer cells. FREEC (control-FREE Copy number caller) automatically normalizes and segments copy number profiles (CNPs) and calls CNAs. If ploidy is known, FREEC assigns absolute copy number to each predicted CNA. To normalize raw CNPs, the user can provide a control dataset if available; otherwise GC content is used. We demonstrate that for Illumina single-end, mate-pair or paired-end sequencing, GC-contentr normalization provides smooth profiles that can be further segmented and analyzed in order to predict CNAs.

Availability: Source code and sample data are available at http://bioinfo-out.curie.fr/projects/freec/.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Algorithms
  • Base Composition
  • Cell Line, Tumor
  • Cytosine / analysis
  • DNA Copy Number Variations*
  • Genomics / methods*
  • Guanine / analysis
  • High-Throughput Nucleotide Sequencing
  • Humans
  • Neoplasms / genetics*
  • Software*

Substances

  • Guanine
  • Cytosine