Send to

Choose Destination
See comment in PubMed Commons below
Methods Mol Biol. 2013;930:527-47. doi: 10.1007/978-1-62703-059-5_22.

Principal components analysis.

Author information

  • 1AG Bioinformatics, University of Potsdam, Potsdam-Golm, Germany.


Principal components analysis (PCA) is a standard tool in multivariate data analysis to reduce the number of dimensions, while retaining as much as possible of the data's variation. Instead of investigating thousands of original variables, the first few components containing the majority of the data's variation are explored. The visualization and statistical analysis of these new variables, the principal components, can help to find similarities and differences between samples. Important original variables that are the major contributors to the first few components can be discovered as well.This chapter seeks to deliver a conceptual understanding of PCA as well as a mathematical description. We describe how PCA can be used to analyze different datasets, and we include practical code examples. Possible shortcomings of the methodology and ways to overcome these problems are also discussed.

[PubMed - indexed for MEDLINE]
PubMed Commons home

PubMed Commons

How to join PubMed Commons

    Supplemental Content

    Full text links

    Icon for Springer
    Loading ...
    Support Center