Format

Send to

Choose Destination
Proteins. 2002 Feb 1;46(2):161-70.

Singular value decomposition analysis of protein sequence alignment score data.

Author information

1
Dipartimento Scientifico Tecnologico, Facoltà di Scienze, Università di Verona, Verona, Italy. fogolari@sci.univr.it

Abstract

One of the standard tools for the analysis of data arranged in matrix form is singular value decomposition (SVD). Few applications to genomic data have been reported to date mainly for the analysis of gene expression microarray data. We review SVD properties, examine mathematical terms and assumptions implicit in the SVD formalism, and show that SVD can be applied to the analysis of matrices representing pairwise alignment scores between large sets of protein sequences. In particular, we illustrate SVD capabilities for data dimension reduction and for clustering protein sequences. A comparison is performed between SVD-generated clusters of proteins and annotation reported in the SWISS-PROT Database for a set of protein sequences forming the calycin superfamily, entailing all entries corresponding to the lipocalin, cytosolic fatty acid-binding protein, and avidin-streptavidin Prosite patterns.

PMID:
11807944
DOI:
10.1002/prot.10032
[Indexed for MEDLINE]

Supplemental Content

Full text links

Icon for Wiley
Loading ...
Support Center