Hierarchical clustering and multidimensional scaling analysis. The top 96 genes as ranked by the ANN models were used for the analysis. a, Multidimensional scaling analysis. Shown here are two projections of the MDS plot of the training samples. EWS are depicted as yellow circles, RMS as red, BL as blue and NB as green. The samples clustered closely according to the 4 different cancer categories. b, Hierarchical clustering of the samples and genes. Each row represents one of the 96 cDNA clones and each column a separate sample. A pseudo-colored representation of the relative red intensity is shown such that a red color indicates high expression and green color low expression, with scale shown below. On the right are the IMAGE id., gene symbol, class in which the gene is highly expressed (see Supplementary Methods), and the ANN rank. *, genes that have not been reported to be associated with these cancers. c, Enlargement of the hierarchical clustering dendrogram of the samples in b. All 63 training and the 20 test SRBCTs correctly clustered within their diagnostic categories. In both cases where two samples were derived from the same cell line, BL-C2 & C4, and NB-C2 and C7, each mapped adjacent to one another in the same cluster. The scale shows the Pearson correlation coefficient used to construct the dendrogram. The Pearson correlation cutoff was 0.54, when the samples clustered into the four diagnostic categories.