Figure 4A Centered Gaussian Probability Distribution of Unit Variance (Black), Corresponding to the Random Distribution Obtained in the Null Models, and the Values Actually Observed in Our Clusters (Arrows)
Values reported on the abscissae are z-scores, i.e., the deviations to the mean normalized by the standard deviation.
Red solid and blue dashed arrows correspond to E. coli K12 and B. subtilis, respectively. Short arrows point to the values of the z-scores that we measure for the fraction of pairs of genes within a common operon and belonging to the same cluster.
Long arrows refer to the same quantities for pairs of genes within a common metabolic pathway.
Note that, as the Gaussian distribution is meant to show, our z-scores are highly significant, e.g., zscore, ≥ 8 ↦ probability = 6 × 10−16 to occur by chance. See also that values of the z-scores previously obtained, using general-purpose clustering methods, were much smaller: 5.30 and 3.29, for operons and metabolic pathways, respectively.