Significantly clustered 5′ UTR motifs in the BioGRID human protein–protein interaction network. (
A) LESMoN identified 200 motif family representatives with clustering
P-values < 10
−6 that are displayed in a hierarchical clustering tree. Conservation fold enrichment, clustering and GO enrichment
P-values for each motif are color-coded. GO enrichment
P-values were computed with Ontologizer () using a Fisher’s exact test. The 36 GO terms shown here are those that are significantly (
P-value < 10
−7) associated with the most motifs, considering only terms that include

500 human genes. (
B) The family representative motifs with a conservation fold enrichment

2.25 are shown as sequence logos (generated by Weblogo ()), where nucleotide heights are proportional to their frequencies in 5′ UTRs. Each represented motif is given an identification number (from 1 to 17). (
C) For these 17 motifs, the motif and its reverse complement occurrences in promoters, 5′ UTRs and coding exons in actual and locally randomized sequences are shown.