Distribution of correlations (X-axes) between genes (Y-axes) and the closest sCNV marker (in cis) in TU (A) and AN (B) tissues. The distribution of the real data (blue solid lines) was compared to permutation of the gene to marker connection (green dashed lines). No significant associations were seen in AN above what would be expected by chance. In TU there was a pronounced bias to positive correlations. C. Distribution of correlations between all genes (Y-axis) and all sCNV markers, shown linearly by chromosome location through the genome (X-axis). Chromosome boundaries are indicated by the vertical green lines and are numbered. Using a correlation cut-off of >0.3 (p<4.7e-4, FDR <0.02), the count of all genes (not including cis genes, green), positively correlated trans genes (red), or negatively correlated trans genes (blue) is indicated for each marker (trans here was defined as genes and sCNV markers falling on separate chromosomes). Several hotspots were apparent in which many genes were associated with sCNV at a particular locus, especially for regions on chromosomes 1, 2, 6, 7, 12, 14 and 20 (see text for additional discussion). D. Genes associated with sCNV hotspots in HCC and cell lines significantly overlap. Hotspot sCNV markers were selected by identifying regions associated with >500 genes (Pearson correlation coefficient>0.3) and then selecting the single top marker per chromosome. The genes associated with each hotspot marker were compared (blue circles, size equivalent to number of genes) and significant overlaps (Fishers Exact Test p<1e-6) are shown as edges connecting pairs of nodes. A similar analysis was performed on a collection of cancer cell lines (CCL, green circles). In this case a smaller number of total genes were measured (23,404 vs 37,585), so an equivalent fraction of the total genes (>370) significantly associated with a sCNV marker was required. The thickness of the edges connecting the nodes represents the enrichment of the overlapping genes in comparison to that expected by chance (observed overlap divided by expected; enrichment <5 fold – thin line, >5 – thick lines). As shown the genes associated with hotspots were shared within datasets. For example, the genes in HCC linked to the hotspots of chromosomes 2, 6, 12, 14 and 20 significantly overlap each other. Hotspot overlaps between CLL and HCC involving the same genomic regions are highlighted in red (chromosomes 1, 14 and 20).