(a) Five hundred and ninety-seven host genes were associated with HPV status in HNSC, at a false discovery rate (q) < 0.05 and with an absolute log2 median expression ratio > 2. Known cancer genes in the Cancer Gene Census are indicated. The colour code indicates log2-transformed mRNA levels relative to the overall median. (b) Principal component analysis (PCA) of tumour mRNA expression profiles in CESC, HNSC and BLCA. Although there were systematic expression differences between cancer types, HPV-positive tumours clustered together regardless of type. (c) HPV-positive CESC tumours were subdivided by their viral gene expression patterns: E7-, E6/E7- and E4/E5/E7-expressing tumour subsets were each tested for differential expression of host genes relative to the remaining samples. One hundred and twenty host genes were differentially expressed in the E6/E7 subset, using the criteria described in (a). (d) Validation of the E6/E7 signature. Most of the 120 genes were consistently induced or repressed in E6/E7 compared with E7 samples, including when only HPV16-positive (red) or HPV18-positive (green) tumours were considered. In addition, most genes in the signature showed consistent expression changes in HNSC E6/E7 compared with E6 tumours (blue). E6*, truncated and probably non-functional E6 open reading frame.
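
The selection rule used in (a) and (c), q < 0.05 together with an absolute log2 median expression ratio > 2, can be illustrated with a minimal Python sketch. It is not the authors' pipeline: the function names, the pseudocount added before taking the ratio, and the gene labels are assumptions made for illustration, and the q-values are assumed to have been computed beforehand by whichever multiple-testing procedure was used.

```python
import math
from statistics import median


def log2_median_ratio(group_a, group_b, pseudocount=1.0):
    """log2 of the ratio of median expression in group_a versus group_b.

    The pseudocount is an assumption added here to avoid division by zero;
    the legend does not say whether one was used.
    """
    return math.log2((median(group_a) + pseudocount) /
                     (median(group_b) + pseudocount))


def select_genes(stats, q_max=0.05, min_abs_log2=2.0):
    """Return genes passing both thresholds, sorted by name.

    `stats` maps a gene name to a (q_value, log2_ratio) pair.
    Both comparisons are strict, matching "q < 0.05" and "ratio > 2".
    """
    return sorted(
        gene
        for gene, (q_value, log2_ratio) in stats.items()
        if q_value < q_max and abs(log2_ratio) > min_abs_log2
    )


# Hypothetical expression values for one gene in two tumour groups.
ratio = log2_median_ratio([63.0], [7.0])

# Hypothetical per-gene statistics: (q-value, log2 median ratio).
example_stats = {
    "GENE_A": (0.001, ratio),  # significant and strongly induced
    "GENE_B": (0.2, 4.0),      # large change but not significant
    "GENE_C": (0.01, -2.5),    # significant and strongly repressed
    "GENE_D": (0.01, 2.0),     # exactly at the threshold, so excluded
}
selected = select_genes(example_stats)
```

Taking the absolute value of the log2 ratio keeps both induced and repressed genes, which is why a signature selected this way can contain genes changing in either direction.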