Nine data segments (DSs) with different confidences related to CC and BP ontologies (A) and the selection of positives and negatives, as well as GSPs and GSNs (B). Nine DSs contain 5 010 195 protein pairs encompassing 3166 proteins in total. Each DS is labeled as number of proteins/number of protein pairs and the proportion of protein pairs covered by the DS (A). Similar labels are also shown for positives and negatives, as well as GSPs and GSNs (B). Three confidence levels of the protein pairs annotated in the CC ontology are high (H; RSSCC in (0.8, 1]), medium (M; RSSCC in (0.3, 0.8]) and low (L; RSSCC in [0, 0.3]); while three levels of protein pairs annotated in BP ontology are high (H; RSSBP in (0.8, 1]), medium (M; RSSBP in (0.4, 0.8]) and low (L; RSSBP in [0, 0.4]). The nine DSs are divided into two parts, which are positives (HH and MH in rose color) and negatives (the remaining seven DSs in lime). The gold standard positive dataset (GSPs; HH in red) is that part of the positives with the highest confidence while the gold standard negative dataset (GSNs; ML+LL in green) is that part of the negatives with the lowest confidence.