(A) A gene expression profile dataset is represented as a matrix, where columns indicate different samples and rows indicate different genes. Given a transcription factor of interest,
TF, a modulator and a target to be tested by MINDy (
M and
t respectively) are chosen among the remaining genes. Some modulator-target combinations may be eliminated
a priori based on functional or statistical constraints (blue rectangle). For instance, one may want to consider only ubiquitin ligases as candidate modulators. All the samples are then sorted according to the expression of the selected modulator
M. The set of samples (e.g., 35%) with the lowest and highest expression of the modulator are then selected. These sample sets are labeled
M-low and
M-high. In each of the two sample sets, samples are finally sorted according to the
TF expression. Three cases are possible when comparing the TF-target correlation in
M-high vs.
M-low.
Scenario 1 (
Positive Inferred Modulator): Significant increase in MI (i.e. more correlation in the M-high set than in the M-low set);
Scenario 2 (
Negative Inferred Modulator): Significant decrease in MI (i.e. less correlation in the M-high set than in the M-low set);
Scenario 3&4 (
Not a Modulator): No significant MI change is observed.
(B) Synthetic network analysis using MINDy.
TF: the transcription factor of interest;
K: a protein kinase;
cTF: a co-transcription factor;

: activated TF through phosphorylation by
K;

: transcriptionally active complex formed between TF and cTF;
G1,
G2 and
G3 are three downstream targets of TF. More details on the synthetic network are provided in the Methods.
(C) Networks reconstructed by ARACNe and MINDy. ARACNe (left) is based on pairwise MI, thus captures only the unconditional interaction between TF and G
1. MINDy, on the other hand, infers three-way interactions using the CMI. When K is used as candidate modulator (middle), it correctly identified the conditional interaction between TF and G
3; similarly, it also correctly inferred cTF as the modulator of TF-G
2 interaction (right). Interactions are labeled with their respective
p-values. Interactions that are statistically significant, after correction for multiple testing, are shown in black with red
p-values.