(A) The vertical axis is the number of 19 nt sequences of the both-strand mismatch tolerance shown in the horizontal axis. The solid line is the distribution for the non-redundant sequence set. For comparison, the dotted line shows the distribution when all the RefSeq and Unique UniGene sequences are used without removing any duplicates. Observe the dramatic reduction of redundant sequences of mismatch tolerance zero and the increases in the number of sequences of mismatch tolerance one or two. (B) The statistical chart (A) is recomputed for plus-strand mismatch tolerance. Note that the number of 19 nt sequences of mismatch tolerance three increases twofold. (C) The horizontal axis shows the requirement for the minimum number of effective, both/plus-strand specific siRNA candidates on a single mRNA. The vertical axis is the fraction of qualified genes in RefSeq that fulfill the constraint in the horizontal axis. Note that the fraction of qualified genes decreases severely when more both-strand specific siRNA sequences are designed, while the fraction decreases much more slowly when plus-strand specific siRNA are designed. This indicates the usefulness of the plus-strand specificity for designing multiple effective siRNA sequences on a single mRNA sequence. (D) The statistical chart in (A) is restricted to redundant 19 nt sequences and recalculated. This demonstrates that the non-redundant sequence set is indispensable for evaluating mismatch tolerances of redundant sequences correctly.