The linear epitopes of the human proteome space. For each window size, the longest consecutive amino acid stretch in which all windows fall under a threshold value (e.g., no more than six out of eight amino acid residues identical to a protein from another gene) was determined for each of the 22,983 human genes in Ensembl. The maximum consecutive length found among the proteins encoded by each gene was selected as representative of that gene. The number of human genes (Y-axis) in each category of maximum consecutive length (X-axis) is presented for window sizes of (A) eight amino acids (threshold values of 5, 6, and 7 identical amino acids), (B) ten amino acids (threshold values of 7, 8, and 9 identical amino acids), and (C) twelve amino acids (threshold values of 7, 8, 9, 10, and 11 identical amino acids).
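The per-gene value described in this legend can be illustrated with a minimal Python sketch. It is not the authors' implementation. It assumes that the highest identity count against proteins from other genes has already been computed for every window start position, and that a run of k consecutive qualifying windows of size w spans k + w - 1 residues. The function names `longest_stretch` and `gene_representative` are hypothetical.

```python
def longest_stretch(window_identities, window_size, threshold):
    """Length in residues of the longest run of consecutive windows
    whose identity count is no more than `threshold`.

    window_identities: one integer per window start position, giving the
    highest number of identical residues to any protein from another gene
    (assumed to be precomputed).
    """
    best = run = 0
    for identity in window_identities:
        # Extend the current run if the window qualifies, otherwise reset it.
        run = run + 1 if identity <= threshold else 0
        best = max(best, run)
    # Assumption: k overlapping windows of size w cover k + w - 1 residues.
    return best + window_size - 1 if best else 0


def gene_representative(per_protein_identities, window_size, threshold):
    """Maximum stretch length over all proteins encoded by one gene."""
    return max(
        (longest_stretch(v, window_size, threshold) for v in per_protein_identities),
        default=0,
    )
```

For example, with a window size of eight and a threshold of six, a protein whose windows score `[5, 6, 7, 6, 6, 6, 8]` has a longest qualifying run of three windows, which under the stated assumption corresponds to a stretch of ten residues.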