Examples of estimated local structure quality (the Cα distance) of E. coli proteins. The Cα distance is represented in the ‘sausage representation’, where the radius of the tube is proportional to the estimated Cα distance. The Cα distance is predicted by the two-layer linear regression model (Fig. 4). (A), (C) and (E) are predicted by the GTOP-Nest procedure and (B), (D) and (F) are predicted by the SPARKS2 procedure. Associated graphs show predicted Cα distance. Black/gray boxes in the graph show location of α helices/β strands in the chain. (A), yccK (predicted sulfite reductase subunit), the template structure used is 1sauA. The sequence identity (seqID) between the gene and the template is 38%. The global RMSD is predicted 0.7 Å. (B), rpsK (30S ribosomal subunit protein S11), template: 1s1hK, seqID: 35%. Predicted global RMSD: 0.6 Å. (C), yiqP (function unknown), template: 1iktA, seqID: 15%, predicted RMSD: 3.1 Å. (D), holD (DNA polymerasae psi subunit), template: 1em8B, seqID: 80%, predicted RMSD: 3.14 Å. (E), yfgD (predicted oxidoreductase), template: 1rw1A, seqID: 15%, predicted RMSD: 5.3 Å. (F), rhaR (positive regulatory gene), template: 1ft9A, seqID: 7%, predicted RMSD: 5.3 Å.