![]() | ![]() |
Formats:
|
||||||||||||
Assembly reflects evolution of protein complexes 1MRC Laboratory of Molecular Biology, Hills Road, Cambridge CB2 2QH, UK. 2Department of Chemistry, University of Cambridge, Cambridge CB2 1EW, UK. Correspondence and requests for materials should be addressed to E.D.L., C.V.R., or S.A.T. (Email: homomers/at/gmail.com). Author Contributions E.D.L., E.B.E., C.V.R. and S.A.T. designed the experiments and wrote the manuscript; E.D.L. and E.B.E. performed the bioinformatics and mass spectrometry experiments, respectively. Abstract A homomer is formed by self-interacting copies of a protein unit. This is functionally important1,2, as in allostery3-5, and structurally crucial because mis-assembly of homomers is implicated in disease6,7. Homomers are widespread, with 50–70% of proteins with a known quaternary state assembling into such structures8,9. Despite their prevalence, their role in the evolution of cellular machinery10,11 and the potential for their use in the design of new molecular machines12,13, little is known about the mechanisms that drive formation of homomers at the level of evolution and assembly in the cell9,14. Here we present an analysis of over 5,000 unique atomic structures and show that the quaternary structure of homomers is conserved in over 70% of protein pairs sharing as little as 30% sequence identity. Where quaternary structure is not conserved among the members of a protein family, a detailed investigation revealed well-defined evolutionary pathways by which proteins transit between different quaternary structure types. Furthermore, we show by perturbing subunit interfaces within complexes and by mass spectrometry analysis15, that the (dis)assembly pathway mimics the evolutionary pathway. These data represent a molecular analogy to Haeckel's evolutionary paradigm of embryonic development, where an intermediate in the assembly of a complex represents a form that appeared in its own evolutionary history. Our model of self-assembly allows reliable prediction of evolution and assembly of a complex solely from its crystal structure. Although homomers are central to biology, only anecdotal knowledge exists on their principles of evolution and assembly, and no unifying theory has been proposed. Large increases in structural data in recent years, however, have enabled us to study quaternary structure or spatial arrangement of subunits on a data set of 5,375 unique structures. This data set is ~tenfold greater than any studied previously16 (Methods). On the basis of this data set, we quantify how often proteins change their quaternary structure, and identify the evolutionary routes taken to do so. Subsequently, as evolution of a complex can be viewed as assembly over a long timescale, we compare evolutionary routes with (dis)assembly routes probed by mass spectrometry. Homomers can be separated into two main classes of open or closed symmetry. The first class corresponds to open structures that would polymerize to infinity in the absence of limiting factors. Such assemblies (for example, tubulin and actin) are rare in our data set (3%), probably because their innate dynamic character renders them difficult to crystallize. In contrast, closed symmetries are finite in space, and most homomers adopt either cyclic or dihedral symmetry (Fig. 1a
It has long been observed that smaller complexes are more abundant than larger ones, and even numbers of subunits are favoured over odd numbers8,9,17. Here we confirm this observation, with 62% of complexes being dimers. We quantify the different types of symmetries found in homomers and show that the abundance of complexes with even numbers of subunits is due to the prevalence of dihedral complexes. Whenever an option exists for cyclic or dihedral, on average we find an 11-fold preference for dihedral complexes (Fig. 1b Notably, dihedral and cyclic symmetries are geometrically related: a complex with Dn symmetry can be formed from n dimers with C2 symmetry or from two n-mers with Cn symmetry19 (Fig. 1a
When quaternary structure is not conserved, we speculate that pathways linking geometrically related symmetries represent both evolutionary and assembly routes. For example, a dihedral tetramer (D2) can be described as a dimer of dimers, where a back-to-back dimerization patch forms a first dimer, and a second face-to-face dimerization patch forms the dimer of dimers. This is not true of a cyclic tetramer (C4), where subunits interact in a face-to-back manner, such that two different surface patches are involved in forming an interface (Fig. 1a Following this idea, we looked at evolutionary relationships in terms of sequence similarity between different quaternary structures to unveil the routes most commonly taken to build larger complexes (Fig. 2b Notably in this stepwise scenario, two evolutionary routes lead to a dihedral complex (Dn): either from n dimers or from two cyclic n-mers (Fig. 2b It is notable that this signature of complex formation (hierarchy in interface sizes) is conserved throughout evolution. This can be interpreted in at least two different although not mutually exclusive ways: (1) once the complex is formed there is no need to dramatically change the interface size, analogous to the classical explanation for the marginal stability of proteins20 (that is, selective pressure becomes almost non-existent beyond the point where proteins fold); and (2) maintaining a hierarchy of interface strengths is important for a precise order during assembly21,22, in which case the largest interface would reflect the main intermediate species during assembly. To test this hypothesis we targeted ten complexes for study using electrospray mass spectrometry (Fig. 4a
Initially we verified that the complexes could be generated intact and corresponded to the stoichiometry described in the protein data bank (PDB). The mass spectra recorded for two hexamers with D3 symmetry and one 14-mer with D7 symmetry revealed that the intact homomer is maintained in each case (Fig. 4c To address whether the disassembly process was the reverse of the assembly pathway, we attempted to reassemble a subset of the complexes studied by dilution of the denaturant and/or manipulation of the ionic strength. In ~50% of the complexes examined we were able to reassemble the original homomer. These results—together with previous studies where reassembly was found to be strongly dependent on factors such as ionic strength, temperature and concentration of denaturant23,24—indicate that disassembly is the reverse of assembly under the appropriate conditions. To complement our experimental observations, we found six additional complexes for which (dis)assembly intermediates had been reported (Fig. 4b Overall, through analysis of a large set of homomers, we have shown that the evolutionary pathway of a homomer can be inferred from its atomic structure morphology. This allowed us to predict the (dis)assembly pathway of homomers in solution, and design mass-spectrometry-based experiments to validate our predictions. Results revealed that the (dis)assembly pathway, which takes place on a protein-folding timescale (~seconds), mimics the evolutionary pathway that has taken place over a considerably longer timescale (~millions of years). This is the first time that a general principle for formation and assembly of homomers has been demonstrated. We hope that this will stimulate further studies, as relationships between folding, complex formation and aggregation are only beginning to be explored. METHODS SUMMARY Data set of homomers Randomization of evolutionary routes To assess the significance of the number of evolutionary relationships between proteins with different quaternary structures, we compared the observed numbers to a random model of quaternary structure transitions where evolutionary relationships are reassigned randomly in proportion to the size of each quaternary structure type. Prediction of evolutionary routes The size of an interface is given by the number of amino acids in contact, as defined previously8. We predict evolutionary intermediates by taking the ‘closed’ subcomplex containing the largest interface. In cyclic complexes with three or more subunits, each subunit buries two equivalent surfaces. Thus, these interfaces are counted twice when compared to dimer interfaces. Intact complexes Complexes were donated by crystallographers and taken from a random selection from the PDB. For further details see Methods. Generating subcomplexes Intact complexes were disrupted through change in ionic strength or the stepwise addition of dimethylsulphoxide, methanol or acetonitrile. This process is illustrated in Supplementary Fig. 4 and solution conditions are summarized in Supplementary Table 2.
Supplementary Discussion, Figures, Tables, References Click here to view.(818K, pdf) Full Methods Click here to view.(186K, pdf) Acknowledgements We thank the collaborators listed in Supplementary Table 2 for supplying the different complexes and acknowledge H. Hernandez, J. Freeke and L. Lane for assistance with mass spectrometry. We also thank C. Chothia, J. Clark and M. Babu for discussions. This work was supported by the Medical Research Council, the EMBO Young Investigators Programme, the Royal Society and the Waters Kundert Trust. Footnotes Full Methods and any associated references are available in the online version of the paper at www.nature.com/nature. Supplementary Information is linked to the online version of the paper at www.nature.com/nature. Reprints and permissions information is available at www.nature.com/reprints. References 1. Cabezon E, et al. Homologous and heterologous inhibitory effects of ATPase inhibitor proteins on F-ATPases. J. Biol. Chem. 2002;277:41334–41341. [PubMed] 2. Hardy LW, et al. Atomic structure of thymidylate synthase: target for rational drug design. Science. 1987;235:448–455. [PubMed] 3. Iber D, Clarkson J, Yudkin MD, Campbell ID. The mechanism of cell differentiation in Bacillus subtilis. Nature. 2006;441:371–374. [PubMed] 4. Marianayagam NJ, Sunde M, Matthews JM. The power of two: protein dimerization in biology. Trends Biochem. Sci. 2004;29:618–625. [PubMed] 5. Monod J, Wyman J, Changeux JP. On the nature of allosteric transitions: a plausible model. J. Mol. Biol. 1965;12:88–118. [PubMed] 6. Dobson CM. Protein folding and misfolding. Nature. 2003;426:884–890. [PubMed] 7. Hayouka Z, et al. Inhibiting HIV-1 integrase by shifting its oligomerization equilibrium. Proc. Natl Acad. Sci. USA. 2007;104:8316–8321. [PubMed] 8. Levy ED, Pereira-Leal JB, Chothia C, Teichmann SA. 3D complex: a structural classification of protein complexes. PLoS Comput. Biol. 2006;2:e155. [PubMed] 9. Goodsell DS, Olson AJ. Structural symmetry and protein function. Annu. Rev. Biophys. Biomol. Struct. 2000;29:105–153. [PubMed] 10. Ispolatov I, Yuryev A, Mazo I, Maslov S. Binding properties and evolution of homodimers in protein-protein interaction networks. Nucleic Acids Res. 2005;33:3629–3635. [PubMed] 11. Pereira-Leal JB, Levy ED, Kamp C, Teichmann SA. Evolution of protein complexes by duplication of homomeric interactions. Genome Biol. 2007;8:R51. [PubMed] 12. Grueninger D, et al. Designed protein-protein association. Science. 2008;319:206–209. [PubMed] 13. Janin J. Biochemistry. Dicey assemblies. Science. 2008;319:165–166. [PubMed] 14. Blundell TL, Srinivasan N. Symmetry, stability, and dynamics of multidomain and multicomponent protein systems. Proc. Natl Acad. Sci. USA. 1996;93:14243–14248. [PubMed] 15. Hernandez H, et al. Subunit architecture of multimeric complexes isolated directly from cells. EMBO Rep. 2006;7:605–610. [PubMed] 16. Brinda KV, Vishveshwara S. Oligomeric protein structure networks: insights into protein-protein interactions. BMC Bioinformatics. 2005;6:296. [PubMed] 17. Monod J. Nobel Symposium 11: Symmetry and Function of Biological Systems at the Macromolecular Level. Almqvist & Wiksell; Stockholm: 1968. 18. Lukatsky DB, Shakhnovich BE, Mintseris J, Shakhnovich EI. Structural similarity enhances interaction propensity of proteins. J. Mol. Biol. 2007;365:1596–1606. [PubMed] 19. Claverie P, Hofnung M, Monod J. Sur certaines implications de l'hypothèse d'équivalence stricte entre les protomères des protéines oligomériques. C. R. Séanc. Acad. Sci. 1968;266:1616–1618. 20. DePristo MA, Weinreich DM, Hartl DL. Missense meanderings in sequence space: a biophysical view of protein evolution. Nature Rev. Genet. 2005;6:678–687. [PubMed] 21. Bahadur RP, Rodier F, Janin J. A dissection of the protein-protein interfaces in icosahedral virus capsids. J. Mol. Biol. 2007;367:574–590. [PubMed] 22. Powers ET, Powers DL. A perspective on mechanisms of protein tetramer formation. Biophys. J. 2003;85:3587–3599. [PubMed] 23. Luke K, Wittung-Stafshede P. Folding and assembly pathways of co-chaperonin proteins 10: Origin of bacterial thermostability. Arch. Biochem. Biophys. 2006;456:8–18. [PubMed] 24. Cheesman C, Ruddock LW, Freedman RB. The refolding and reassembly of Escherichia coli heat-labile enterotoxin B-subunit: analysis of reassembly-competent and reassembly-incompetent unfolded states. Biochemistry. 2004;43:1609–1617. [PubMed] 25. Kress W, Mutschler H, Weber-Ban E. Assembly pathway of an AAA+ protein: tracking ClpA and ClpAP complex formation in real time. Biochemistry. 2007;46:6183–6193. [PubMed] 26. Levy ED. PiQSi: Protein quaternary structure investigation. Structure. 2007;15:1364–1367. [PubMed] |
PubMed related articles
Your browsing activity is empty. Activity recording is turned off. |
|||||||||||
J Biol Chem. 2002 Nov 1; 277(44):41334-41.
[J Biol Chem. 2002]Science. 1987 Jan 23; 235(4787):448-55.
[Science. 1987]Nature. 2006 May 18; 441(7091):371-4.
[Nature. 2006]J Mol Biol. 1965 May; 12():88-118.
[J Mol Biol. 1965]Nature. 2003 Dec 18; 426(6968):884-90.
[Nature. 2003]BMC Bioinformatics. 2005 Dec 10; 6():296.
[BMC Bioinformatics. 2005]PLoS Comput Biol. 2006 Nov 17; 2(11):e155.
[PLoS Comput Biol. 2006]Annu Rev Biophys Biomol Struct. 2000; 29():105-53.
[Annu Rev Biophys Biomol Struct. 2000]J Mol Biol. 1965 May; 12():88-118.
[J Mol Biol. 1965]J Mol Biol. 2007 Feb 2; 365(5):1596-606.
[J Mol Biol. 2007]Nat Rev Genet. 2005 Sep; 6(9):678-87.
[Nat Rev Genet. 2005]J Mol Biol. 2007 Mar 23; 367(2):574-90.
[J Mol Biol. 2007]Biophys J. 2003 Dec; 85(6):3587-99.
[Biophys J. 2003]Arch Biochem Biophys. 2006 Dec 1; 456(1):8-18.
[Arch Biochem Biophys. 2006]Biochemistry. 2004 Feb 17; 43(6):1609-17.
[Biochemistry. 2004]Biochemistry. 2007 May 29; 46(21):6183-93.
[Biochemistry. 2007]PLoS Comput Biol. 2006 Nov 17; 2(11):e155.
[PLoS Comput Biol. 2006]Structure. 2007 Nov; 15(11):1364-7.
[Structure. 2007]PLoS Comput Biol. 2006 Nov 17; 2(11):e155.
[PLoS Comput Biol. 2006]