Skip to main content
Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
Theochem. 2004 May; 677(1): 73–76.
Published online 2004 Apr 2. doi: 10.1016/j.theochem.2004.02.018
PMCID: PMC7141560
PMID: 32287546

Structural similarity between HIV-1 gp41 and SARS-CoV S2 proteins suggests an analogous membrane fusion mechanism

Abstract

SARS-associated coronavirus (SARS-CoV) has been identified as the causal agent of a new emerging disease: severe acute respiratory syndrome (SARS). Its spike protein S2 is responsible for mediating fusion of viral and cellular membrane. In this study, we modeled the 3D structure of S2 subunit and compared this model with the core structure of gp41 from HIV-1. We found that SARS-CoV S2 and gp41 share the same two α helices, suggesting that the two viruses could follow an analogous membrane fusion mechanism. Further ligand-binding analysis showed that two inhibitors GGL and D-peptide from HIV-1 gp41 may serve as inhibitors for SARS-CoV entry.

Keywords: SARS-CoV, Human immunodeficiency virus type 1, Structural similarity, Inhibitor

1. Introduction

Coronavirus spike protein plays a very important role in virus entry, virus–receptor interaction, variations in host range and tissue tropism. The S proteins of majority of coronaviruses are cleaved into two functional subunits, S1 and S2. Liu et al. [1] indicated that the S protein of SARS-associated coronavirus (SARS-CoV) also forms S1 and S2 domains. The peripheral S1 portion is responsible for cellular receptor recognition, while the membrane-spanning S2 portion mediates the fusion of viral and cellular membrane, hence S protein determines the specificity of host and virulence of coronavirus [2]. Similarly, there are two non-valently associated subunits in the human immunodeficiency virus type 1 (HIV-1) envelope glycoprotein, gp120 and gp41, gp120 directs target-cell recognition and viral tropism through interaction with the cell-surface receptor CD4, while the membrane-spanning gp41 promotes fusion of the viral and cellular membranes so that viral contents are released into the host cell [3].

Sequence analysis revealed that there are some similar motifs in HIV-1 gp41 and SARS-CoV S2 proteins. Gallaher and Garry [4] identified an N-terminal leucine/isoleucine zipper-like sequence and an aromatic-rich region. Kliger and Levanon [5] reported a C-terminal heptad repeat in the upstream of an aromatic-rich region. These discoveries probably lead to development of new therapeutic strategy against SARS-CoV. The goal of this study is to probe whether any similarity exists in the 3D structure of HIV-1 gp41 and SARS-CoV S2 proteins and identification of possible inhibitor-binding sites.

2. Materials and methods

The core structure of HIV-1 gp41 was downloaded from Protein Data Bank (1AIK). The sequence of spike protein was downloaded from GenBank (NP_828851). Liu et al. [1] found that the region 641–1247 of SARS-CoV S protein matches to conserved coronavirus S2 domain PF01601 in HMM database, which is subsequently used in the fold prediction of S2 subunit by 3D Jury meta predictor [6]. The proteins with significant high 3D score were used as templates to construct 3D models of S2 by modeller program [7]. The quality of 3D model was evaluated by proq program [8] and ‘correct’ models were chosen for structure comparison with gp41 by LGA program [9]. The visualization of 3D structure was generated by PROTEIN EXPLORER (http://www.proteinexplorer.org).

3. Results and discussion

Meta-predictor (3D Jury) revealed that there are three significant hits (3D score >50) for S2 subunit of SARS-CoV: 1dg3_A (Human guanylate binding Protein (signaling protein), 3D score 117, threading server ShotGun on 3), 1f5n_A (Human guanylate binding Protein (signaling protein), 3D score 95, threading server mGenTHREADER) and 1ezf_A (Human squalene synthase (transferase), 3D score 82, threading server INBGU). Three templates were used to generate 3D model of S2. The quality evaluation of these models was conducted by proq program in which two measurements (ProQ-LG and ProQ-MX) were used. The corresponding scores are below: ProQ-LG=0.583 and ProQ-MX=0.026 for 1dg3_A, ProQ-LG=1.409 and ProQ-MX=0.034 for 1f5n_A, ProQ-LG=2.503 and ProQ-MX=0.106 for 1ezf_A. In view of the fact that cutoffs for ‘correct model’ are ProQ-LG>1.5 or ProQ-MX>0.1, so the ‘correct’ model for S2 is the model built on template 1ezf_A, although other templates obtained marginally higher 3D scores. The alignment between S2 and template 1ezf_A with ClustalW [10] and secondary structure of S2 predicted by PsiPred v2.3 [11] are displayed in Table 1 . Fig. 1 shows that the 3D model of S2 subunit is primarily consisted of α helices. In fact, Liu et al. [1] predicted the secondary structure of S protein by eight techniques and found that there are nine successive alpha-helices in S2. Based on the presence of hydrophilic and hydrophobic amino acids alternately, they speculated that these helices could be the amphipathic alpha-helices, which collapse into coiled-coils, bring a fusion peptide back toward the transmembrane domain and lead to the fusion of cellular and viral membranes [2].

Table 1

Alignment between SARS-CoV S2 protein and template 1EZF_A and secondary structure of S2

An external file that holds a picture, illustration, etc.
Object name is gr1.jpg

3D model of SARS-CoV S2.

The LGA method is designed to compare protein structures or fragments of protein structures in sequence dependent and sequence independent modes. We superimposed the protein structure of S2 into the core structure of gp41 (1aik) by LGA. Surprisingly we found that the N chain and C chain of gp41 are well overlapped with two helices in the 3D model of S2 subunit (Fig. 2). For SARS-CoV S2 subunit, the corresponding residues include P879-N942, for HIV-1 gp41, the corresponding residues cover S546-G572 (N chain) and W628-L661 (C chain), each of residue pairs has a distance <5 Å (Table 2).

An external file that holds a picture, illustration, etc.
Object name is gr2.jpg

Comparison between gp41 from HIV-1 and SARS-CoV S2. N and C chains of gp41 are colored blue and yellow, respectively. S2 is colored white.

Table 2

Structural similarity between SARS-CoV S2 and HIV-1 gp41 proteins

SARS-CoV S2No. of res.HIV-1 gp41No. of res.RMSDSARS-CoV S2No. of res.HIV-1 gp41No. of res.RMSD
P879S546_N1.973Q908W628_C2.699
F880G547_N2.433F909E630_C3.28
A881I548_N1.687N910W631_C4.096
M882V549_N1.176K911D632_C3.314
Q883Q550_N2.052A912R633_C4.069
M884Q551_N2.094I913E634_C4.296
A885Q552_N1.078S914I635_C3.357
Y886N553_N1.738Q915N636_C3.839
R887N554_N2.282I916
F888L555_N1.796Q917N637_C2.111
N889L556_N1.398E918Y638_C3.013
G890R557_N2.154S919T639_C2.177
I891A558_N1.984L920S640_C1.469
G892I559_N1.911T921L641_C2.303
V893E560_N1.962T922I642_C2.404
T894A561_N1.998T923H643_C1.588
Q895Q562_N2.015S924S644_C2.077
N896Q563_N2.436T925L645_C3.591
V897H564_N3.129A926I646_C3.826
L898L565_N2.223L927E647_C2.728
Y899L566_N1.514G928E648_C2.996
E900Q567_N2.452K929S649_C3.953
N901L568_N4.328L930Q650_C3.291
Q902T569_N3.414Q931N651_C2.464
K903V570_N2.291D932Q652_C2.637
Q904W571_N2.691V933Q653_C3.039
I905V934E654_C1.927
A906N935K655_C0.506
N907G572_N3.757Q936N656_C0.841
N937E657_C1.845
A938Q658_C1.872
Q939E659_C2.903
A940L660_C3.788
L941
N942L661_C4.77

It has been shown that that there are some similar structural motifs in HIV-1 gp41 and SARS-CoV S2 protein [4], [5]: (1) N-terminal leucine/isoleucine heptad repeat sequence on residues 913–1000; (2) C-terminal leucine/isoleucine heptad repeat motif on residues 1151–1185. While our results reveal that SARS-CoV S2 and HIV gp41 share very similar helix structure on residues 879–942, these discoveries suggest a similar membrane fusion mechanism for the two viruses.

Naturally, a question arises: could the inhibitors for anti-HIV-1 therapy be used to fight against SARS-CoV? For example, GGL, a HIV-1 specific cell entry inhibitor that can bind to the coiled-coil core of gp41 and efficiently inhibit HIV-1 envelope-mediated cell-cell fusion [12], and another D-peptide inhibitor, which targets the gp41 coiled-coil pocket and inhibits HIV-1 entry [13]. Fig. 3 showed the binding interaction between GGL and S2 protein, the residues involved are: 901–918. Fig. 4 showed the binding interaction between D-peptide and S2 protein, the residues involved are: 899–915. This suggests GGL and D-peptide inhibitors from HIV-1 gp41 could be used as potential inhibitors for SARS-CoV entry.

An external file that holds a picture, illustration, etc.
Object name is gr3.jpg

(A) Binding interaction between S2 and inhibitor GGL (represented by spacefill). (B) Chemical structure of GGL, its formula is: C5H9NO4.

An external file that holds a picture, illustration, etc.
Object name is gr4.jpg

(A) Binding interaction between S2 and D-peptide inhibitor of HIV-1 gp41 (represented by spacefill. (B) Chemical structure for two molecules of D-peptide DAL (5(C3H7NO2)) and DAR (2(C6H15N4O2)).

References

1. Liu S., Guo T., Ji X., Sun Z. Bioinformatical study on the proteomics and evolution of SARS-CoV. Chinese Science Bulletin. 2003;48:1277–1287. [PMC free article] [PubMed] [Google Scholar]
2. Gallagher T.M., Buchmeier M.J. Coronavirus spike proteins in viral entry and pathogenesis. Virology. 2001;279:371–374. [PMC free article] [PubMed] [Google Scholar]
3. Chan D.C., Fass D., Berger J.M., Kim P.S. Core structure of gp41 from the HIV envelope glycoprotein. Cell. 1997;89:263–273. [PubMed] [Google Scholar]
4. Gallaher W.R., Garry R.F. Model of the pre-insertion region of the spike (S2) fusion glycoprotein of the human SARS coronavirus: implications for antiviral therapeutics. Virology. 2003 http://www.virology.net/sars/s2model.html. [Google Scholar]
5. Kliger Y., Levanon E.Y. Cloaked similarity between HIV-1 and SARS-CoV suggests an anti-SARS strategy. BMC Microbiology. 2003;3:20–26. [PMC free article] [PubMed] [Google Scholar]
6. Ginalski K., Elofsson A., Fischer D., Rychlewski L. 3D-Jury: a simple approach to improve protein structure predictions. Bioinformatics. 2003;19:1015–1018. [PubMed] [Google Scholar]
7. Sali A., Blundell T.L. Comparative protein modeling by satisfaction of spatial restrains. Journal of Molecular Biology. 1993;234:779–815. [PubMed] [Google Scholar]
8. Wallner B., Elofsson A. Can correct protein models be identified? Protein Science. 2003;12:1073–1086. [PMC free article] [PubMed] [Google Scholar]
9. Zemla A. LGA: a method for finding 3D similarities in protein structures. Nucleic Acids Research. 2003;31:3370–3374. [PMC free article] [PubMed] [Google Scholar]
10. Thompson J.D., Higgins D.G., Gibson T.J. CLUSTAL W: improving the sensitivity of progressive multiple sequence alignment through sequence weighting, positions-specific gap penalties and weight matrix choice. Nucleic Acids Research. 1994;22:4673–4680. [PMC free article] [PubMed] [Google Scholar]
11. Jones D.T. Protein secondary structure prediction based on position-specific scoring matrices. Journal of Molecular Biology. 1999;292:195–202. [PubMed] [Google Scholar]
12. Zhou G., Ferrer M., Chopra R., Kapoor T.M., Strassmaier T., Weissenhorn W., Skehel J.J., Oprian D., Schreiber S.L., Harrison S.C., Wiley D.C. The structure of an HIV-1 specific cell entry inhibitor in complex with the HIV-1 gp41 trimeric core. Bioorganic and Medicinal Chemistry. 2000;8(9):2219–2227. [PubMed] [Google Scholar]
13. Eckert D.M., Malashkevich V.N., Hong L., Carr P.A., Kim P.S. Inhibiting HIV-1 entry: discovery of D-peptide inhibitors that target the gp41 coiled-coil pocket. Cell. 1999;99:103–115. [PubMed] [Google Scholar]

Articles from Theochem are provided here courtesy of Elsevier