

Conserved Protein Domain Family
HKU1-CoV-like_Spike_SD1-2_S1-S2_S2

cd22380: HKU1-CoV-like_Spike_SD1-2_S1-S2_S2 
SD-1 and SD-2 subdomains, the S1/S2 cleavage region, and the S2 fusion subunit of the spike (S) glycoprotein from human HKU1 and OC43 coronaviruses and related betacoronaviruses in the A lineage
This group contains the SD-1 and SD-2 subdomains of the S1 subunit C-terminal domain (C-domain), the S1/S2 cleavage region, and the S2 fusion subunit of the spike (S) glycoprotein from betacoronaviruses in the embecovirus subgenus (A lineage), including the human coronaviruses (CoVs) HKU1 and OC43, as well as murine hepatitis virus (MHV). The CoV S protein is an envelope glycoprotein that plays a central role in viral attachment, fusion, and entry into host cells, and serves as a major target for the development of neutralizing antibodies, inhibitors of viral entry, and vaccines. It is synthesized as a precursor protein that is cleaved into an N-terminal S1 subunit (~700 amino acids) and a C-terminal S2 subunit (~600 amino acids), which mediate attachment and membrane fusion, respectively. Three S1/S2 heterodimers assemble to form a trimeric spike protruding from the viral envelope. The S1 subunit contains a receptor-binding domain (RBD), while the S2 subunit contains the coronavirus fusion machinery and is primarily alpha-helical. S1 contains two structurally independent domains, the N-terminal domain (NTD) and the C-domain. The S1 C-domain also contains two subdomains (SD-1 and SD-2), which connect the S1 and S2 subunits. Depending on the virus, either the NTD or the C-domain can serve as the RBD. While the RBD of MHV is located at the NTD, most CoVs, including SARS-CoV-2, SARS-CoV, and MERS-CoV, use the C-domain to bind their receptors. The S2 subunit comprises the fusion peptide (FP), a second proteolytic site (S2'), followed by an internal fusion peptide (IFP) and two heptad-repeat domains (HR1 and HR2) preceding the transmembrane domain (TM). After binding of the S1 subunit RBD on the virion to its receptor on the target cell, the HR1 and HR2 domains interact with each other to form a six-helix bundle (6-HB) fusion core, bringing viral and cellular membranes into close proximity for fusion and infection.
To catalyze the membrane fusion reaction, CoV S needs to be primed through cleavage at the S1/S2 and S2' sites. In human-infecting coronaviruses such as SARS-CoV-2, HCoV-OC43, MERS-CoV, and HCoV-HKU1, the spike protein contains an insertion of (R/K)-(2X)n-(R/K) (furin cleavage motif) at the S1/S2 site, which is absent in SARS-CoV and other SARS-related coronaviruses, as well as in Rousettus bat coronavirus HKU9. The region modeled in this cd (SD-1 and SD-2, the S1/S2 cleavage region, and the S2 fusion subunit) plays an essential role in viral entry by initiating fusion of the viral and cellular membranes.
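As a rough illustration of the (R/K)-(2X)n-(R/K) motif described above, the following Python sketch scans a protein sequence with a regular expression. The range n = 0 to 3 and the function name find_furin_like are assumptions made for illustration; this record gives only the general form of the motif. The two test fragments are copied from the Q14EB0 and 5I08_A alignment rows around the S1/S2 region. The pattern is deliberately permissive, so hits are candidates only, not confirmed cleavage sites.

```python
import re

# Assumption: n (the number of two-residue spacers) ranges over 0..3.
# A lookahead is used so that overlapping candidates are all reported.
FURIN_LIKE = re.compile(r"(?=([RK](?:[A-Z]{2}){0,3}[RK]))")


def find_furin_like(seq):
    """Return (0-based start, matched text) for each candidate motif."""
    seq = seq.upper()
    return [(m.start(1), m.group(1)) for m in FURIN_LIKE.finditer(seq)]


# Fragments taken from the alignment rows of this record.
hku1_n2_fragment = "PSSRRKRRGISSPYRF"   # Q14EB0, contains RRKRR
pdb_5i08_fragment = "PSSGGSGSGISSPYRF"  # 5I08_A, basic residues replaced

if __name__ == "__main__":
    print(find_furin_like(hku1_n2_fragment))
    print(find_furin_like(pdb_5i08_fragment))
```

In this sketch the Q14EB0 fragment yields several overlapping candidates around RRKRR, while the 5I08_A fragment, which carries GGSGS at the corresponding positions, yields none.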
Statistics
PSSM-Id: 411967
Aligned: 50 rows
Threshold Bit Score: 1231.18
Created: 20-May-2020
Updated: 25-Oct-2021
Structure
Conserved site includes 13 residues
Feature 1: N-linked glycosylation sites [structural motif]
Evidence:
  • Comment: Betacoronavirus spike proteins typically contain 22-30 N-linked glycosylation sites per protomer, depending on the species.
  • Structure: 6NZK: Human OC43-CoV spike (S) protein may have a total of 20 N-linked glycan sites per protomer, which likely play a role in protein folding and immune evasion, with each trimer displaying 60 N-linked glycosylation sites.
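Candidate N-linked glycosylation sites are commonly located by searching for the N-X-S/T sequon, where X is any residue except proline. The Python sketch below applies that widely used consensus to a sequence; the function name find_sequons and the example fragment (an uppercase stretch copied from the 5I08_A alignment row) are chosen for illustration. A sequon match only marks a potential site and does not show that the asparagine is actually glycosylated.

```python
import re

# N-X-S/T with X != P; lookahead so overlapping sequons are all found.
SEQUON = re.compile(r"(?=(N[^P][ST]))")


def find_sequons(seq):
    """Return 0-based positions of the Asn in each N-X-S/T sequon."""
    return [m.start(1) for m in SEQUON.finditer(seq.upper())]


def sites_per_trimer(sites_per_protomer):
    """A spike trimer has three protomers, so the count is tripled."""
    return 3 * sites_per_protomer


if __name__ == "__main__":
    fragment = "LNNSIPNLSDFEAELSLWFKNHTSIAPNLTF"  # from the 5I08_A row
    print(find_sequons(fragment))
    print(sites_per_trimer(20))  # 20 per protomer, as stated for 6NZK
```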

Sequence Alignment
Feature 1                                              #                   #                  #
5I08_A     611 LYGITGQGIFKEVSAAYYNNWQNLLYDsNGNIIGFKDFLTNKTYTILPCYSGRVSAAFYQNsssPALLYRNLKCSYVLNN 690  Human coronaviru...
6NZK_A     636 LYGILGQGIFVEVNATYYNSWQNLLYDsNGNLYGFRDYITNRTFMIRSCYSGRVSAAFHANssePALLFRNIKCNYVFNN 715  Human coronaviru...
5I08_B     611 LYGITGQGIFKEVSAAYYNNWQNLLYDsNGNIIGFKDFLTNKTYTILPCYSGRVSAAFYQNsssPALLYRNLKCSYVLNN 690  Human coronaviru...
5I08_C     611 LYGITGQGIFKEVSAAYYNNWQNLLYDsNGNIIGFKDFLTNKTYTILPCYSGRVSAAFYQNsssPALLYRNLKCSYVLNN 690  Human coronaviru...
Q5MQD0     626 LYGITGQGIFKEVSAVYYNSWQNLLYDsNGNIIGFKDFVTNKTYNIFPCYAGRVSAAFHQNassLALLYRNLKCSYVLNN 705  Human coronaviru...
Q14EB0     624 LYGITGQGIFKEVSAAYYNNWQNLLYDsNGNIIGFKDFLTNKTYTILPCYSGRVSAAFYQNsssPALLYRNLKCSYVLNN 703  Human coronaviru...
AYR18616   617 LYGITGRGIFKEVSADYYNSWQNLLYDvNGNLYGFKDYQTNKTYTIRPCYSGRVSAALHQEapePALLYRNLKCNYVYNN 696  Betacoronavirus sp.
AID16649   621 LYGITGRGIFKEVSADYYNSWQNLLYDvNGNLYGFKDYQTNKTYTIRPCYSGRVSAALHQEapePVLLYRNLKCSYVYNN 700  Longquan Rl rat ...
ABS87264   626 LFGITGQGVFKEVKADYYHSWQNLLYDvNGNLEGFRDIITNKTYTIRSCYSGRVSAAYHQDapePALLYRNLKCDYVFNN 705  Murine hepatitis...
AAF19386   625 LYGITGQGIFKEVKADYYHSWQNLLYDvNGNLIGFRDFVANKSYTIRSCYSGRVSAAYHQDapePALLYRNLKCDYVFNN 704  Murine hepatitis...
Feature 1                             #                                                     #  
5I08_A     691 Isfis--qpfYFDSYLGCVLNAVNlTSYSVSSCDLRMGSGFCIDYa---lPSSGGSGSGISSPYRFVTFEPFNVsfVNDS 765  Human coronaviru...
6NZK_A     716 SltrqlqpinYFDSYLGCVVNAYNsTAISVQTCDLTVGSGYCVDY-----SKNGGSGGAITTGYRFTNFEPFTVnsVNDS 790  Human coronaviru...
5I08_B     691 Isfis--qpfYFDSYLGCVLNAVNlTSYSVSSCDLRMGSGFCIDYa---lPSSGGSGSGISSPYRFVTFEPFNVsfVNDS 765  Human coronaviru...
5I08_C     691 Isfis--qpfYFDSYLGCVLNAVNlTSYSVSSCDLRMGSGFCIDYa---lPSSGGSGSGISSPYRFVTFEPFNVsfVNDS 765  Human coronaviru...
Q5MQD0     706 Islt---tqpYFDSYLGCVFNADNlTDYSVSSCALRMGSGFCVDYnspssSSSRRKRRSISASYRFVTFEPFNVsfVNDS 782  Human coronaviru...
Q14EB0     704 Isfis--qpfYFDSYLGCVLNAVNlTSYSVSSCDLRMGSGFCIDYa---lPSSRRKRRGISSPYRFVTFEPFNVsfVNDS 778  Human coronaviru...
AYR18616   697 SisreaqplkYFDSYLGCVVNADNyTDDSVSVCDLRMGGGFCVDY-----SIAHRGRRALSTGYRFTSFEPYNVsiVNDS 771  Betacoronavirus sp.
AID16649   701 SisreaqplkYFDSYLGCVVNADNyTDDSVHTCDLRMGSGFCVDY-----STARRKRRDLSTGYRFTTFEPYNVsvVNDS 775  Longquan Rl rat ...
ABS87264   706 NisreetplnYFDSYLGCVVNADNsTEEAVAVCDLRMGSGLCVNY-----STSHRARRSISTGYKLTTFEPFTVsiVNDS 780  Murine hepatitis...
AAF19386   705 NisreetplnYFDSYLGCVVNADNsTEEAVDACDLRMGSGLCVNY-----STSHRARSSVSTGYKLTTFEPFTVriVNDS 779  Murine hepatitis...
Feature 1                    #                                                                 
5I08_A     766 VETVGGLFEIQIPTNFTIAGHEEFIQTSSPKVTIDCSAFVCSNYAACHDLLSEYGTFCDNINSILNeVNDLLDITQLQVA 845  Human coronaviru...
6NZK_A     791 LEPVGGLYEIQIPSEFTIGNMVEFIQTSSPKVTIDCAAFVCGDYAACKSQLVEYGSFCDNINAILTeVNELLDTTQLQVA 870  Human coronaviru...
5I08_B     766 VETVGGLFEIQIPTNFTIAGHEEFIQTSSPKVTIDCSAFVCSNYAACHDLLSEYGTFCDNINSILNeVNDLLDITQLQVA 845  Human coronaviru...
5I08_C     766 VETVGGLFEIQIPTNFTIAGHEEFIQTSSPKVTIDCSAFVCSNYAACHDLLSEYGTFCDNINSILNeVNDLLDITQLQVA 845  Human coronaviru...
Q5MQD0     783 IESVGGLYEIKIPTNFTIVGQEEFIQTNSPKVTIDCSLFVCSNYAACHDLLSEYGTFCDNINSILDeVNGLLDTTQLHVA 862  Human coronaviru...
Q14EB0     779 VETVGGLFEIQIPTNFTIAGHEEFIQTSSPKVTIDCSAFVCSNYAACHDLLSEYGTFCDNINSILNeVNDLLDITQLQVA 858  Human coronaviru...
AYR18616   772 VESVGGLYEIQIPINFTIGSHEEFIQTTSPKVTIDCAAFVCSDYAACRQQLVEYGTFCDNINTILSeVNGLLDNTQLQVA 851  Betacoronavirus sp.
AID16649   776 VEAVGGLYEIQIPINFTIGSHEEFIQTSSPKVTIDCAAFVCSDYAACRQQLVEYGTFCDNINTILSeVNGLLDNTQLQVA 855  Longquan Rl rat ...
ABS87264   781 VQSVGGLYEMQIPINFTIGQHQEFIQTRAPKVTIDCAAFVCGDYTACRQQLVEYGSFCDNINAILGeVNNLIDTMQLQVA 860  Murine hepatitis...
AAF19386   780 VESVDGLYELQIPTNFTIASHQEFVQTRSPKVTIDCAAFVCGGHTACRQQLVEYGSFCDNINAILGeVNNLIDTMQLQVA 859  Murine hepatitis...
Feature 1                                                                             #        
5I08_A     846 NALMqGVTLSSNLNTNLHSDVDNIDFKSLLGCLGsqcg------sssRSLLEDLLFNKVKLSDVGFVEAynNCTGGseir 919  Human coronaviru...
6NZK_A     871 NSLMnGVTLSTKLKDGVNFNVDDINFSPVLGCLGsecs-----kassRSAIEDLLFDKVKLSDVGFVEAynNCTGGaeir 945  Human coronaviru...
5I08_B     846 NALMqGVTLSSNLNTNLHSDVDNIDFKSLLGCLGsqcg------sssRSLLEDLLFNKVKLSDVGFVEAynNCTGGseir 919  Human coronaviru...
5I08_C     846 NALMqGVTLSSNLNTNLHSDVDNIDFKSLLGCLGsqcg------sssRSLLEDLLFNKVKLSDVGFVEAynNCTGGseir 919  Human coronaviru...
Q5MQD0     863 DTLMqGVTLSSNLNTNLHFDVDNINFKSLVGCLGphcg------sssRSFFEDLLFDKVKLSDVGFVEAynNCTGGseir 936  Human coronaviru...
Q14EB0     859 NALMqGVTLSSNLNTNLHSDVDNIDFKSLLGCLGsqcg------sssRSLLEDLLFNKVKLSDVGFVEAynNCTGGseir 932  Human coronaviru...
AYR18616   852 NSLMqGVTLSSRLKSGIDLDVDDINFDPVMGCLGsscg------psyRSTIEDLLFNKVKIADVGFVEAynNCTGGnelr 925  Betacoronavirus sp.
AID16649   856 NSLMqGVTLSSTLKSGIDLDVDDINFNPVMGCLGsscg------ssyRSTIEDLLFNKVKIADVGFVEAynNCTGGnelr 929  Longquan Rl rat ...
ABS87264   861 SALIqGVTLSSRLADGIGGQIDDINFSPLLGCLGsdcgegttaalkgRSVIEDMLFDKVKLSDVGFVEAynNCTGGqevr 940  Murine hepatitis...
AAF19386   860 SALIqGVTLSSRLSDGIGGQIDDINFSPLLGCLGsdcgevtmaaqtgRSAIEDVLFDKVKLSDVGFVEAynNCTGGqevr 939  Murine hepatitis...
Feature 1                                                                                      
5I08_A     920 DLLCVQSFNGIKVLPPILSETQISGYTTAATVAAMFpPWSAAAGVPFSLNVQYRINGLGVTMDVLNKNQKLIANAFNKAL 999  Human coronaviru...
6NZK_A     946 DLICVQSYKGIKVLPPLLSENQFSGYTLAATSASLFpPWTAAAGVPFYLNVQYRINGLGVTMDVLSQNQKLIANAFNNAL 1025 Human coronaviru...
5I08_B     920 DLLCVQSFNGIKVLPPILSETQISGYTTAATVAAMFpPWSAAAGVPFSLNVQYRINGLGVTMDVLNKNQKLIANAFNKAL 999  Human coronaviru...
5I08_C     920 DLLCVQSFNGIKVLPPILSETQISGYTTAATVAAMFpPWSAAAGVPFSLNVQYRINGLGVTMDVLNKNQKLIANAFNKAL 999  Human coronaviru...
Q5MQD0     937 DLLCVQSFNGIKVLPPILSESQISGYTTAATVAAMFpPWSAAAGIPFSLNVQYRINGLGVTMDVLNKNQKLIATAFNNAL 1016 Human coronaviru...
Q14EB0     933 DLLCVQSFNGIKVLPPILSETQISGYTTAATVAAMFpPWSAAAGVPFSLNVQYRINGLGVTMDVLNKNQKLIANAFNKAL 1012 Human coronaviru...
AYR18616   926 DLICVQSFNGIKVLPPVLSESQISGYTTAATAASIFpPWSAAAGVPFSLSVQYRINGLGVTMDVLSENQKLIANAFNNAL 1005 Betacoronavirus sp.
AID16649   930 DLICVQSFNGIKVLPPVLSESQISGYTTAATAASIFpPWSAAAGVPFSLSVQYRINGLGVTMDVLSENQKLIANAFNNAL 1009 Longquan Rl rat ...
ABS87264   941 DLLCVQSFNGIKVLPPVLSESQISGYTAGATASAMFpPWSAAAGVPFSLSVQYRINGLGVTMNVLSENQKMIASAFNNAI 1020 Murine hepatitis...
AAF19386   940 DLLCVQSFNGIKVLPPVLSENQISGYTAGATVSAMF-PWSAAAGVPFSLSVQYRINGLGVTMNVLSENQKMIASAFNNAI 1018 Murine hepatitis...
Feature 1                                                                                      
5I08_A    1000 LSIQNGFTATNSALAKIQSVVNANAQALNSLLQQLFNKFGAISSSLQEILSRLDNLEAQVQIDRLINGRLTALNAYVSQQ 1079 Human coronaviru...
6NZK_A    1026 YAIQEGFDATNSALVKIQAVVNANAEALNNLLQQLSNRFGAISASLQEILSRLDALEAEAQIDRLINGRLTALNAYVSQQ 1105 Human coronaviru...
5I08_B    1000 LSIQNGFTATNSALAKIQSVVNANAQALNSLLQQLFNKFGAISSSLQEILSRLDNLEAQVQIDRLINGRLTALNAYVSQQ 1079 Human coronaviru...
5I08_C    1000 LSIQNGFTATNSALAKIQSVVNANAQALNSLLQQLFNKFGAISSSLQEILSRLDNLEAQVQIDRLINGRLTALNAYVSQQ 1079 Human coronaviru...
Q5MQD0    1017 LSIQNGFSATNSALAKIQSVVNSNAQALNSLLQQLFNKFGAISSSLQEILSRLDALEAQVQIDRLINGRLTALNAYVSQQ 1096 Human coronaviru...
Q14EB0    1013 LSIQNGFTATNSALAKIQSVVNANAQALNSLLQQLFNKFGAISSSLQEILSRLDNLEAQVQIDRLINGRLTALNAYVSQQ 1092 Human coronaviru...
AYR18616  1006 GAIQKGFDATNSALAKIQNVVNANAEALNNLLLQLSNRFGAISASLQEILSRLDALEAQVQIDRLINGRLTALNAYVSQQ 1085 Betacoronavirus sp.
AID16649  1010 GAIQNGFDATNSALAKIQNVVNANAEALNNLLLQLSNRFGAISASLQEILSRLDALEAQVQIDRLINGRLTALNAYVSQQ 1089 Longquan Rl rat ...
ABS87264  1021 GAIQEGFDATNSALAKIQSVVNANAEALNNLLQQLSNRFGAISASLQEILSRLDALEAQAQIDRLINGRLTALNAYVSKQ 1100 Murine hepatitis...
AAF19386  1019 GAIQEGFAATNSALAKMQFVVNANAEALNNLLNQLSNRFGAISASLQEILSRLDALEAQAQIDRLINGRLTALNAYVSKQ 1098 Murine hepatitis...
Feature 1                                                                                      
5I08_A    1080 LSDITLIKAGASRAIEKVNECVKSQSPRINFCGNGnHILSLVQNAPYGLLFIHFSYKPTSFKTVLVSPGLCLSGDRgiap 1159 Human coronaviru...
6NZK_A    1106 LSDSTLVKFSAAQAMEKVNECVKSQSSRINFCGNGnHIISLVQNAPYGLYFIHFSYVPTKYVTARVSPGLCIAGDRgiap 1185 Human coronaviru...
5I08_B    1080 LSDITLIKAGASRAIEKVNECVKSQSPRINFCGNGnHILSLVQNAPYGLLFIHFSYKPTSFKTVLVSPGLCLSGDRgiap 1159 Human coronaviru...
5I08_C    1080 LSDITLIKAGASRAIEKVNECVKSQSPRINFCGNGnHILSLVQNAPYGLLFIHFSYKPTSFKTVLVSPGLCLSGDRgiap 1159 Human coronaviru...
Q5MQD0    1097 LSDISLVKFGAALAMEKVNECVKSQSPRINFCGNGnHILSLVQNAPYGLLFMHFSYKPISFKTVLVSPGLCISGDVgiap 1176 Human coronaviru...
Q14EB0    1093 LSDITLIKAGASRAIEKVNECVKSQSPRINFCGNGnHILSLVQNAPYGLLFIHFSYKPTSFKTVLVSPGLCLSGDRgiap 1172 Human coronaviru...
AYR18616  1086 LSDITLVKFSASQAIEKVNECVKSQSPRINFCGNGnHILSLVQNAPYGLYFIHFSYVPTAFKTAYVSPGLCISGNRglap 1165 Betacoronavirus sp.
AID16649  1090 LSDITLVKFSASQAIEKVNECVKSQSPRINFCGNGnHILSLVQNAPYGLYFIHFSYVPIAFKTAYVSPGLCIAGDRglap 1169 Longquan Rl rat ...
ABS87264  1101 LSDMTLVKVSAAQAIEKVNECVKSQSPRINFCGNGnHILSLVQSAPYGLYFIHFSYVPTSFTTVNVSPGLCISGDRglap 1180 Murine hepatitis...
AAF19386  1099 LSDMTLVKVSAAQAIEKVNECVKSQSSRINFCGNGnHILSLVQNAPYGLYFIHFSYVPTSFTTANVSPGLCISGDRglap 1178 Murine hepatitis...
Feature 1              #                             #         #                  #            
5I08_A    1160 kqgyFIKQNDSWMFTGSSYYYPEPISDKNvVFMNSCSVNFTKAPFiyLNNSIPNLSDFEAELSLWFKNHTSIAPNLTFn- 1238 Human coronaviru...
6NZK_A    1186 ksgyFVNVNNTWMYTGSGYYYPEPITENNvVVMSTCAVNYTKAPYvmLNTSIPNLPDFKEELDQWFKNQTSVAPDLSLd- 1264 Human coronaviru...
5I08_B    1160 kqgyFIKQNDSWMFTGSSYYYPEPISDKNvVFMNSCSVNFTKAPFiyLNNSIPNLSDFEAELSLWFKNHTSIAPNLTFn- 1238 Human coronaviru...
5I08_C    1160 kqgyFIKQNDSWMFTGSSYYYPEPISDKNvVFMNSCSVNFTKAPFiyLNNSIPNLSDFEAELSLWFKNHTSIAPNLTFn- 1238 Human coronaviru...
Q5MQD0    1177 kqgyFIKHNDHWMFTGSSYYYPEPISDKNvVFMNTCSVNFTKAPLvyLNHSVPKLSDFESELSHWFKNQTSIAPNLTLnl 1256 Human coronaviru...
Q14EB0    1173 kqgyFIKQNDSWMFTGSSYYYPEPISDKNvVFMNSCSVNFTKAPFiyLNNSIPNLSDFEAEFSLWFKNHTSIAPNLTFn- 1251 Human coronaviru...
AYR18616  1166 kagyFVQENNQWMFTGSSYYYPEPITDKNsVVMSSCSVNYTKAPDvfLNNSIPNLPDFKEELDQWFKNQTSVAPDLSLdl 1245 Betacoronavirus sp.
AID16649  1170 kagyFVKENGQWLFTGSSYYYPEPITDKNsVVMSTCAVNYTKAPDvfLNNSIPNLPDFKEELDQWFKNQTSIAPDLSLdl 1249 Longquan Rl rat ...
ABS87264  1181 kagyFVQDNGEWKFTGSGYYYPEPINDKNsVVMSSCAVNYTKAPEvfLNTSIPNLPDFKEELDKWFKNQTSIAPDLSLdf 1260 Murine hepatitis...
AAF19386  1179 kagyFVQDDGEWKFTGSNYYYPEPITDKNsVVMSSCAANYTKAPEvfLNTSIPNLPDFKEELDKWFKNQTSIAPDLSLdf 1258 Murine hepatitis...
Feature 1         #                     #         
5I08_A    1239 sHINATFLDLY-YEMNVIQESIKSLNGSGYIPEAP 1272 Human coronavirus HKU1 (isolate N5)
6NZK_A    1265 -YINVTFLDLLiKRMKQIEDKIEEIESKQKKIENE 1298 Human coronavirus OC43
5I08_B    1239 sHINATFLDLY-YEMNVIQESIKSLNGSGYIPEAP 1272 Human coronavirus HKU1 (isolate N5)
5I08_C    1239 sHINATFLDLY-YEMNVIQESIKSLNGSGYIPEAP 1272 Human coronavirus HKU1 (isolate N5)
Q5MQD0    1257 hTINATFLDLY-YEMNLIQESIKSLNNSYINLKDI 1290 Human coronavirus HKU1 (isolate N1)
Q14EB0    1252 sHINATFLDLY-YEMNVIQESIKSLNSSFINLKEI 1285 Human coronavirus HKU1 (isolate N2)
AYR18616  1246 eKINVTFLDLH-DEMNRIQEAIKQLNQSYINLKEI 1279 Betacoronavirus sp.
AID16649  1250 eKINVTFLDLH-DEMNRIQEAIKKLNESYINLKEI 1283 Longquan Rl rat coronavirus
ABS87264  1261 eKLNVTFLDLT-DEMNRIQESIKKLNESYINLKEV 1294 Murine hepatitis virus
AAF19386  1259 eKLNVTLLDLT-DEMNRIQDAIKKLNESYINLKDV 1292 Murine hepatitis virus strain 2
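To compare two rows of the alignment above, one simple measure is the fraction of identical residues over the aligned columns. The Python sketch below assumes that uppercase letters mark aligned residues, lowercase letters mark unaligned residues, and "-" marks a gap; that reading of the display is an assumption, and the function name aligned_identity is hypothetical. The example rows are the final alignment block for 5I08_A and 6NZK_A.

```python
def aligned_identity(row_a, row_b):
    """Fraction of identical residues over columns aligned in both rows.

    Assumption: a column counts as aligned only when both rows carry an
    uppercase letter there; lowercase letters and '-' are skipped.
    """
    if len(row_a) != len(row_b):
        raise ValueError("rows must have the same number of columns")
    compared = 0
    identical = 0
    for a, b in zip(row_a, row_b):
        if a.isupper() and b.isupper():
            compared += 1
            if a == b:
                identical += 1
    if compared == 0:
        return 0.0
    return identical / compared


if __name__ == "__main__":
    hku1 = "sHINATFLDLY-YEMNVIQESIKSLNGSGYIPEAP"  # 5I08_A, last block
    oc43 = "-YINVTFLDLLiKRMKQIEDKIEEIESKQKKIENE"  # 6NZK_A, last block
    print(round(aligned_identity(hku1, oc43), 2))
```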
