Sequence comparison of the N genes of five strains of the coronavirus mouse hepatitis virus suggests a three domain structure for the nucleocapsid protein

Virology. 1990 Nov;179(1):463-8. doi: 10.1016/0042-6822(90)90316-j.

Abstract

To obtain information about the structure and evolution of the nucleocapsid (N) protein of the coronavirus mouse hepatitis virus (MHV), we determined the entire nucleotide sequences of the N genes of MHV-A59, MHV-3, MHV-S, and MHV-1 from cDNA clones. At the nucleotide level, the N gene sequences of these viral strains, and that of MHV-JHM, were more than 92% conserved overall. Even higher nucleotide sequence identity was found in the 3' untranslated regions (3' UTRs) of the five strains, which may reflect the role of the 3' UTR in negative-strand RNA synthesis. All five N genes were found to encode markedly basic proteins of 454 or 455 residues having at least 94% sequence identity in pairwise comparisons. However, amino acid sequence divergences were found to be clustered in two short segments of N, putative spacer regions that, together, constituted only 11% of the molecule. Thus, the data suggest that the MHV N protein is composed of three highly conserved structural domains connected to each other by regions that have much less constraint on their amino acid sequences. The first two conserved domains contain most of the excess of basic amino acid residues; by contrast, the carboxy-terminal domain is acidic. Finally, we noted that four of the five N genes contain an internal open reading frame that potentially encodes a protein of 207 amino acids having a large proportion of basic and hydrophobic residues.

Publication types

  • Comparative Study
  • Research Support, U.S. Gov't, P.H.S.

MeSH terms

  • Amino Acid Sequence
  • Base Sequence
  • Biological Evolution
  • Capsid / genetics*
  • Cloning, Molecular
  • Genes, Viral*
  • Molecular Sequence Data
  • Murine hepatitis virus / genetics*
  • Sequence Homology, Nucleic Acid
  • Species Specificity
  • Viral Core Proteins / genetics*

Substances

  • Viral Core Proteins

Associated data

  • GENBANK/M35253
  • GENBANK/M35254
  • GENBANK/M35255
  • GENBANK/M35256