• We are sorry, but NCBI web applications do not support your browser and may not function properly. More information
Logo of jvirolPermissionsJournals.ASM.orgJournalJV ArticleJournal InfoAuthorsReviewers
J Virol. Nov 2011; 85(21): 11538–11539.
PMCID: PMC3194942

Complete Genome Sequence of a Chinese Virulent Porcine Epidemic Diarrhea Virus Strain


CH/S is a virulent porcine epidemic diarrhea virus (PEDV) strain and is used as the virulent strain to evaluate the protection rates of vaccines against PEDV infection in China. Here, we report the complete genome sequence of strain CH/S, which may aid in understanding the molecular characteristics of this strain.


Porcine epidemic diarrhea virus (PEDV), a member of genus Alphacoronavirus (family Coronaviridae, order Nidovirales), is a single-stranded positive-sense RNA virus that causes an acute and highly contagious enteric disease characterized by severe enteritis, vomiting, and watery diarrhea in swine. Strain CH/S occurred in a swine breeding farm in 1986 in Shanghai and was confirmed as PEDV by electron microscopy, direct immunofluorescence testing, and serum neutralization testing (4). To date, the complete genome sequence of only Belgian strain CV777 has been reported (9). It is thus necessary to analyze the complete genome sequence of CH/S and understand its molecular characteristics.

The 5′ and 3′ ends of the genome of CH/S were confirmed by using a Smarter rapid amplification of cDNA ends (RACE) kit (Clontech, Japan), and the other parts were generated by 12 overlapping cDNA fragments to encompass the entire genome and determined by genome walking sequencing. The coronavirus genome is a single-stranded positive-sense RNA genome of 27 to 32 kb that is infectious (2, 10). The complete genome sequence of CH/S is 28,026 nucleotides (nt) in length [excluding the poly(A) tail]. The 5′ untranslated region (UTR) existing on the extreme 5′ end of the genome has 292 nt and contains a positioned short AUG-initiated open reading frame (ORF), which consists of 39 nt (nt 95 to 133) and encodes a peptide of 12 amino acids. Such small ORFs are present in all coronaviruses sequenced so far (1, 3, 5, 8, 11, 12, 13, 15, 16, 17). The 3′ UTR existing on the extreme 3′ end of the genome is 334 nt in length and possesses an octameric sequence, GGAAGAGC, beginning at base 73 upstream from the poly(A) tail. The genome of CH/S contains six genes, the replicase (Rep), the spike (S), ORF3, envelope (E), membrane (M), and nucleoprotein (N) genes, arranged in the order 5′-Rep-S-ORF3-E-M-N-3′. Two long ORFs (ORF1a and ORF1b) are of 12,351 nt (nt 293 to 12643) and 8,037 nt (nt 12598 to 20634) in length and overlap by 46 nt. At the overlapping region, there is a specific seven-nucleotide “slippery” sequence (UUUAAAC) and a pseudoknot structure (ribosomal frameshifting signal), which are required for the translation of ORF1b (7).

Sequences at the 5′ end of each gene represent signals for the transcription of subgenomic mRNAs of coronavirus (10, 14). These sequences, known as transcription-regulating sequences (TRSs), include a stretch of a highly conserved sequence designated the core sequence (CS), located at sites immediately upstream of most of the genes (7). The CSs in the genome of CH/S are the hexameric motifs 5′-XUA(A/G)AC-3′. The sequence CUAAAC found at the starting sites of gene 1 and the N gene, the GUAAAC sequence found at the starting site of the S gene, the CUAGAC sequence found at the starting sites of gene 3 and the E gene, and the AUAAAC sequence found at the starting site of the M gene are identical to those found in the same sites in the CV777 strain (6).

It is hoped that these data will not only facilitate future investigations of the molecular pathogenesis of PEDV but also contribute to the elucidation of the structure and function of the coronavirus gene-derived products.

Nucleotide sequence accession number.

The complete genome sequence of PEDV strain CH/S was deposited in GenBank under accession no. JN547228.


This work was supported by grants from the National Key Laboratory of Veterinary Biotechnology (SKLVBP201028), National Natural Science Foundation of China (30901081), Basic Scientific Research Fund of Central Research Academies and Institutes (ZGKJ201012), and Technology Research and Development Program of Harbin (2009AA6AN029).


1. Boursnell M. E. G., et al. 1987. Completion of the sequence of the genome of the coronavirus avian infectious bronchitis virus. J. Gen. Virol. 68:57–77 [PubMed]
2. Brian D. A., Baric R. S. 2005. Coronavirus genome structure and replication. Curr. Top. Microbiol. Immunol. 287:1–30 [PubMed]
3. Chouljenko V. N., Lin X. Q., Storz J., Kousoulas K. G., Gorbalenya A. E. 2001. Comparison of genomic and predicted amino acid sequences of respiratory and enteric bovine coronaviruses isolated from the same animal with fatal shipping pneumonia. J. Gen. Virol. 82:2927–2933 [PubMed]
4. Cui X., et al. 1990. Studies on the detection of porcine epidemic diarrhea virus by immunofluorescent techniques. Chin. J. Prev. Vet. Med. 5:20–24(In Chinese).
5. Das Sarma J., Fu L., Hingley S. T., Lavi E. 2001. Mouse hepatitis virus type-2 infection in mice: an experimentalmodel system of acute meningitis and hepatitis. Exp. Mol. Pathol. 71:1–12 [PubMed]
6. Duarte M., Gelfi J., Lambert P., Rasschaert D., Laude H. 1993. Genome organization of porcine epidemic diarrhoea virus. Adv. Exp. Med. Biol. 342:55–60 [PubMed]
7. Enjuanes L., et al. 2001. Coronavirus derived expression systems. J. Biotechnol. 88:183–204 [PubMed]
8. Herold J., Raabe T., Schelle-Prinz B., Siddell S. G. 1993. Nucleotide sequence of the human coronavirus 229E RNA polymerase locus. Virology 195:680–691 [PubMed]
9. Kocherhans R., Bridgen A., Ackermann M., Tobler K. 2001. Completion of the porcine epidemic diarrhoea coronavirus (PEDV) genome sequence. Virus Genes 23:137–144 [PubMed]
10. Lai M. M. C., Cavanagh D. 1997. The molecular biology of coronaviruses. Adv. Virus. Res. 48:1–100 [PubMed]
11. Leparc-Goffart I., et al. 1997. Altered pathogenesis of a mutant of the murine coronavirus MHV-A59 is associated with a Q159L amino acid substitution in the spike protein. Virology 239:1–10 [PubMed]
12. Penzes Z., et al. 2001. Complete genome sequence of transmissible gastroenteritis coronavirus PUR46-MAD clone and evolution of the Purdue virus cluster. Virus Genes 23:105–118 [PubMed]
13. Rota P. A., et al. 2003. Characterization of a novel coronavirus associated with severe acute respiratory syndrome. Science 300:1394–1399 [PubMed]
14. Sawicki S. G., Sawicki D. L. 1998. A new model for coronavirus transcription. Adv. Exp. Med. Biol. 440:215–218 [PubMed]
15. Thiel V., Herold J., Schelle B., Siddell S. G. 2001. Infectious RNA transcribed in vitro from a cDNA copy of the human coronavirus genome cloned in vaccinia virus. J. Gen. Virol. 82:1273–1281 [PubMed]
16. Woo P. C., et al. 2005. Characterization and complete genome sequence of a novel coronavirus, coronavirus HKU1, from patients with pneumonia. J. Virol. 79:884–895 [PMC free article] [PubMed]
17. Yoo D., Pei Y. 2001. Full-length genomic sequence of bovine coronavirus (31 kb): completion of the open reading frame 1a/1b sequences. Adv. Exp. Med. Biol. 494:73–76 [PubMed]

Articles from Journal of Virology are provided here courtesy of American Society for Microbiology (ASM)
PubReader format: click here to try


Related citations in PubMed

See reviews...See all...

Cited by other articles in PMC

See all...


Recent Activity

Your browsing activity is empty.

Activity recording is turned off.

Turn recording back on

See more...