![]() | ![]() |
Formats:
|
||||||||||
Copyright © 2008 The authors. Evidence for a Complex Mosaic Genome Pattern in a Full-length Hepatitis C Virus Sequence 1 Institute of Virology, National Reference Centre for HCV, Essen University Hospital, University of Duisburg-Essen, Essen, Germany 2 Laboratory of Clinical Virology, Department of Microbiology and Immunology, Rega Institute for Medical Research, University of Leuven, Leuven, Belgium Correspondence: R.S. Ross, Institute of Virology, National Reference Centre for HCV, Essen University Hospital, University of Duisburg-Essen, Essen, Germany. Tel: +49 201 7233561; Fax: +49 201 7235039; Email: stefan.ross/at/uni-due.de 3Both authors contributed equally to this study. This article is published under the Creative Commons Attribution By licence. For further information go to: http://creativecommons.org/licenses/by/3.0. Abstract The genome of the hepatitis C virus (HCV) exhibits a high genetic variability. This remarkable heterogeneity is mainly attributed to the gradual accumulation of mutational changes, whereas the contribution of recombination events to the evolution of HCV remains controversial so far. While performing phylogenetic analyses including a large number of sequences deposited in the GenBank, we encountered a full-length HCV sequence (AY651061) that showed evidence for inter-subtype recombination and was, therefore, subjected to a detailed analysis of its molecular structure. The obtained results indicated that AY651061 does not represent a “simple” HCV 1c isolate, but a complex 1a/1c mosaic genome, showing five putative breakpoints in the core to NS3 regions. To our knowledge, this is the first report on a mosaic HCV full-length sequence with multiple breakpoints. The molecular structure of AY651061 is reminiscent of complex homologous recombinant variants occurring among other members of the flaviviridae family, e.g. GB virus C, dengue virus, and Japanese encephalitis virus. Our finding of a mosaic HCV sequence may have important implications for many fields of current HCV research which merit careful consideration. Keywords: flaviviruses, hepatitis C virus, sequence AY651061, recombination, recombination analysis software, phylogeny programs Introduction The hepatitis C virus (HCV) is a single-stranded RNA pathogen belonging to the genus Flavivirus in the flaviviridae family (Garnier et al. 2002). Six major HCV genotypes and almost 80 confirmed or at least provisionally assigned subtypes have been identified (Simmonds et al. 2005), generally showing a distinct geographic distribution (Lavanchy and MacMahon, 2000; Weck, 2005). HCV exhibits a high genetic variability. This remarkable heterogeneity is mainly attributed to the gradual accumulation of mutational changes, primarily due to the error-prone nature of the RNA-dependent RNA-polymerase (Simmonds, 2004; Simmonds et al. 2005). The contribution of recombination events to the evolution of HCV, however, remains controversial. Researchers paid considerable attention to the identification and characterisation of possible HCV recombinants after a first mosaic HCV genome had been described in 2002 (Kalinina et al. 2002). Consequently, during the last six years the occurrence of additional inter-genotypic (Kageyama et al. 2006; Noppornpanth et al. 2006; Legrand-Abravanel et al. 2007), inter-subtype (Colina et al. 2004), or inter-quasispecies (Moreno et al. 2006) recombinant variants with yet unknown replicative and clinical potentials were reported worldwide. In chimpanzees, inoculated simultaneously with HCV subtypes 1a, 1b, 2a, and 3a, recombination between the different genomes was also noted (Gao et al. 2007). Although multiple recombination events were reported for other members of the flaviviridae family (Twiddy and Holmes, 2003), we are not aware of any such observations in HCV. In this communication, we therefore present for the first time evidence for numerous inter-subtype breakpoints in an HCV full-length sequence. Materials and Methods During a survey of approximately 1,200 partial HCV core sequences retrieved from GenBank (http://www.ncbi.nlm.nih.gov/Genbank/index.html. Last accessed August 6, 2008), we encountered several ambiguous HCV genotype and subtype assignments (Ross et al. 2008). Among these deviating HCV isolates was a full-length sequence that had been deposited in GenBank under the accession number AY651061. The details of the amplification and cloning procedures used to generate this particular sequence can be inferred from the recent patent US7348011B2 (available at: http://depatisnet.dpma.de/. Last accessed August 6, 2008). In brief, 11 overlapping nucleotide fragments covering the entire genome were amplified by RT-PCR from a serum sample of an Indian patient with chronic HCV infection. After purification of the amplicons from the gel, the fragments were inserted into a pET21 (+) vector. Subsequently, competent E. coli BL21 (DE3) cells were transformed with the obtained plasmid DNA and selected on a LB agar plate with antibiotic and ITPG/X-gal. Various clones expressing high levels of the inserted HCV fragments were chosen for inoculation of LB medium. Plasmid DNA was prepared by an alkaline lysis method. All expanded clones were digested to excise the respective HCV fragments, which were subjected to confirmatory sequencing. The multiple sequences thus obtained for the different regions of the HCV genome were joined by Chromas and Chromas-pro software. Before submission to the GenBank, the entire genome was also cloned and sequenced. AY651061 was subsequently classified as a subtype 1c variant. Since our analyses based on nucleotides 461–676 (numbering according to Choo et al. 1991) of AY651061 consistently indicated a clustering with genome fragments from HCV subtype 1a and not 1c variants, this phylogenetic incongruence prompted us to carry out a more detailed investigation of this specific strain. First, we performed maximum likelihood (ML) analyses of the core, E1, E2, p7, NS2 and NS3 regions of AY651061 and of HCV reference strains from GenBank by using Paup* v. 4.0 (Swafford, 2008). For each genomic region, the evolutionary model was selected by Modeltest 3.7 (available at: http://darwin.uvigo.es/software/modeltest.html. Last accessed August 6, 2008) (Posada and Crandall, 1998). Phylogenies were estimated by an extensive ML approach with nucleotide substitution models and rate heterogeneity parameters (proportion of invariable sites and alpha shape of the discredited gamma distribution) as determined by the program. Bootstrap analysis (5,000 replicates) was performed using the neighbour-joining (NJ) method. Next, putative recombination events and corresponding breakpoints were identified with SimPlot, v. 3.5.1 (available at: http://sray.med.som.jhmi.edu/SCRoftware/simplot/. Last accessed August 6, 2008) (Lole et al. 1999). The window width and the step size were set at 400 bp and 20 bp respectively. Bootscanning was performed with AY651061 as a query sequence. Finally, ML trees of the genome fragments between the identified breakpoints were reconstructed by using Paup*, v. 4.0. Also for these tree reconstructions, the exhaustive ML analysis was conducted with the model settings as selected by Modeltest 3.7, and NJ bootstrap analysis (5,000 replicates) was performed. Results Our initial observation on a 216 bp core fragment of AY651061 was corroborated by further phylogenetic analyses of the E1 to NS3 regions of this particular viral strain. As shown in Figure 1
Discussion To our knowledge, this is the first report on a putative HCV full-length sequence with multiple breakpoints. The interpretation of the findings reported in this communication, like the conclusions drawn in most comparable studies on other members of the family flaviviridae (Holmes et al. 1999; Worobey et al. 1999; Worobey and Holmes, 2001; Twiddy and Holmes, 2003), was evidently based on scrutinising information deposited in the GenBank database by the use of biomathematical tools. The details on the strategy of genome sequencing and cloning of AY651061 available from patent US7348011B2 on the one hand show that the size and location of the individual PCR fragments utilised to generate the full-length sequence do not correspond to the recombination breakpoints identified by our investigation. This observation, in conjunction with the fact that several clones of both the subgenomic fragments and the entire genome were analysed by R.V. Guntaka and co-workers, strongly argue against the consideration that the mosaic structure of AY651061 was simply the result of sequencing errors involving a contaminated or multiple infected sample (Meyerhans et al. 1990; Odelberg et al. 1995). On the other hand, we could not entirely exclude this possibility since we did not have direct access to the original material containing the Khajal HCV isolate, therefore preventing us from further molecular analyses like the use of HCV 1a and 1c subtype-specific oligonucleotide primers spanning the identified recombination breakpoints. The AY651061 sequence has already been described in a report on HCV recombination published by Cristina and Colina (2006). These authors, however, included AY651061 as a non-recombinant reference sequence in their SimPlot analyses and, therefore, their impression that the query sequence D10749 is a 1a/1c recombinant form with breakpoints in the E1/E2 regions now has to be put under question. A similar classification artefact due to the inclusion of recombinants as reference sequence has been identified for HIV-1 (Abecasis et al. 2007). Interestingly, the putative breakpoints that we identified in AY651061 were almost evenly distributed over the first 4,000 nucleotides of the genome, covering the core to NS3 regions. Thus, these recombination events were not located predominantly in the NS2/NS3 (Kalinina et al. 2002; Kageyama et al. 2006; Noppornpanth et al. 2006; Legrand-Abravanel et al. 2007) or the NS5 regions (Colina et al. 2004; Moreno et al. 2006) that had been described previously as the most likely sites for the occurrence of HCV recombination events. The molecular structure of AY651061, as revealed by our study, is highly reminiscent of findings in full-length sequences of another member of the genus Flavivirus, i.e. GB virus C (GBV-C), a pathogen closely related to HCV (Simons et al. 1995). In GBV-C, numerous homologous recombinations were detected, leading to the formation of genomes with a rather complex mosaic composition (Twiddy and Holmes, 2003). Worobey and Holmes (2001), for instance, reported on three such GBV-C sequences, one of which showed signs of no less then nine apparent recombination events involving genetic material from at least four different sources and three GBV-C subtypes. Besides the observations in GBV-C sequences, multiple recombinations were also detected in a number of mosquito-borne flaviviruses, among them different serotypes of dengue virus (Holmes et al. 1999; Worobey et al. 1999; Tolou et al. 2001; Twiddy and Holmes, 2003), Japanese encephalitis virus (Twiddy and Holmes, 2003; Gould et al. 2004), and St. Louis encephalitis virus (Twiddy and Holmes, 2003). Our first observation on a complex mosaic genome pattern in HCV, has to be confirmed and extended by further reports. In the light of the ever increasing amount of HCV sequences available in publically accessible databases, the growing awareness of the possibility of HCV recombination, and the advent of more powerful biomathematical tools that facilitate the detection and characterisation of multiple recombination events (Worobey and Holmes, 1999), we are confident that more sequences will eventually be added to the list of complex mosaic HCV genomes. However, they will probably remain a limited fraction of all available HCV genome sequences. Undoubtedly, the recognition of additional multiple recombinant HCV forms will raise numerous new questions related to HCV research and may also lead to a reconsideration of the current concept of HCV genotyping which is essentially based on the intrinsic assumption that the genotype and subtype assignment inferred from one region also holds true for the genome as a whole (Nolte, 2001; Simmonds et al. 2005; Weck, 2005). Acknowledgements These investigations were supported in part by grants from the German Federal Ministry of Education and Science (HepNet projects 2.2 and 4.2) and the German Federal Ministry of Health (grant by the Robert Koch Institute to the German National Reference Centre for Hepatitis C). The authors are grateful to Frank-Michael Schmidt, EngD (Essen, Germany), for his help in getting access to patent US7348011B2. Footnotes Conflict of Interest The authors declare that they do not have any conflict of interest. References
|
PubMed related articles
Your browsing activity is empty. Activity recording is turned off. |
|||||||||
Hepatology. 2005 Oct; 42(4):962-73.
[Hepatology. 2005]J Gen Virol. 2004 Nov; 85(Pt 11):3173-88.
[J Gen Virol. 2004]J Virol. 2002 Apr; 76(8):4034-43.
[J Virol. 2002]J Med Virol. 2006 Nov; 78(11):1423-8.
[J Med Virol. 2006]J Virol. 2006 Aug; 80(15):7569-77.
[J Virol. 2006]J Virol Methods. 2008 Mar; 148(1-2):25-33.
[J Virol Methods. 2008]Proc Natl Acad Sci U S A. 1991 Mar 15; 88(6):2451-5.
[Proc Natl Acad Sci U S A. 1991]Bioinformatics. 1998; 14(9):817-8.
[Bioinformatics. 1998]J Virol. 1999 Jan; 73(1):152-60.
[J Virol. 1999]Mol Biol Evol. 1999 Mar; 16(3):405-9.
[Mol Biol Evol. 1999]Proc Natl Acad Sci U S A. 1999 Jun 22; 96(13):7352-7.
[Proc Natl Acad Sci U S A. 1999]Mol Biol Evol. 2001 Feb; 18(2):254-61.
[Mol Biol Evol. 2001]J Gen Virol. 2003 Feb; 84(Pt 2):429-40.
[J Gen Virol. 2003]Nucleic Acids Res. 1990 Apr 11; 18(7):1687-91.
[Nucleic Acids Res. 1990]Virol J. 2006 Jun 30; 3():53.
[Virol J. 2006]J Virol. 2007 Aug; 81(16):8543-51.
[J Virol. 2007]J Virol. 2002 Apr; 76(8):4034-43.
[J Virol. 2002]J Med Virol. 2006 Nov; 78(11):1423-8.
[J Med Virol. 2006]J Virol. 2006 Aug; 80(15):7569-77.
[J Virol. 2006]Nat Med. 1995 Jun; 1(6):564-9.
[Nat Med. 1995]J Gen Virol. 2003 Feb; 84(Pt 2):429-40.
[J Gen Virol. 2003]Mol Biol Evol. 2001 Feb; 18(2):254-61.
[Mol Biol Evol. 2001]Mol Biol Evol. 1999 Mar; 16(3):405-9.
[Mol Biol Evol. 1999]Proc Natl Acad Sci U S A. 1999 Jun 22; 96(13):7352-7.
[Proc Natl Acad Sci U S A. 1999]J Gen Virol. 1999 Oct; 80 ( Pt 10)():2535-43.
[J Gen Virol. 1999]Hepatology. 2005 Oct; 42(4):962-73.
[Hepatology. 2005]