Genetic characterisation of the influenza viruses circulating in Bulgaria during the 2019–2020 winter season

Influenza viruses have a high potential for genetic changes. The objectives of this study were to analyse influenza virus circulation in Bulgaria during the 2019/2020 season, to perform a phylogenetic and molecular analyses of the haemagglutinin (HA) and neuraminidase (NA) sequences of representative influenza strains, and to identify amino acid substitutions compared to the current vaccine strains. Seasonal influenza viruses A(H3N2), A(H1N1)pdm09 and B/Victoria-lineage were detected using a real-time RT-PCR in 323 (23.3%), 149 (10.7%) and 138 (9.9%) out of 1387 patient samples studied, respectively. The HA genes of A(H3N2) viruses analysed belonged to clades 3C.3a (21 strains) and 3C.2a (5 strains): subclades 3C.2a1b + T131K, 3C.2a1b + T135K-B and 3C.2a1b + T135K-A. The clade 3C.3a and subclade 3C.2a1b viruses carried 5 and 14–17 substitutions in HA, as well as 3 and 9 substitutions in NA, respectively, in comparison with the A/Kansas/14/2017 vaccine virus, including some substitutions in the HA antigenic sites A, B, C and E. All 21 A(H1N1)pdm09 viruses sequenced fell into 6B.1A5A subclade. Amino acid sequence analysis revealed the presence of 7–11 substitutions in HA, compared to the A/Brisbane/02/2018 vaccine virus, three of which occurred in antigenic site Sb, along with 6–9 changes at positions in NA. All 10 B/Victoria-lineage viruses sequenced belonged to clade 1A with a triple deletion in HA1 (genetic group 1A(Δ3)B) and carried 7 and 3 substitutions in HA and NA, respectively, with respect to the B/Colorado/06/2017 vaccine virus. The results of this study confirm the rapid evolution of influenza viruses and the need for continuous antigenic and genetic surveillance. Supplementary Information The online version contains supplementary material available at 10.1007/s11262-021-01853-w.


Introduction
Influenza is one of the most common infectious illnesses in humans that affects annually 5-10% of the adult population and 20-30% of children causing a huge number of outpatient visits, hospitalisations and fatal cases, thus imposing a substantial public health burden [1]. Since 2009, two subtypes of influenza A viruses, A(H1N1)pdm09 and A(H3N2), and two genetic lineages of B viruses, B/Victoria and B/Yamagata, have been circulating during seasonal epidemics. There are temporal and geographical variations in their prevalence, clinical activity, as well as in the antigenic and genetic characteristics of the circulating strains.
Influenza viruses belong to the Orthomyxoviridae family and have a segmented, negative-sense, single-stranded RNA genome, each segment of which encodes at least one protein [2]. The surface glycoproteins haemagglutinin (HA) and neuraminidase (NA) form discrete spikes and play an important part in a viral life cycle. HA mediates the binding of the virus to the sialic acids as cellular receptors and the entry of the viral ribonucleoprotein into the cell cytoplasm by fusing of viral envelope and cell membrane. NA destroys sialic acid residues on the cell surface and provides the release of newly formed influenza viruses from the infected cells. HA is the primary target for virus-neutralising antibodies and together with NA is a key component of influenza vaccines.
Vaccination is currently the best way to reduce the influenza morbidity and mortality, but the effectiveness of influenza vaccines is often suboptimal due to the highly variable nature of influenza viruses and their ability to evade the existing humoral immunity. Understanding the molecular Edited by Hartmut Hengel. 1 3 mechanisms by which influenza viruses avoid the immunity induced by a prior natural infection or vaccination is crucial for the development of effective prevention and control strategies. Due to the absence of a proofreading capability of the viral RNA polymerase complex and the high replication rate of influenza viruses, point mutations occur constantly in the viral genome. Mutations which enable the virus to overcome host immune protection, are accumulated in HA and, to a lesser extent, in NA as a result of immune selection pressure, a process known as antigenic drift [3]. Mutant viruses with altered antigenic characteristics are capable of causing reinfections in immune individuals and repeated epidemics in the general population. It has been found that a limited number of amino acid substitutions at the antibodybinding sites around the receptor-binding pocket can result in escape recognition of neutralising antibodies. With the help of escape mutants, five epitopes of neutralising antibodies, known as antigenic sites, have been identified in the globular heads of the HAs of both A(H1N1) (Ca1, Ca2, Cb, Sa, Sb) and A(H3N2) (A-E) viruses [4,5]. Influenza type B viruses contain four major immunodominant epitopes in the HA1 subunit comprising the 120-loop, 150-loop, 160-loop, and 190-helix [6]. In addition to mutations, glycosylation of HA and NA (attachment of an oligosaccharide to the sequon N X S/T, where X may represent any amino acid except P) is another important mechanism used by viruses to evade the antibody-mediated immunity. Recent studies have suggested that the attached glycan chains shield antigenically significant regions reducing access of antibodies and protect the enzymatic sites of NA [7]. The frequent emergence of new genetic variants of influenza viruses that are rapidly replacing previously circulating strains, necessitates regular updates in the influenza vaccines composition in order to match epidemic strains.
Continuous monitoring of circulating influenza viruses is needed to identify new genetic variants of influenza viruses with the potential to avoid host immune defence, with increased virulence, transmissibility or reduced susceptibility to antiviral drugs. The main objectives of this study were to analyse the influenza virus circulation in Bulgaria during the 2019/2020 season and to determine the genetic and molecular characteristics of the detected viruses as compared to the current vaccine strains.

Study population and specimen collection
From October 2019 to March 2020, patients experiencing acute respiratory infection (ARI) from all 28 regions of the country were enrolled in the national influenza surveillance programme (Fig. 1). The diagnosis of each patient was determined by their attending physician based on standard clinical criteria. Nasopharyngeal specimens from the enrolled patients were collected using polyester collection swabs during the visit to the general practitioner or within the first 24 h of admission. The containers containing a nasopharyngeal swab placed into 2 ml of virus transport medium, were stored at 4 °C for up to 72 h and shipped to the National Laboratory "Influenza and ARI", which is recognised by the World Health Organization (WHO) as a National Influenza Centre. Specimens were processed immediately or stored at − 80 °C before testing.

Extraction of viral nucleic acids and real-time RT-PCR
Viral RNAs were extracted automatically from nasopharyngeal specimens using a commercial ExiPrep Dx Viral DNA/ RNA kit and ExiPrep 16DX equipment (BioNeer, Korea) in accordance with the manufacturer's instructions. Clinical specimens were screened for influenza viruses by a real-time RT-PCR method using a SuperScript III Platinum® One- Step qRT-PCR System (Invitrogen, USA). Primers, probes and positive controls were provided by the International Reagent Resource, USA. Amplification was performed with a CFX96 thermal cycler (Bio-Rad) in accordance with the protocol recommended by the Centers for Disease Control and Prevention (CDC), Atlanta, USA (reverse transcription at 50 °C for 30 min, Taq activation at 95 °C for 2 min, followed by 45 cycles of denaturation at 95 °C for 15 s and annealing at 55 °C for 30 s) [8].

Viral isolation and antigenic characterisation
PCR-positive clinical specimens with a high viral load (C t value < 28) were selected and inoculated onto cultures of Madin Darby canine kidney (MDCK) and MDCK-SIAT1 (a cell line expressing increased levels of sialyl-α2,6galactose moieties [9]). Cultures were incubated at 35 °C in an atmosphere of 5% CO 2 and observed daily for 7 days for visual cytopathic effect. The presence of virus in culture was confirmed by haemagglutination assay following standard protocols using a 1% suspension of guinea pig red blood cells. Antigenic characterisation of influenza isolates was performed by the haemagglutination inhibition (HAI) assay following the WHO Manual, using vaccine viruses/ antigens and their corresponding antisera kindly provided by the WHO Collaborating Centres (WHO-CCs) in London and CDC, Atlanta [10]. More detailed HAI assays of representative Bulgarian influenza isolates with panels of reference viruses and antisera were performed at the WHO-CCs in London and Atlanta. Viral isolates were considered as antigenically related to the vaccine virus if they showed no more than a fourfold reduced HAI titre with antiserum raised against the vaccine virus, as compared to the homologous titre. A reduction of at least eightfold in the HAI titres was considered a signal of antigenic difference.

Genetic characterisation
The sequencing of influenza viruses detected in Bulgaria during the 2019/2020 season was performed at the WHO-CCs in London and Atlanta. Consensus sequences have been deposited in the EpiFlu database of the Global Initiative on Sharing All Influenza Data (GISAID) [11]. For phylogenetic analyses, sequences of study viruses, reference viruses with known genetic group identities determined by the WHO-CCs [12], vaccine strains, and viruses representing different countries of Europe during the 2019/2020 season, were retrieved from the EpiFlu database of GISAID (Supplementary Table). Multiple alignments for HA and NA sequences were performed using the MUSCLE algorithm embedded in the MEGA, version 6.06 software [13]. The best-fit nucleotide substitution models for phylogenetic analysis of HA (Hasegawa-Kishino-Yano model with a gamma distribution, HKY + G) and NA (Tamura 3-parameter model with gamma distribution, T92 + G) were determined using MEGA 6.06. Phylogenetic trees were constructed using Maximum Likelihood method. HA and NA nucleotide sequences of the Bulgarian influenza viruses analysed in this study are available in GISAID under the identifiers presented in the Supplementary Table.

Deduced amino acid sequence analysis and prediction of N-glycosylation motifs
Deduced amino acid sequences were generated by translating nucleotide sequences with the standard genetic code using MEGA 6.06 software. The amino acid sequence identities were calculated using FluServer [14]. The presence of putative N-linked glycosylation sites, N X S/T (sequon), where X can be any amino acid except proline, was identified using the NetNGlyc 1.0 Server [15].

Statistics
The detection rates of the individual viruses were compared using Chi-square or Fisher's exact tests for categorical variables. P values < 0.05 were considered statistically significant.

Results
The 2019/2020 influenza epidemic lasted for 6 weeks (weeks 2-8 of 2020) and peaked at week 5. A second, small peak in morbidity due to B/Victoria-lineage viruses was registered at week 10 of 2020 (Fig. 2).

Patient characteristics
A total of 1387 patients presenting with ARI were tested for influenza viruses. About 23.3% (323/1387) of these patients attended outpatient healthcare centres, 76.7% (1064/1387) were hospitalised patients, of which 27 were treated in intensive care units (ICUs). The patients' ages ranged from 13 days to 88 years; 676 (48.7%) of the participants were male. Most clinical samples (683, 49.2%) were obtained from patients living in the capital Sofia. The number of samples tested from other regions ranged from 10 to 73 (Fig. 1).  20.2%, P < 0.05), and the percentage of detected B/Victoria-lineage viruses was higher in Sofia city (13% vs 6.7%, P < 0.05).

Influenza virus detection
The first influenza detection, an A(H1N1)pdm09 virus, occurred in week 51/2019, and the last, an A(H3N2) virus, occured in week 11/2020. Increased circulation of influenza B/Victoria-lineage viruses was seen at weeks 8-10 of 2020 (Fig. 2). After week 11/2020, influenza virus testing was discontinued due to overloading of the laboratory with SARS-CoV-2 testing.

Phylogenetic and antigenic characterisation
A total of 57 influenza viruses detected in 13 clinical specimens and 70 cell-culture isolates, were sequenced at WHO-CCs in London and Atlanta and deposited in the EpiFlu database of GISAID. Phylogenetic analysis was performed to determine the affiliation of Bulgarian strains to the genetic groups of globally circulating influenza viruses. The antigenic characterisation using HAI assays was aimed to determine the antigenic similarity of the Bulgarian isolates with the current vaccine strains.
Phylogenetic analysis of the HA genes of all 21 A(H1N1) pdm09 viruses sequenced showed that they clustered within the subclade 6B.1A5A (Fig. 4). Ferret antisera raised against egg-propagated vaccine virus A/Brisbane/02/2018 inhibited the eleven antigenically characterised A(H1N1) pdm09 viruses at titres equal to, or 2-to 4-fold lower than the homologous titre of the antiserum.
All the 10 B/Victoria-lineage viruses belonged to HA clade 1A with a three amino acid deletion in HA1 (positions 162-164) (Fig. 5). The nine viruses analysed by HAI assay were poorly recognised by antisera raised against egg-propagated vaccine virus, B/Colorado/06/2017, which contains a two amino acid deletion in HA (positions 162 and 163).

HA and NA protein sequence analysis
HA and NA protein sequences of influenza viruses circulating in Bulgaria during the 2019/2020 season were compared to those of egg-propagated vaccine strains in order to identify substitutions that could affect vaccine effectiveness.

Discussion
In the present study, the circulation pattern and genetic characteristics of influenza viruses detected in Bulgaria during the 2019/2020 season were examined. Influenza A(H3N2) viruses prevailed (53.1%), and A(H1N1)pdm09 and B/Victoria-lineage viruses circulated in similar proportions (24.5% and 22.7%, respectively). No B/Yamagata-lineage viruses were identified. Cumulative data for the WHO European region showed that most viruses detected (73%) were type A with a slight predominance (56%) of A(H1N1)pdm09 viruses over A(H3N2) viruses (44%), and 27% were type B, with 98% of them determined to be B/Victoria-lineage [23].
Among the seasonal influenza viruses, A(H3N2) have been characterised by the highest rate of evolution and significant genetic diversity [26][27][28]. Since 2009, seven genetic groups and multiple clades/subclades based on the HA gene have been defined within this subtype. In 2014, two genetically and antigenically divergent genetic clades, 3C.2a and 3C.3a, emerged and continued to circulate widely worldwide in the following seasons [29]. Viruses in clade 3C.2a rapidly became predominant in most geographic areas including in Bulgaria [25,30]. Over this period, 3C.2a viruses diverged into several subclades: 3C.2a1 (3C.2a1a and 3C.2a1b), 3C.2a2, 3C.2a3, and 3C.2a4 [16]. Unlike the previous four seasons, during the 2019/2020 influenza epidemic in Bulgaria, the majority (80.8%) circulating A(H3N2) viruses fell into the genetic clade 3C.3a, and the remainder (19.2%) belonged to subclade 3C.2a1b, which included three subdivisions and was not represented in the vaccine. The clade 3C.3a viruses were antigenically and genetically close to the vaccine virus, while those of subclade 3C.2a1b and its subdivisions differed significantly due to the presence of a great number of amino acid mismatches, 6-9 of which were within the HA antigenic sites A, B, C and E (Table 1). Antigenic sites B and A, located on the top of HA1 around the RBS are main targets of human neutralising antibodies. It has been found that from 1968 to 2003, the major antigenic changes have been caused mainly by single amino acid substitutions which occur at only seven positions in HA (145 in site A and 155, 156, 158, 159, 189, 193 in site B) near the RBS [31]. In our study, substitution S159Y was found in one of these key positions. HAs of A(H3N2) viruses have been characterised by a high degree of glycosylation, which probably facilitates immune evasion. Previous studies showed that the number of HA glycosylation sequons in A(H3N2) viruses increases in the years following their introduction into the human population. While the pandemic A(H3N2) strain Hong Kong/1/68 harboured two N-glycosylation sites at positions 81 and 165 on the head of HA, the most recent representatives contained 7 additional N-glycosylation sites on the HA globular head and 5 on the stem region [16]. Six potential N-glycosylation motifs were identified at the antigenic regions of Bulgarian A(H3N2) viruses. Attachment of oligosaccharide chains to the antigenic sites of HA may alter the antigenicity and virulence of influenza viruses [32]. The glycan at position 158 is located immediately adjacent to the amino acid residues at positions 156, 158 and 159 that are responsible for major antigenic cluster transitions.
Since the 2009/2010 pandemic, the A(H1N1)pdm09 virus has evolved continuously and has undergone significant diversification in eight major genetic groups and several clades/subclades based on the HA gene. The appearance of the recent clade 6B.1 viruses at the beginning of the NH 2015/2016 season was accompanied by many severe and fatal influenza cases in a number of countries, including Bulgaria [33,34]. The vast majority of recently globally circulating viruses formed a subclade designated 6B.1A5A and all sequenced Bulgarian A(H1N1)pdm09 viruses belonged to this subclade. A total of 7-11 amino acid variations (three located at antigenic site Sb and two at site Sa) were identified in HA and 6-9 in NA compared to the vaccine virus, A/ Brisbane/2/2018 ( Table 1). Attachment of glycan at position 162 of the antigenic site Sa is likely to affect antibody recognition. Despite these genetic changes, subclade 6B.1A5A viruses remained antigenically similar to the vaccine virus as assessed by the use of post-infection ferret antisera raised against the vaccine virus [12]. However, due to the increasing number of viruses A(H1N1)pdm09+D187A and Q189E, which were antigenically distinguishable from the vaccine virus by panels of post-vaccination human sera from the 2019-2020 NH influenza season, A/Guangdong-Maonan/ SWL1536/2019 virus was recommended as a vaccine component for the 2020/2021 NH influenza season [35].
In Bulgaria, all type B viruses detected in 2019/2020 belonged to the Victoria-lineage. Previously, an increased circulation of B/Victoria-lineage viruses was observed in the country in the 2015-2016 season [33]. Recently circulating viruses with a double (Δ162-163) or a triple deletion (Δ162-164) in HA1 appeared for the first time during the 2016/2017 season [36]. Seven amino acid substitutions in HA1, four of which were located in the antigenic 120-loop (n = 3) and 190-helix (n = 1), and 5 substitutions in NA were identified in Bulgarian strains with respect to the current vaccine virus (Table 1). According to previous studies, the 120-loop exhibits a highest degree of variations [37]. The 190-helix is a part of the RBS. Amino acid substitutions within these regions could potentially alter antigenicity and virulence [22,38,39]. This statement was confirmed by the results of antigen characterisation, which showed that the Bulgarian B/Victoria-lineage viruses were antigenically distinct from the contemporary vaccine component.
In conclusion, co-circulation of seasonal influenza viruses A(H3N2), A(H1N1)pdm09 and B/Victoria-lineage was observed during the 2019/2020 season in Bulgaria. The results of this study showed the emergence of new genetic variants, some of which had alterations in antigenicity compared to the current vaccine strains. The study confirms the rapid evolution of influenza viruses and the need for continuous antigenic and genetic surveillance.

Data availability
The datasets analysed during the current study are available in the GISAID database (https:// www. gisaid. org/).

Conflict of interest
The authors declare they have no financial interests.
Ethical approval This work was performed in line with The Code of Ethics of the World Medical Association (Declaration of Helsinki) for experiments involving humans. The study was approved by the Institutional Review Board/ Institutional Ethics Committee of the National Centre of Infectious and Parasitic Diseases (Number IRB 00006384).

Consent to participate
Written informed consent to participate in the study was obtained from all individual participants (or parents/ legal guardians in the case of children under 16) before specimen collection and testing.
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http:// creat iveco mmons. org/ licen ses/ by/4. 0/.