• We are sorry, but NCBI web applications do not support your browser and may not function properly. More information
Logo of jcmPermissionsJournals.ASM.orgJournalJCM ArticleJournal InfoAuthorsReviewers
J Clin Microbiol. Apr 2007; 45(4): 1190–1199.
Published online Jan 24, 2007. doi:  10.1128/JCM.02078-06
PMCID: PMC1865833

Identification of Variable-Number Tandem-Repeat (VNTR) Sequences in Legionella pneumophila and Development of an Optimized Multiple-Locus VNTR Analysis Typing Scheme[down-pointing small open triangle]


The utility of a genotypic typing assay for Legionella pneumophila was investigated. A multiple-locus variable number of tandem repeats (VNTR) analysis (MLVA) scheme using PCR and agarose gel electrophoresis is proposed based on eight minisatellite markers. Panels of well-characterized strains were examined in a multicenter analysis to validate the assay and to compare its performance to that of other genotyping assays. Excellent typeability, reproducibility, stability, and epidemiological concordance were observed. The MLVA type or profile is composed of a string of allele numbers, corresponding to the number of repeats at each VNTR locus, separated by commas, in a predetermined order. A database containing information from 99 L. pneumophila serogroup 1 strains and four strains of other serogroups and their MLVA profiles, which can be queried online, is available from http://bacterial-genotyping.igmors.u-psud.fr/.

The genus Legionella currently comprises 50 validly published species, and many more await full characterization (12). Legionellae are widely distributed in the environment, where they can multiply inside amoebae and ciliates. More than one-third of characterized Legionella species have been isolated from patients, but Legionella pneumophila is found in the majority of culture-confirmed cases of Legionnaires' disease. Three subspecies have been defined: L. pneumophila subsp. pneumophila, L. pneumophila subsp. fraseri, and L. pneumophila subsp. pascullei (1).

L. pneumophila can be differentiated into at least 15 serogroups (sg) (5). L. pneumophila sg 1 isolates account for more than 90% of clinical isolates (13), with the remaining 14 sg being differently associated with human infection. A high genetic diversity is observed within L. pneumophila and even within sg 1 isolates, as shown by other studies (6, 21, 22). Sequencing of the genome of three sg 1 strains associated with large outbreaks, the original Philadelphia-1 strain, the Paris strain, and the Lens strain have confirmed the considerable intraspecific genetic divergence, with up to 10% of DNA sequences unique to one of the three strains (2, 3).

Many phenotypic and genotypic methods for epidemiological typing of L. pneumophila have been developed, some being costly and time-consuming and not always enabling interlaboratory comparison (7). Amplified fragment length polymorphism (AFLP) analysis was adopted as a standard by the European Working Group for Legionella Infections (EWGLI) (6, 8, 9) and was widely used to allocate L. pneumophila sg 1 strains into coded types by interrogation of a reference database through an internet-based protocol. However, this technique relies on the analysis of a multiband pattern, and too many variables exist which complicate unambiguous type assignation. The nucleic acid sequence typing approach, commonly called multilocus sequence typing (18), is now regarded as the method of choice to generate truly portable data. In addition, it is easily codable, and data can be stored into databases. In a typical multilocus sequence typing assay, data derived from parts of seven housekeeping genes are analyzed. A simpler sequence-based typing (SBT) scheme for L. pneumophila sg 1 was described, initially based on the investigation of three genes (11) and more recently extended to six genes, namely flaA, pilE, asd, mip, mompS, and proA (10). SBT was applied to the typing of 105 predominantly clinical isolates mostly belonging to sg 1 (10, 11). In a recent comparison of three different genotyping methods, SBT was reported to be the most rapid and the easiest to perform in an outbreak setting and provided unambiguous results (23). The major problem with SBT is the high cost which makes problematic its routine use by laboratories which are not equipped with local sequencing facilities. It is presumable that this method will be restricted to a subset of most relevant isolates. Hence, additional screening methods combining low cost with good typing performance are still needed.

Polymorphic tandem repeats have been successfully used for epidemiological typing studies of several bacterial species (17, 26). The so-called multiple-locus variable number of tandem repeats (VNTR) assays (MLVA) are based on the analysis of short to long tandemly repeated sequences (also called microsatellites, up to 9 bp, and minisatellites, more than 9 bp in length). An assay is defined by a set of loci spread throughout the bacterial genome (16). Previous studies of the polymorphism of tandem repeats in L. pneumophila suggested that VNTRs could be used for genotyping (21) in spite of the fact that this species is much more genetically heterogeneous than other species for which MLVA can be regarded as a reference method, such as Bacillus anthracis, Mycobacterium tuberculosis, and Yersinia pestis (15, 16, 20). Seven VNTRs have so far been described and used to type reference strains of L. pneumophila sg 1 to sg 14 plus 27 L. pneumophila strains isolated from the environment or from patients. The number of repeat units was assessed by measuring the size of PCR products encompassing the VNTR. This initial report was based on the single genome sequence available at that time (L. pneumophila sg 1 type strain Philadelphia-1), with the drawback that a number of primer sets would amplify only a subset of the isolates examined. The release of two additional L. pneumophila sg 1 genomes (Lens and Paris) (2) provided the basis for identification of additional markers using the methods described by Denoeud and Vergnaud (4; see also the MLVA web service, http://minisatellites.u-psud.fr). As a result, we propose here an extended MLVA comprising up to 10 VNTR loci of L. pneumophila.

The aims of the present study were to set up a standard MLVA protocol for L. pneumophila sg 1 including standard PCR and gel-based analysis, evaluate the overall performances of the standard MLVA typing system, i.e., typeability (T), reproducibility (R), epidemiological concordance (E), stability (S), and discriminatory power, and establish the first MLVA database for L. pneumophila which can be directly queried via the internet.



Participants from three institutes representing three European countries took part in the study: Genotyping Public MLVA Site, Institut de Génétique et Microbiologie, Université Paris-Sud 11, Orsay, France (center I); National Institute for Infectious Diseases Lazzaro Spallanzani, Rome, Italy (center II); and the Health Protection Agency Centre for Infections, Respiratory and Systemic Infection Laboratory, London, United Kingdom (center III). Two of these have extensive experience of participation in multicenter studies on Legionella typing (centers II and III), and one was the originating laboratory of an MLVA for Legionella (center I) (21). The study was coordinated by center III. Guidelines for appropriate use and evaluation of microbial epidemiologic typing systems were followed (25).

Bacterial strains.

The type strain of L. pneumophila, Philadelphia-1 (NCTC 11192), was obtained from the National Collection of Type Cultures, London, United Kingdom, and the Lens and Paris reference strains were kindly provided by Jerome Etienne, Centre National de Référence des Légionelles, Lyon, France. All other isolates were obtained from the EWGLI Legionella culture collection, which was established by EWGLI members to facilitate epidemiological typing studies. All isolates from this collection have a unique number (EUL number; see http://www.ewgli.org) and have previously been characterized by multiple phenotypic and genotypic methods (7-11) (Tables (Tables11 and and22).

L. pneumophila proficiency panel and three reference strainsa
Characteristics of L. pneumophila sg1 strains used to assess epidemiological concordance and stability of MLVAa

A total of 103 clinical and environmental isolates of L. pneumophila were analyzed, including 99 L. pneumophila sg 1 strains and 4 L. pneumophila strains from other serogroups. The proficiency panel comprised 10 coded strains of L. pneumophila from the EUL strain collection plus the three reference strains (Table (Table1).1). Ninety-five of the L. pneumophila sg 1 isolates, from nine European countries, were selected to produce one epidemiologically unrelated panel of 79 clinical isolates (panel 1), one epidemiologically related panel of 16 isolates (panel 2), and one stability panel of 5 isolates (panel 3). Panel 2 comprises five sets of epidemiologically related isolates and two replicates of the same isolate (EUL 120 and EUL 121), while panel 3 comprises five variants of the same strain (Table (Table2).2). Details of strains included in panels 1 to 3 have been published elsewhere (11).


For training purposes, the proficiency panel was analyzed by the three centers to establish the reproducibility (R) of the method. The laboratory protocols for MLVA were agreed upon by the three centers prior to the study and are made available to the public at http://bacterial-genotyping.igmors.u-psud.fr/. This pilot panel comprised two pairs of epidemiologically related strains (one pair belonging to sg 1 and one pair belonging to sg 8), and six other epidemiologically unrelated strains (four belonging to sg 1, one belonging to sg 6, and one belonging to sg 10). The results for these 13 strains were analyzed locally by each center and then sent to center I for interlaboratory comparative analysis. Following agreement of optimal analytical settings, the second phase of the study was initiated. This was for all three centers to analyze the 16 strains of panel 2. These data were also analyzed to determine the epidemiological concordance (E). Discriminatory power and stability (S) were determined using panel 1 and panel 3 strains, respectively. Typeability (T) was assessed by the analysis of 99 unique isolates from all panels (i.e., the proficiency panel plus panels 1 and 2, with the exclusion of panel 3).

VNTR markers.

Polymorphic tandem repeats were identified in the sequenced genomes of L. pneumophila sg 1 Philadelphia-1, Lens, and Paris reference strains using the strain comparison tool developed by Denoeud and Vergnaud (4), available at http://minisatellites.u-psud.fr/. Ten VNTR markers were used for the present study, including four from the previously described set of nine VNTR markers (21). The primers used to amplify the markers are designated Lpms1_b, Lpms3, Lpms13, Lpms17, Lpms19_b, Lpms31, Lpms33, Lpms34, Lpms35, and Lpms37, as shown in Table Table3.3. Lpms1_b and Lpms19_b correspond to new sets of primers designed for the Lpms1 and Lpms19 VNTR loci described in reference 21.

Oligonucleotide primers used and VNTRs analyzed in this study

DNA extraction and PCR amplification.

Genomic DNA from all EUL strains and the reference strains was extracted by the coordinating laboratory as described previously (11) and distributed by courier. Oligonucleotide primers targeting the 5′ and 3′ flanking regions of VNTR loci were used for amplification. The following conditions were used. PCRs were performed with 15-μl volumes containing 1 to 5 ng of DNA, 1× reaction buffer, 1.5 mM MgCl2, 1 U of Taq DNA polymerase (Roche, Promega, or Invitrogen), 200 μM concentrations of each deoxynucleoside triphosphate (dNTP), 0.3 μM concentrations of each flanking primer (Eurogentec or MWG Biotech). Amplification was locally performed with different thermocyclers (including PTC 200 or PTC 225 DNA Engine [MJ Research] and MyCycler [Bio-Rad]) using the following conditions: initial denaturation cycle for 5 min at 94°C, 35 cycles of denaturation for 30 s at 94°C, annealing for 30 s at 60°C, and elongation for 45 s at 72°C, plus a final elongation step for 10 min at 72°C.

Unless otherwise stated, 3 μl of PCR products was separated in a 2% agarose (UltraPure electrophoresis grade; Invitrogen) gel or in a 4% agarose gel comprising 2% Metaphor (FMC Bioproducts-Cambrex) plus 2% regular agarose (as above) for Lpms37. Electrophoresis was performed on 20-cm-wide gels made in 0.5× Tris-borate-EDTA buffer (Sigma), run at 8 V/cm. For each PCR run, a reference strain (L. pneumophila Philadelphia-1, Paris, or Lens) was included. DNA size markers, including the 100-bp ladder (routinely) or 20-bp ladder (only for Lpms37), were from Bio-Rad, MBI Fermentas, or Euromedex. The gels were stained after the run in 0.5 to 1.0 μg/ml ethidium bromide for 15 to 30 min, rinsed with water, and photographed under UV illumination.

Agarose gel image analysis.

The band size was determined using the software Quantity One v. 4.2.1 (Bio-Rad; available in center II) or the BioNumerics software v. 4.0 or v. 4.5 (Applied Maths; available in centers I and III). First, the position of the cursor relative to the DNA band was adjusted to achieve optimum size matching with the L. pneumophila Philadelphia-1, Lens, or Paris reference strain used as an internal control; then the cursor was similarly positioned for all the other strains run in the same gel. Size assignments were confirmed by visual inspection of gels and comparative analysis of strains for each marker. The images produced by the three centers were inspected by one center “ex post,” and then differences or errors in data reporting on this training set were adjusted and/or corrected.

Nomenclature and description of MLVA profiles.

The repeat length and number of repetitions was determined in the three sequenced genomes available at Columbia Genome Center (http://genome3.cpmc.columbia.edu/~legion/g_info.html) or Institut Pasteur (http://genolist.pasteur.fr/LegioList/) using the Microbial Tandem Repeats Database (http://minisatellites.u-psud.fr) (4, 16). Amplification of DNA from these three reference strains using primers described in Table Table33 produced amplicons of the expected size. The number of repeats in new alleles was estimated by subtracting the invariable flanking region from the amplicon size and then dividing by the repeat unit length, as determined for reference strain Philadelphia-1. Intermediate-sized alleles (which may result from intermediate-size repeat units or from small deletions in the flanking sequence) were reported as half-sized, when observed.

The polymorphism index of individual or combined VNTR loci was calculated using panel 1 strains and the Hunter-Gaston diversity index (HGDI) (14), an application of the Simpson' s index of diversity (24). The allelic profile for 8 loci (MLVA-8) was defined as the number of repeats at each VNTR locus in the order Lpms1_b, Lpms3, Lpms13, Lpms17, Lpms19_b, Lpms33, Lpms34, and Lpms35. The not amplified designation was given when no amplification was repeatedly observed at a given locus. The null (0) allele designates a locus that contains both flanking sequences but no repeat unit.

DNA sequence analysis.

The full-length sequences of selected PCR products were determined on both strands following DNA purification by the QIAquick PCR purification kit (QIAGEN). Sequencing reactions were performed using the BigDye terminator technology according to the manufacturer's recommendation (Applied Biosystems), and products were analyzed in an ABI 3100 capillary electrophoresis system equipped with the POP 4 matrix (Applied Biosystems). Data obtained with forward and reverse sequencing primers were combined, and sequences were manually aligned. DNA sequences have been deposited in the EMBL Nucleotide Sequence Database (see “Nucleotide sequence accession numbers” below).

Criteria for evaluation of L. pneumophila sg 1 MLVA.

Standard efficacy criteria of L. pneumophila sg 1 MLVA, including typeability (T), reproducibility (R), stability (S), epidemiological concordance (E), and discriminatory power (expressed as HGDI) were determined as reported elsewhere (25).

Nucleotide sequence accession numbers.

DNA sequences have been deposited in the EMBL Nucleotide Sequence Database under accession numbers AM420643 to AM420666 as follows (the corresponding strain name is shown in parentheses): for Lpms1 sequences, AM 420643 (EUL 103), AM 420644 (EUL 031), AM 420645 (EUL 048), AM 420646 (EUL 087), and AM 420647 (ATCC 35096); for Lpms17 sequences, AM 420648 (ATCC 33154), AM 420649 (EUL 052), and AM 420650 (ATCC 33290); for Lpms31 sequences, AM 420651 (EUL 006), AM 420652 (EUL 027), AM 420653 (EUL 063), AM 420654 (EUL 135), AM 420655 (Paris), AM 420656 (Lens), AM 420657 (EUL 111), AM 420658 (Philadelphia), AM 420659 (EUL 048), and AM 420660 (EUL 070); and for Lpms37 sequences, AM 420661 (Paris), AM 420662 (Lens), AM 420663 (Philadelphia), AM 420664 (EUL 111), AM420665 (EUL 032), and AM 420666 (EUL 076). Sequence data are available at http://bacterial-genotyping.igmors.u-psud.fr/.


Selection of VNTR markers and allele assignment.

With the complete sequencing of three L. pneumophila genomes, it became possible to identify new polymorphic tandem repeats by the strain comparison tool available at http://minisatellites.u-psud.fr/ (4). A large proportion of tandem repeats was present in one or two strains only and, therefore, could not be used in an MLVA scheme (data not shown).

Among the nine previously described VNTR markers (21), four (Lpms1, Lpms13, Lpms17, and Lpms19) yielded amplification products of the expected size in all reference isolates and were retained for the MLVA scheme. New optimized primers were designed for Lpms1 and Lpms19 (referred to as Lpms1_b and Lpms19_b in Table Table3),3), taking into account the published sequences for the Paris and Lens genomes. Six new VNTRs were also evaluated, namely Lpms3 (96-bp repeat unit), Lpms31 (45-bp repeat unit), Lpms33 (125-bp repeat unit), Lpms34 (125-bp repeat unit), Lpms35 (18-bp repeat unit), and Lpms37 (7- to 8-bp repeat unit) (Table (Table3).3). The number of repeated units at each locus in the sequenced genomes is shown in Table Table33.

Analysis of all markers for strains included in the proficiency, epidemiologically related, and stability panels is shown in Table Table4.4. Half-sized alleles were observed for Lpms1_b, Lpms3, Lpms17, and Lpms31. Initially, correct assignment of the number of repeats was not straightforward for Lpms31, because of the relatively high frequency of half-size alleles, and for Lpms37, because of the existence in some strains of two different repeat units (7 bp or 8 bp long), as observed upon sequencing of selected alleles (Fig. (Fig.1).1). Notably, intermediate-size alleles were observed for Lpms31 also in reference strains Paris and Lens.

FIG. 1.
Alignment of Lpms37 repeats and flanking sequences in the three reference strains and three unrelated isolates using ClustalW. Single-nucleotide differences are present in the flanking sequences, whereas in the tandem repeats, variable numbers of 7-bp ...
MLVA analysis of the proficiency, epidemiologically related, and stability panels

For all markers, except Lpms3 and Lpms31, PCR products ranged in size between 151 and 724 bp (http://bacterial-genotyping.igmors.u-psud.fr/) and could easily be resolved by agarose gel electrophoresis. Moreover, the 96-bp repeat length of Lpms3 provided good resolution also for this marker, despite the occasional occurrence of half-sized alleles (± 48 bp relative to the repeat unit). In the case of Lpms31 (45-bp repeat size), some of the observed alleles exceeded 1,000 bp, and extensive electrophoretic separation was required for precise size estimation.

Thus, an MLVA scheme is proposed, including 8 markers, which allows unambiguous type assignment using agarose gels (MLVA-8, including Lpms1_b, Lpms3, Lpms13, Lpms17, Lpms19_b, Lpms33 Lpms34, and Lpms35). The addition of the highly informative Lpms31 and Lpms37 loci (MLVA-10) requires capillary electrophoresis-based fragment analysis or sequencing for unambiguous type assignment (sequences are available at http://bacterial-genotyping.igmors.u-psud.fr/).


MLVA-8 was initially performed on a proficiency panel of 10 strains from the EUL collection plus 3 reference strains (Table (Table1)1) in the three centers to assess the reproducibility of the method. The reference strains Philadelphia-1, Paris, and Lens were analyzed together with the 10 EUL isolates. For the three reference strains, the amplicon size estimates by all three centers were consistent with the predicted length of individual VNTRs, as inferred from genome analysis (compare data in Tables Tables33 and and4).4). Although the electrophoresis patterns appear to be similar, some discrepancies were initially observed between the three centers on the sizing which was achieved by the use of locally available gel image analysis software. These discrepancies were most likely the result of gel overloading, as discussed hereafter. A second difficulty was due to the existence of alleles with unexpected size, incompatible with an exact number of repeats. This is the case, for example, for Lpms1_b in EUL 048 and EUL 056, whose allele size is intermediate between the size of a 7- and 8-repeat allele observed in EUL 157 and reference strain Philadelphia, respectively. We have arbitrarily scored this allele 7.5 (Table (Table4)4) to clearly reflect this fact. A similar size variability was shown also for other markers, namely Lpms3 and Lpms17. Sequencing of such alleles confirmed the existence of repeats with unexpected sizes. However, for all these markers, the half-sized alleles could easily be distinguished from the full-length counterparts by visual inspection of the gel.

All images were sent to center I, where sizing was repeated by using BioNumerics software as described above. The correct position of the cursor on a given gel was deduced from the reference strain lane, and minor adjustments were made, taking into account the amount of DNA present in the band. Data from all isolates tested by the three laboratories were in complete agreement (R = 1.00 for all VNTR markers).

Epidemiological concordance.

Panel 2, including 16 epidemiologically related isolates that had previously been typed by different DNA-based techniques (7-9), were genotyped by MLVA in the three centers (Table (Table4).4). EUL 048, EUL 056, and EUL 121 were also retested, since they were previously included in the proficiency panel. Amplicons were obtained by the three centers for all markers except Lpms17, Lpms34, and Lpms37 in EUL 040 and EUL 047. In fact, no amplification was obtained for these isolates by centers II and III, while a weak amplification was observed by center I using a preparation of highly concentrated Taq DNA polymerase (SilverStar; Eurogentec).

Sizing was performed independently by each center, and then images were reanalyzed by center I. Table Table44 shows the compiled data for 10 VNTR markers and comparison with SBT profiles. The epidemiological concordance was excellent for all VNTR markers. E values of 1.00 were determined for all markers except Lpms37 (E = 0.83). This high epidemiological concordance was also seen in other sets of epidemiologically related strains, including those from outbreaks in two Paris hospitals described in the study by Pourcel et al. (21) (our unpublished data).


Analysis of five variants (EUL 135 to EUL 139) in the stability panel (panel 3 in Table Table4)4) resulted in identical MLVA-8 allelic profiles, namely 9,8,9,2,4,4,1,24 (S = 1.00 for all markers).

Strain diversity and discriminatory power of MLVA.

Estimates of individual and pooled diversity index (HGDI) values obtained with the 8 VNTRs were determined by using panel 1, which is composed of 79 unrelated isolates. Figure Figure22 shows the typical organization of a gel, here for marker Lpms35, on which one reference strain (Philadelphia-1 or Lens) is run next to a group of 5 isolates. On a 20-cm-wide gel, up to 30 isolates can simultaneously be analyzed for a single VNTR marker. The data for panel 1 isolates are presented in Table Table55 according to the result of an MLVA clustering analysis performed using Hamming's distance (the categorical coefficient) and the unweighted pair group method with arithmetic mean clustering method. The SBT code is indicated for comparison. A simplified estimate of the concordance between SBT and MLVA-8 is obtained by comparing side by side the codes for each strain, as shown in Table Table5.5. Strains clustered under a single SBT code can show differences in MLVA-8 profile (for example, EUL 081 and EUL 092), and the opposite is observed for some MLVA-8 groups (for example, EUL 029 and EUL 049). The larger group with the SBT code 1,4,3,1,1,1 is split into two groups by Lpms35, and additional differences are found when Lpms31 and Lpms37 are analyzed. On the contrary, in the second larger group with MLVA-8 profile 7,8,9,2,5,3,1,12, SBT appears to be more discriminatory. HGDI values for each marker and for MLVA-8 are shown in Table Table6.6. Lpms35 provided high index of discrimination (0.88). Overall, the high HGDI values obtained for MLVA-8 (0.93) highlights the good discriminatory power of MLVA. The number of alleles observed for the different markers and their theoretical size are provided as help file in the L. pneumophila MLVA typing site (http://bacterial-genotyping.igmors.u-psud.fr/).

FIG. 2.
Analysis of the Lpms35 marker in 30 L. pneumophila isolates. Philadelphia-1 (lanes 2, 16, and 30) and Lens (lanes 9, 23, and 37) reference strains were alternately used as internal controls. Lanes and test strains are, respectively: 3, EUL 101; 4, EUL ...
Allelic profiles of 79 unrelated L. pneumophila isolates (panel 1)
HGDI for individual or combined VNTR loci (MLVA-8) calculated from 79 unrelated isolates of L. pneumophila


Analysis of 99 unique isolates from all panels showed amplification and full typeability (T = 1.00) for 5 of the 8 VNTR loci, namely Lpms1_b, Lpms3, Lpms13, Lpms33, and Lpms35. In EUL 040 and EUL 047, weak amplification with Lpms17 and Lpms34 was observed by center I, and no amplification was found by the other two centers. Moreover, no amplification was observed in all three centers for Lpms19_b in EUL 154 and EUL 155 (Table (Table4).4). All of these results were scored as failures. Therefore, a T value of 0.98 was determined for Lpms17, Lpms19_b, and Lpms34.


The primary aim of this study was to evaluate the utility of MLVA as a genotyping method for L. pneumophila, complementary or alternative to SBT and AFLP.

In an attempt to increase the repertoire of previously established VNTR markers (21), we have compared the three available L. pneumophila genome sequences and have identified 10 common markers for which conserved primers could be unambiguously derived. Amplification was obtained with almost all of the tested strains. No amplification was observed for marker Lpms19_b in EUL 154 and EUL 155 from the proficiency panel, while Lpms17, Lpms34, and Lpms37 could not consistently be amplified in EUL 040 and EUL 047. By comparison with MLVA profiles of previously analyzed strains (21), EUL 040 and EUL 047 would belong to L. pneumophila subsp. fraseri. Thus, one possible explanation for the failure of amplification with specific markers may be flanking sequence variation in these strains compared to the majority of the L. pneumophila subsp. pneumophila sg 1 strains tested.

In theory, the optimal MLVA scheme for epidemiological studies should comprise a set of markers with variable indices of diversity. Markers with a moderate diversity and small number of alleles (presumably reflecting a low mutation rate) are important to define families, whereas markers with more rapid evolution provide variability inside clonal lineages.

There is no fixed rule regarding the number of markers to be used in an MLVA, since the association of individual markers (with variable index of diversity) offers the opportunity of modulating the discriminatory power of MLVA. In its simpler version, MLVA of L. pneumophila sg 1 could consider the association of only five easily manageable markers, namely Lpms13, Lpms19_b, Lpms33, Lpms34, and Lpms35, and exclude Lpms_1b, Lpms3, and Lpms17 because of the occurrence of half-sized alleles. By combining these five VNTR markers, sufficiently high discrimination can be achieved (HGDI = 0.92, 34 genotypes). However, the addition of three more markers, namely Lpms_1b, Lpms3, and Lpms17, for which the half-sized alleles could easily be distinguished from the full-length counterparts, improves the informativeness of the assay (HGDI = 0.93, 36 genotypes) without complicating gel analysis. Although the diversity index for Lpms17 is low, it could be increased when other series of isolates are analyzed.

To combine technical simplicity with high discriminatory power, the MLVA-8 scheme proposed in this study comprises an easily manageable combination of both moderately and highly diverse markers. In addition to MLVA-8 markers, Lpms31 and Lpms37 could contribute to further discrimination in light of their high diversity index (0.86 and 0.92, respectively) and are here proposed for an extended MLVA panel. However, their correct allele size assignment requires the use of high-precision DNA sizing equipment, such as a sequencer, and the number of repetitions cannot be deduced from the size. It is therefore recommended to type these two markers when additional informativeness is required and to use the size expressed in bp for clustering analyses. Epidemiological concordance was not perfect for Lpms37, which may suggest instability of this marker. This point, however, warrants future investigation. In a previous study, sequencing of minisatellite Lpms4 in 14 reference strains revealed the existence of alleles made up of a completely different (in size and sequence) repeat unit (21). Such extreme markers cannot be used in a routine gel-based MLVA. More classically, internal motif variations can be the mark of different evolution pathways for a similarly sized end product causing homoplasy (19). Since MLVA relies on the sizing of PCR products encompassing polymorphic tandem repeats, it does not inform on the sequence and will not detect homoplasy. It is the analysis of multiple VNTRs which compensates for potentially high homoplasy levels at individual loci.

The index of discrimination of MLVA-8 is greater than that reported for a standardized nonfluorescent AFLP method (7) and similar to that of SBT (10), as also shown by the comparison of SBT and MLVA-8 profiles (Table (Table5).5). However, phylogenetic relationships between the different clusters obtained by SBT and MLVA-8 show only 57.2% congruence calculated using the Pearson correlation coefficient in BioNumerics (data not shown). This discrepancy might be explained by horizontal gene transfer and by the different evolution rate of individual SBT and VNTR markers.

The interlaboratory reproducibility of the assay was excellent, although in preliminary assays, some discrepancies were occasionally observed in the sizing of amplicons on agarose gels. These were due to the way the cursor was placed on a band and band size was estimated. When placed in the middle of a fat band resulting from DNA overloading, the observed size was generally underestimated, as indicated by the sizing of reference strains. This artifactual behavior is not taken into account by gel analysis software, which automatically positions cursors at the band peak. This error was suppressed by loading similarly low amounts of individual size markers and amplicons on the gel to get thin bands. Minimum experience and quick visual inspection is sufficient to subsequently correctly position cursors, and the proficiency panel of strains gives the opportunity to develop this experience. Distortion of the migration on the gel borders must also be taken into account when sizing is performed.

In conclusion, we emphasize that MLVA-8 is a rapid, reproducible, and epidemiologically meaningful typing tool, which can be performed at low cost using regular equipment and agarose gel electrophoresis. To further reduce the cost and improve the accuracy of the assay, especially when large numbers of isolates need to be analyzed, it may be interesting to perform multiplex PCR using fluorescent primers and to separate the products by means of a DNA sequencer. As previously shown (21), VNTR amplification can be performed on thermolysates from a single L. pneumophila colony, further facilitating and lowering the cost of the procedure. For more precise information, and particularly in the case of phylogenetic studies, the sequencing of some markers with internal variability is advisable.

In its present form, the MLVA typing scheme cannot be used to investigate other Legionella species. Indeed, no PCR products were observed when primers were tested on 16 different non-L. pneumophila species, indicating that the assay is strictly specific for L. pneumophila. However, the availability of genome sequences and access to well-characterized strain collections of other Legionella species would allow the potential of this technique to be assessed for the epidemiological typing of non-L. pneumophila species.

A first Legionella pneumophila MLVA database can be queried online at http://bacterial-genotyping.igmors.u-psud.fr/.


We thank E. Nebuloso for technical assistance in DNA sequencing and T. Harrison for helpful comments on the manuscript. A. Henrard and G. Corbineau performed the typing in University Paris-Sud 11.

Work on the accountability of dangerous pathogens in University Paris-Sud 11 is supported by Délégation Générale pour l'Armement and is part of the European defense project CEPA13.14. This work was also supported by grants from Ministero della Salute (ricerca corrente 2005 and ricerca finalizzata 2006) and ISPESL to P.V.


[down-pointing small open triangle]Published ahead of print on 24 January 2007.


1. Brenner, D. J., A. G. Steigerwalt, P. Epple, W. F. Bibb, R. M. McKinney, R. W. Starnes, J. M. Colville, R. K. Selander, P. H. Edelstein, and C. W. Moss. 1988. Legionella pneumophila serogroup Lansing 3 isolated from a patient with fatal pneumonia, and descriptions of L. pneumophila subsp. pneumophila subsp. nov., L. pneumophila subsp. fraseri subsp. nov., and L. pneumophila subsp. pascullei subsp. nov. J. Clin. Microbiol. 26:1695-1703. [PMC free article] [PubMed]
2. Cazalet, C., C. Rusniok, H. Brüggemann, N. Zidane, A. Magnier, L. Ma, M. Tichit, S. Jarraud, C. Bouchier, F. Vandenesch, F. Kunst, J. Etienne, P. Glaser, and C. Buchrieser. 2004. Evidence in the Legionella pneumophila genome for exploitation of host cell functions and high genome plasticity. Nat. Genet. 36:1165-1173. [PubMed]
3. Chien, M., I. Morozova, S. Shi, H. Sheng, J. Chen, S. M. Gomez, G. Asamani, K. Hill, J. Nuara, M. Feder, J. Rineer, J. J. Greenberg, V. Steshenko, S. H. Park, B. Zhao, E. Teplitskaya, J. R. Edwards, S. Pampou, A. Georghiou, I.-C. Chou, W. Iannuccilli, M. E. Ulz, D. H. Kim, A. Geringer-Sameth, C. Goldsberry, P. Morozov, S. G. Fischer, G. Segal, X. Qu, A. Rzhetsky, P. Zhang, E. Cayanis, P. J. De Jong, J. Ju, S. Kalachikov, H. A. Shuman, and J. J. Russo. 2004. The genomic sequence of the accidental pathogen Legionella pneumophila. Science 305:1966-1968. [PubMed]
4. Denoeud, F., and G. Vergnaud. 2004. Identification of polymorphic tandem repeats by direct comparison of genome sequence from different bacterial strains: a Web-based resource. BMC Bioinformatics 5:4. [PMC free article] [PubMed]
5. Fields, B. S., R. F. Benson, and R. E. Besser. 2002. Legionella and Legionnaires' disease: 25 years of investigation. Clin. Microbiol. Rev. 15:506-526. [PMC free article] [PubMed]
6. Fry, N. K., B. Afshar, P. Visca, D. Jonas, J. Duncan, E. Nebuloso, A. Underwood, and T. G. Harrison. 2005. Assessment of fluorescent amplified fragment length polymorphism analysis for epidemiological genotyping of Legionella pneumophila serogroup 1. Clin. Microbiol. Infect. 11:704-712. [PubMed]
7. Fry, N. K., S. Alexiou-Daniel, J. M. Bangsborg, S. Bernander, M. Castellani Pastoris, J. Etienne, B. Forsblom, V. Gaia, J. H. Helbig, D. Lindsay, P. Christian Lück, C. Pelaz, S. A. Uldum, and T. G. Harrison. 1999. A multicenter evaluation of genotypic methods for the epidemiologic typing of Legionella pneumophila serogroup 1: results of a pan-European study. Clin. Microbiol. Infect. 5:462-477. [PubMed]
8. Fry, N. K., J. M. Bangsborg, A. Bergmans, S. Bernander, J. Etienne, L. Franzin, V. Gaia, P. Hasenberger, B. Baladron Jimenez, D. Jonas, D. Lindsay, S. Mentula, A. Papoutsi, M. Struelens, S. A. Uldum, P. Visca, W. Wannet, and T. G. Harrison. 2002. Designation of the European Working Group on Legionella infection (EWGLI) amplified fragment length polymorphism types of Legionella pneumophila serogroup 1 and results of intercentre proficiency testing using a standard protocol. Eur. J. Clin. Microbiol. Infect. Dis. 21:722-728. [PubMed]
9. Fry, N. K., J. M. Bangsborg, S. Bernander, J. Etienne, B. Forsblom, V. Gaia, P. Hasenberger, D. Lindsay, A. Papoutsi, C. Pelaz, M. Struelens, S. A. Uldum, P. Visca, and T. G. Harrison. 2000. Assessment of intercentre reproducibility and epidemiological concordance of Legionella pneumophila serogroup 1 genotyping by amplified fragment length polymorphism analysis. Eur. J. Clin. Microbiol. Infect. Dis. 19:773-780. [PubMed]
10. Gaia, V., N. K. Fry, B. Afshar, P. C. Lück, H. Meugnier, J. Etienne, R. Peduzzi, and T. G. Harrison. 2005. Consensus sequence-based scheme for epidemiological typing of clinical and environmental isolates of Legionella pneumophila. J. Clin. Microbiol. 43:2047-2052. [PMC free article] [PubMed]
11. Gaia, V., N. K. Fry, T. G. Harrison, and R. Peduzzi. 2003. Sequence-based typing of Legionella pneumophila serogroup 1 offers the potential for true portability in legionellosis outbreak investigation. J. Clin. Microbiol. 41:2932-2939. [PMC free article] [PubMed]
12. Harrison, T. G. 2005. Legionella, p. 1761-1785. In S. P. Borriello, P. R. Murray, and G. Funke (ed.), Topley & Wilson's Microbiology & Microbial Infections. Hodder Arnold, London, United Kingdom.
13. Helbig, J. H., S. Bernander, M. Castellani Pastoris, J. Etienne, V. Gaia, S. Lauwers, D. Lindsay, P. C. Lück, T. Marques, S. Mentula, M. F. Peeters, C. Pelaz, M. Struelens, S. A. Uldum, G. Wewalka, and T. G. Harrison. 2002. Pan-European study on culture-proven Legionnaires' disease: distribution of Legionella pneumophila serogroups and monoclonal subgroups. Eur. J. Clin. Microbiol. Infect. Dis. 21:710-716. [PubMed]
14. Hunter, P. R., and M. A. Gaston. 1988. Numerical index of the discriminatory ability of typing systems: an application of Simpson's index of diversity. J. Clin. Microbiol. 26:2465-2466. [PMC free article] [PubMed]
15. Le Flèche, P., M. Fabre, F. Denoeud, J. L. Koeck, and G. Vergnaud. 2002. High resolution, on-line identification of strains from the Mycobacterium tuberculosis complex based on tandem repeat typing. BMC Microbiol. 2:37. [PMC free article] [PubMed]
16. Le Flèche, P., Y. Hauck, L. Onteniente, A. Prieur, F. Denoeud, V. Ramisse, P. Sylvestre, G. Benson, F. Ramisse, and G. Vergnaud. 2001. A tandem repeats database for bacterial genomes: application to the genotyping of Yersinia pestis and Bacillus anthracis. BMC Microbiol. 1:2. [PMC free article] [PubMed]
17. Lindstedt, B. A. 2005. Multiple-locus variable number tandem repeats analysis for genetic fingerprinting of pathogenic bacteria. Electrophoresis 26:2567-2582. [PubMed]
18. Maiden, M. C., J. A. Bygraves, E. Feil, G. Morelli, J. E. Russell, R. Urwin, Q. Zhang, J. Zhou, K. Zurth, D. A. Caugant, I. M. Feavers, M. Achtman, and B. G. Spratt. 1998. Multilocus sequence typing: a portable approach to the identification of clones within populations of pathogenic microorganisms. Proc. Natl. Acad. Sci. USA 95:3140-3145. [PMC free article] [PubMed]
19. Olsen, K. M. 1999. Minisatellite variation in a single-copy nuclear gene: phylogenetic assessment of repeat length homoplasy and mutational mechanism. Mol. Biol. Evol. 16:1406-1409. [PubMed]
20. Pourcel, C., F. André-Mazeaud, H. Neubauer, F. Ramisse, and G. Vergnaud. 2004. Tandem repeats analysis for the high resolution phylogenetic analysis of Yersinia pestis. BMC Microbiol. 4:22. [PMC free article] [PubMed]
21. Pourcel, C., Y. Vidgop, F. Ramisse, G. Vergnaud, and C. Tram. 2003. Characterization of a tandem repeat polymorphism in Legionella pneumophila and its use for genotyping. J. Clin. Microbiol. 41:1819-1826. [PMC free article] [PubMed]
22. Samrakandi, M. M., S. L. G. Cirillo, D. A. Ridenour, L. E. Bermudez, and J. D. Cirillo. 2002. Genetic and phenotypic differences between Legionella pneumophila strains. J. Clin. Microbiol. 40:1352-1362. [PMC free article] [PubMed]
23. Scaturro, M., M. Losardo, G. De Ponte, and M. L. Ricci. 2005. Comparison of three molecular methods used for subtyping of Legionella pneumophila strains isolated during an epidemic of legionellosis in Rome. J. Clin. Microbiol. 43:5348-5350. [PMC free article] [PubMed]
24. Simpson, E. H. 1949. Measurement of diversity. Nature 163:688.
25. Struelens, M. J., and members of the European Study Group on Epidemiological Markers (ESGEM) of the European Society for Clinical Microbiology and Infectious Diseases. 1996. Consensus guidelines for appropriate use and evaluation of microbial epidemiologic typing systems. Clin. Microbiol. Infect. 2:2-11. [PubMed]
26. Vergnaud, G., and C. Pourcel. 2006. Multiple locus VNTR (variable number of tandem repeat) analysis (MLVA), p. 83-104. In E. Stackebrandt (ed.), Molecular identification, systematics and population structure of prokaryotes. Springer-Verlag, Berlin, Germany.

Articles from Journal of Clinical Microbiology are provided here courtesy of American Society for Microbiology (ASM)
PubReader format: click here to try


Related citations in PubMed

See reviews...See all...

Cited by other articles in PMC

See all...


Recent Activity

Your browsing activity is empty.

Activity recording is turned off.

Turn recording back on

See more...