A Trilocus Sequence Typing Scheme for Hospital Epidemiology and Subspecies Differentiation of an Important Nosocomial Pathogen, Enterococcus faecalis▿ †
Associated Data
Abstract
In this study, we present a trilocus sequence typing (TLST) scheme based on intragenic regions of two antigenic genes, ace and salA (encoding a collagen/laminin adhesin and a cell wall-associated antigen, respectively), and a gene associated with antibiotic resistance, lsa (encoding a putative ABC transporter), for subspecies differentiation of Enterococcus faecalis. Each of the alleles was analyzed using 50 E. faecalis isolates representing 42 diverse multilocus sequence types (STM; based on seven housekeeping genes) and four groups of clonally linked (by pulsed-field gel electrophoresis [PFGE]) isolates. The allelic profiles and/or concatenated sequences of the three genes agreed with multilocus sequence typing (MLST) results for typing of 49 of the 50 isolates; in addition to the one exception, two isolates were found to have identical TLST types but were single-locus variants (differing by a single nucleotide) by MLST and were therefore also classified as clonally related by MLST. TLST was also comparable to PFGE for establishing short-term epidemiological relationships, typing all isolates classified as clonally related by PFGE with the same type. TLST was then applied to representative isolates (of each PFGE subtype and isolation year) of a collection of 48 hospital isolates and demonstrated the same relationships between isolates of an outbreak strain as those found by MLST and PFGE. In conclusion, the TLST scheme described here was shown to be successful for investigating short-term epidemiology in a hospital setting and may provide an alternative to MLST for discriminating isolates.
Enterococci are commensal members of the gastrointestinal tract flora of humans and animals. Within the last 2 decades, enterococci have emerged as the second to third most frequent cause of nosocomial infections, including endocarditis and bloodstream, urinary tract, and wound infections, among others (8, 15, 19, 24, 39). These organisms are also known to have the ability to acquire and transfer antibiotic resistance genes and virulence-associated genes (37). Although there are more than 30 species of the genus Enterococcus, two species, Enterococcus faecalis and Enterococcus faecium, account for a vast majority of enterococcal clinical and nosocomial infections (15, 21, 35). In the past, several molecular typing studies have shown that specific lineages of pathogenic bacteria arise periodically, proliferate, and spread in the presence of selective pressures (34). Therefore, accurate typing of enterococcal strains is crucial for the identification of particular clones capable of causing infections and with the ability to spread in the hospital environment.
A number of phenotypic and genotypic typing methods have been applied to the subspecies differentiation of enterococcal isolates. Phenotypic methods which have been used in the past include serotyping (17, 22, 26) and multilocus enzyme electrophoresis (50). Genotypic methods include, among others (3, 52, 53), ribotyping (14, 38), repetitive sequence-based PCR (25, 35), multilocus variable-number tandem-repeat analysis (49, 54), pulsed-field gel electrophoresis (PFGE) (10, 12, 49), and multilocus sequence typing (MLST) (10, 26, 31, 41). Among these methods, PFGE, based on chromosomal restriction endonuclease digestion patterns, is widely used for the study of hospital outbreaks and is considered by many to be the “gold standard” molecular typing technique (48). However, this methodology has several limitations due to the facts that it is labor-intensive and the results have poor interlaboratory transportability. This technique is also unsuitable for long-term epidemiology and population studies due to changes in restriction sites, genomic rearrangements, and/or acquisition of DNA by a clonal lineage, which may markedly change the restriction pattern (41). A more appropriate typing technique for long-term epidemiology, which is currently also widely used for subspecies differentiation, is MLST. MLST, based on the allelic variations in sequences of multiple loci, unambiguously types strains (23) and offers an advantage over other techniques used for typing, such as PFGE, since the data are objective and easily stored, compared, and shared via the Internet.
Two different MLST schemes have been used successfully for differentiation of E. faecalis strains (31, 41). The first scheme, which assessed three antigenic genes and one housekeeping gene, found that the allelic profile of two antigenic genes (ace and salA) was sufficient to discriminate the 22 E. faecalis isolates included therein (31). The second MLST scheme, based on the allelic profiles of seven housekeeping genes, was used to type 110 isolates and provided insight into the population structure as well as long-term epidemiological relationships of E. faecalis strains (41). However, typing studies on other organisms, such as Salmonella enterica serovar Typhimurium and Staphylococcus aureus, have suggested that MLST based on housekeeping genes may not provide enough discriminatory power to study hospital outbreaks or to accurately determine short-term genetic relationships, which can be crucial for hospital epidemiology and infection control purposes (9, 13, 27).
Our hypothesis for this work was that a sequence-based methodology applied to genes encoding antigenic or cell surface proteins (rather than housekeeping genes) may potentially be more useful to establish short-term epidemiologic relationships in E. faecalis, since these genes would be more susceptible to evolutionary selective pressures and potentially could identify and discriminate isolates from hospital outbreaks, similar to PFGE.
In the present work, the trilocus sequence types (STT; sequence type based on three genes) of 50 isolates were compared to their multilocus sequence types (STM; sequence type based on seven housekeeping genes). To determine the applicability of trilocus sequence typing (TLST) for a clinical setting, the scheme was also used to type sets of predetermined (by PFGE) clones and was then applied to a collection of hospital isolates from Bogota, Colombia, recently reported by Arias et al. to belong to an ST-2 clonal lineage (1).
(Part of this work was presented at the 47th Interscience Conference on Antimicrobial Agents and Chemotherapy, Chicago, IL, 2007.)
MATERIALS AND METHODS
Bacterial isolates.
Fifty E. faecalis isolates representing 42 diverse STM were chosen for the development and validation (http://www.mlst.net) of the TLST scheme (Table (Table11 [isolates organized by STM]; also see Table S1 in the supplemental material [isolates organized in alphanumeric order]). The isolation dates for these organisms range from 1974 (the laboratory strain JH2-2) to 2005 (a recent clinical strain, TX2853) (2, 11, 51). These isolates were recovered from geographically divergent regions, including Thailand, The Netherlands, Spain, Denmark, Lebanon, India, Poland, Argentina, and various cities in the United States. The isolation sites were also very diverse, with isolates from a variety of clinical sources (including patient blood, wounds, and urine) and animal sources (cows, chickens, pigs, and a seal), human community fecal isolates, and laboratory strains (Table (Table1).1). Isolates belonging to the previously defined Bla+ Vanr endocarditis (BVE) clonal complex and the Argentina-Connecticut Bla+ (ACB) and Houston Vanr 1 (HV1) clones are marked with asterisks in Table Table11 (33). The BVE, ACB, and HV1 lineages were subsequently designated STM-6, STM-9, and STM-2, respectively (41). For isolates TX0635, TX2621, TX0921, and TX4260, discrepancies among allele types (in salA for TX0635 and TX0921, ace for TX2621, and yqiL for TX4260) were found versus what was previously published (6, 31, 33, 41). Since all four isolates were clinical samples, it is possible that more than one subpopulation may have been present in the original sample; it is also possible that samples were inadvertently switched with others and/or that there was a previous error in sequencing. Thus, these cultures were restreaked, and the STM and STT reported in this study for these isolates represent results for a single colony designated with the number 2 next to the isolate name, to differentiate the subpopulation used in this study from that in previous studies. Ten isolates of known PFGE types (belonging to four pulsotype groups that differed within each group by fewer than six bands) were also included.
TABLE 1.
TLST of isolates with known MLST types
| Isolate [alternative name(s)]a | Source; origin; yr of isolation or collectionb | Reference(s) | STMc | STT (allelic profile for ace, salA, lsa)d |
|---|---|---|---|---|
| OGIRF (TX4002, A0212) | Laboratory strain; before 1978 | 2, 12, 25, 31, 41, 50 | 1 | 1 (1, 1, 1) |
| TX2621-2 (A0218)* | Nosocomial blood; Houston, TX; 1996 | 6, 31, 33, 41 | 2 | 2 (8, 2, 9) |
| TX2486 (A0219)* | Nosocomial blood; Houston, TX; 1994 | 6, 25, 31, 33 | 2 | 2 (8, 2, 9) |
| TX0855 (BE83, A0220) | Nosocomial urine; Bangkok, THA; 1980 | 12, 29, 31, 33, 41 | 4 | 4 (5, 1, 32) |
| TX2783 (B-343, A0221) | Chicken product; Logrono, ESP; 1998 | 31, 33, 40, 51 | 5 | 5 (4, 1, 25) |
| TX0052 (A0225)** | Nosocomial endocarditis; Springfield, MO; 1993 | 25, 33, 41 | 6 | 6 (3, 8, 13) |
| V583 (TX2708, A0224)** | Nosocomial blood; St. Louis, MO; 1989 | 33, 37, 41, 42 | 6 | 6 (3, 8, 13) |
| TX0614 (E228, A0222)** | Nosocomial urine; Richmond, VA; 1990 | 25, 31, 33, 43, 50 | 6 | 6 (3, 8, 13) |
| TX0921-2 (HH22, A0223)** | Nosocomial urine; Houston, TX; 1981 | 12, 25, 30, 31, 33, 41, 50 | 6 | 6 (3, 8, 13) |
| JH2-2 (TX4000, A0213) | Laboratory strain; before 1974 | 12, 25, 31, 33, 41, 50 | 8 | 8 (4, 7, 20) |
| TX4260-2 (A1008) | Nosocomial peritoneum; Warsaw, POL; 2005 | 20 | 8 | 8 (4, 7, 20) |
| TX0630 (HG6280, A0214)*** | Nosocomial blood; Buenos Aires, ARG; 1989 | 12, 25, 28, 30, 31, 33, 41, 50 | 9 | 9 (7, 3, 15) |
| TX0635-2 (WH245, A0215)*** | Nosocomial; West Haven, CT; 1986 | 12, 30, 31, 33, 36, 41, 50 | 9 | 9 (7, 3, 15) |
| TX0645 (A0216) | Nosocomial; Beirut, LBN; 1989 | 30, 31, 33, 41, 50 | 10 | 10 (6, 4, 5) |
| TX0860 (BE88, A0217) | Nosocomial catheter tip; Bangkok, THA; 1980 | 12, 29, 31, 33, 41, 50 | 11 | 11 (7, 6, 1) |
| TX2137 (E1798) | Nosocomial feces; Madrid, ESP; 2001 | 41 | 16 | 16 (6, 1, 24) |
| TX4245 (E1872) | Dog cerumen; NLD; 2003 | 41 | 16 | 16 (6, 1, 24) |
| TX4247 (E1876) | Pig joint; NLD; 1994 | 41 | 20 | 20 (17, 20, 3) |
| TX2148 (E1875) | Cow milk; NLD; 1996 | 41 | 21 | 21 (17, 1, 25) |
| TX4243 (E0252) | Calf feces; NLD; 1996 | 41, 44 | 23 | 23 (6, 3, 6) |
| TX2141 (E1825) | Nosocomial blood; Madrid, ESP; 2001 | 41 | 25 | 25 (6, 10, 30) |
| TX4244 (E1022) | Community volunteer feces; NLD; 1998 | 41 | 27 | 27 (21, 25, 19) |
| TX4246 (E1873) | Tick blood; NLD; 1996 | 41 | 29 | 29 (22, 26, 31) |
| TX2134 (E1052) | Community volunteer feces; NLD; 1998 | 41 | 30 | 30 (17, 1, 21) |
| TX2145 (E1843) | Nosocomial blood; Madrid, ESP; 2001 | 41 | 33 | 33 (6, 25, 23) |
| TX2136 (E1796) | Nosocomial feces; Madrid, ESP; year UNK | 41 | 34 | 34 (28, 27, 4) |
| TX2139 (E1802) | Nosocomial feces; Madrid, ESP; year UNK | 41 | 35 | 35 (15, 15, 17) |
| TX2147 (E1845) | Nosocomial blood; Madrid, ESP; 2001 | 41 | 36 | 36 (27, 2, 10) |
| TX2143 (E1839) | Nosocomial blood; Madrid, ESP; 2001 | 41 | 37 | 37 (13, 2, 12) |
| TX2140 (E1803) | Nosocomial feces; Madrid, ESP; year UNK | 41 | 38 | 38 (26, 19, 11) |
| TX2142 (E1837) | Nosocomial blood; Madrid, ESP; 2001 | 41 | 39 | 39 (17, 25, 26) |
| TX2144 (E1840) | Nosocomial blood; Madrid, ESP; 2001 | 41 | 40 | 40 (19, 25, 19) |
| TX4248 (1877) | Seal lymph node; NLD; 2002 | 41 | 40 | 40 (19, 25, 19) |
| TX2135 (E1795) | Nosocomial feces; Madrid, ESP; year UNK | 41 | 44 | 44 (28, 14, 18) |
| TX4242 (A0834, D3) | Pig; DNK; year UNK | 44 | 47 | 47 (11, 23, 7) |
| TX2138 (E1801) | Nosocomial feces; Madrid, ESP; year UNK | 41 | 48 | 48 (12, 17, 12) |
| TX2149 (E1959) | Nosocomial feces; Madrid, ESP; 2001 | 41 | 53 | 53 (14, 24, 8) |
| TX4249 (A0203) | Nosocomial wound swab; IND; year UNK | 41 | 54 | 54 (17, 16, 29) |
| TX4251 (A0206) | Animal feces; Logrono, ESP; 2001 | 41 | 58 | 58 (25, 25, 2) |
| TX2146 (E1844) | Nosocomial blood; Madrid, ESP; 2001 | 41 | 61 | 61 (23, 20, 33) |
| TX4254 (A0802) | Nosocomial blood; Warsaw, POL; 1998 | 20 | 82 | 82 (20, 13, 27) |
| TX4255 (A0808) | Nosocomial blood; Warsaw, POL; 2005 | 20 | 88 | 88 (3, 12, 14) |
| TX4238 (A0823, D28) | Pig; DNK; year UNK | 44 | 96 | 96 (24, 18, 22) |
| TX4239 (A0825, D30) | Pig; DNK; year UNK | 44 | 97 | 97 (10, 11, 29) |
| TX4240 (A0826, D31) | Pig; DNK; year UNK | 44 | 98 | 98 (16, 22, 28) |
| TX4241 (A0828, D33) | Pig; DNK; year UNK | 44 | 99 | 58 (25, 25, 2) |
| TX4257 (A1001) | Nosocomial blood; Gorzow Wielkopolski, POL; 1998 | 20 | 130 | 130 (7, 25, 3) |
| TX4259 (A1006) | Nosocomial blood; Wejherowo, POL; 2001 | 20 | 135 | 135 (6, 11, 29) |
| TX4262 (A1012) | Nosocomial blood; Olsztyn, POL; 2002 | 20 | 140 | 140 (8, 21, 16) |
| TX2853 | Nosocomial; Rochester, MN; 2005 | 51 | 158 | 158 (3, 3, 15) |
The PFGE pattern designations of the isolates are the same as in the original publications (Table (Table2).2). The TLST technique was also applied to 16 of 48 clinical isolates, representative of each of the PFGE subtypes and isolation years (sources include surgical wounds, peritoneal fluid, and patient urine), of vancomycin-resistant E. faecalis isolates recovered in Colombian hospitals from 2001 to 2006 (Table (Table3).3). We reported recently that these isolates belong to an ST-2, vanB-carrying strain of E. faecalis originally found in Houston, TX, in 1994 which has since disseminated in several hospitals in Colombia (1).
TABLE 2.
TLST of groups of clonally related isolates of known PFGE types
| Isolate | Reference(s) | PFGE typea | STT (allelic profile for ace, salA, lsa)b |
|---|---|---|---|
| TX2528 | 5, 6, 25 | V-1 | 12 (8, 2, 9) |
| TX2527 | 5, 6, 25 | V-1a | 12 (8, 2, 9) |
| TX2512 | 6, 25 | V-1 | 12 (8, 2, 9) |
| TX0770 | 25, 29 | C-1 | 6 (3, 8, 13) |
| TX0772 | 29 | C-1a | 6 (3, 8, 13) |
| TX0768 | 25, 29 | C-1 | 6 (3, 8, 13) |
| TX0046 | 25, 50 | E-1 | T512 (12, 33, 34) |
| TX0045 | 25, 50 | E-1 | T512 (12, 33, 34) |
| TX2450a | 25 | F-1 | 40 (19, 25, 19) |
| TX2451a | 25 | F-1a | 40 (19, 25, 19) |
TABLE 3.
Representative isolates from a hospital collection in Colombia typed by TLSTa
| Isolate | Yr of isolation | Hospital | Isolation source |
|---|---|---|---|
| ERV-25 | 2001 | A | Surgical wound |
| ERV-31 | 2001 | A | Peritoneal fluid |
| ERV-37 | 2002 | B | Unknown |
| ERV-41 | 2003 | C | Unknown |
| ERV-62 | 2004 | B | Abscess |
| ERV-63 | 2004 | B | Catheter tip |
| ERV-65 | 2004 | B | Wound secretion |
| ERV-68 | 2004 | B | Hip |
| ERV-72 | 2005 | B | Wound secretion |
| ERV-73 | 2005 | B | Peritoneal fluid |
| ERV-81 | 2005 | A | Urine |
| ERV-85 | 2005 | B | Urine |
| ERV-93 | 2005 | B | Urine |
| ERV-103 | 2006 | D | Secretion |
| ERV-116 | 2006 | D | Peritoneal fluid |
| ERV-129 | 2007 | B | Surgical wound secretion |
PCR, sequencing, analysis, and allele and ST assignment.
Genomic DNA was isolated from a single-colony culture after overnight growth in 5 ml of brain heart infusion broth (Becton Dickinson, Sparks, MD), using a commercial kit (DNeasy tissue kit; Qiagen, Valencia, CA). The intragenic sequences of the ace, salA, and lsa genes were amplified and sequenced using the primers listed in Table S2 in the supplemental material. Due to observed sequence heterogeneity in the forward primer region of salAF, salAF1 was used for isolates TX2148, TX2137, TX2142, TX2134, and TX4245 (see Table S2 in the supplemental material). PCR mixtures contained the following in a 25-μl volume: 0.5 μl of template DNA, 0.5 μl of forward primer (25 pmol/μl), 0.5 μl of reverse primer (25 pmol/μl), 2.5 μl of deoxynucleoside triphosphates, 5 μl of optimized buffer B (1× buffer is 60 mM Tris-HCl [pH 8.5], 15 mM ammonium sulfate, 2 mM MgCl2) obtained from Invitrogen (Carlsbad, CA), 0.1 μl of Invitrogen Taq DNA polymerase (5 units/μl), and 15.9 μl of distilled H2O. PCR was performed with an initial denaturation at 94°C for 2 min, followed by 32 cycles of 94°C for 15 s, 55°C for 15 s, and 72°C for 1 min and a final extension of 72°C for 7 min.
The PCR amplicons were purified using a QIAquick PCR purification kit (Qiagen, Valencia, CA). Sequencing of the amplicons was performed using an Applied Biosystems BigDye Terminator V3.1 cycle sequencing kit (Foster City, CA). Each allele was sequenced from two independent amplicons, using the primers listed in Table S2 in the supplemental material. The sequences were resolved using an ABI Prism 3730 and/or Prism 3130 genetic analyzer. Sequences were assembled using DNASTAR software's SeqMan program (Lasergene, Madison, WI). Sequence types were determined by allelic variations in the three loci (no weight was given to the degree of sequence divergence between alleles). The presence of one or more nucleotide changes was defined as indicating a distinct allele type, which was assigned a number arbitrarily. An STT was defined as the combination of the types for the three alleles. In all cases except one (TX4241), the STT number assigned was chosen to match the previously assigned/published STM sequence type. TX4241 had an identical STT to another isolate, TX4251, although the STM differed, and therefore was assigned the STT of isolate TX4251. To determine the overall divergence of the sequenced gene fragments of the three loci, these sequences were then spliced together to obtain a composite, concatenated sequence for each isolate. Sequence alignments were performed by the Jotun Hein method, using the MegAlign program of DNASTAR software (Madison, WI).
PFGE and MLST.
A previously described method for PFGE was used, with some modifications (12, 29). Agarose plugs containing genomic DNA were digested with SmaI (NEB, Ipswich, MA), and electrophoresis was carried out using a clamped homogenous electrical field (CHEF-DRIII device; Bio-Rad Laboratories, Richmond, CA), with ramped pulse times beginning at 2 s and ending with 50 s, at 200 V for 23 h. The PFGE patterns were interpreted using the criteria suggested by Tenover et al. (48). MLST based on seven housekeeping genes was performed using the criteria and primers for E. faecalis designated by Ruiz-Garbajosa et al. (41). Sequence types were assigned in accordance to the database available at http://efaecalis.mlst.net/.
Statistics.
Simpson's index of diversity was calculated as proposed by Hunter and Gaston (18). To measure the clustering concordance between MLST and TLST, Hubert and Arabie's adjusted Rand index (4, 16) and the Wallace coefficient (4, 55) were calculated using Ridom Epicompare software (http://www.ridom.de/download.shtml).
RESULTS AND DISCUSSION
Selection of third locus for development of a TLST scheme based on nonhousekeeping genes.
Although our previous pilot study indicated that sequence variation in ace and salA was sufficient to differentiate the isolates of the 13 ST studied, our subsequent comparisons with other typing techniques revealed minor discrepancies in their clonal relationships, possibly due to horizontal transfer of the ace gene (31). Therefore, we hypothesized that an additional gene might improve typing discrimination. To select an additional gene, we analyzed the allelic variation in a few possible candidate genes in the sequenced strains V583 (ST-6, clonal complex 2 [CC2]) (37), OG1RF (ST-1) (2), HH22 (ST-6, CC2), and TX0104 (endocarditis-derived strain; ST-2, CC2) (B. E. Murray et al., unpublished data). Based on sequence identities and differences in the sequenced strains, three genes encoding endocarditis and biofilm-associated pili (ebpA, ebpB, and ebpC) (32, 46) and one gene (lsa; lincosamide and streptogramin A resistance) encoding a putative ABC transporter that mediates the intrinsic resistance of E. faecalis to the pristinamycin antibiotic quinupristin-dalfopristin (47) were selected for further analysis. Since lsa offered a reasonable degree of sequence divergence in six additional strains studied, we selected this gene as the third locus. Hence, the modified scheme consisted of the intragenic regions of ace, salA, and lsa.
Establishment of TLST.
For the establishment of TLST, 50 isolates of 42 diverse STM collected in nine different countries and representing human and animal sources were chosen based on their seven-housekeeping-gene STM. For the majority of the isolates, the lengths of the loci analyzed for allele assignment for ace, salA, and lsa were 828 bp (from bp 197 to 1024 of the complete open reading frame), 863 bp or 866 bp (from bp 89 to 954 or 957, due to a 3-bp in-frame deletion [positions 940 to 942] present in 52% of the allele types), and 868 bp (from bp 270 to 1137), respectively. The exceptions were salA of TX2136 (due to a 15-bp in-frame insertion between positions 688 and 689) and lsa of TX4254 (due to a 1-bp insertion between positions 210 and 211). Note that the frameshift mutation in lsa of TX4254 resulted in susceptibility to quinupristin-dalfopristin (Etest MIC = 1 to 1.5 μg/ml); this finding is consistent with published reports on other E. faecalis strains (7, 45, 47). The total number of different allele types identified for 50 isolates was 27, 27, and 33 for ace, salA, and lsa, respectively. As shown in Tables S3 to S5 in the supplemental material, the number of variable sites in each locus ranged from 65 (salA) to 71 (lsa).
Comparison of TLST to MLST.
The discriminatory abilities of MLST and TLST were first compared by the numbers of unique and related STM and STT determined by each method. The 50 isolates used for the development of the TLST scheme, representing 42 distinct STM, were differentiated into 41 different sequence types by TLST (Table (Table1).1). For 36 isolates representing singletons (a single isolate representing one ST) by MLST, 35 STT were found. Thirty-four of the 36 were distinguished as singletons by TLST, and the remaining two MLST singletons (STM-58 and STM-99) were represented by a single STT (STT-58). Closer examination identified that STM-58 is a single-locus variant (SLV) of STM-99 differing in the yqiL locus by one nucleotide; thus, the grouping of these two isolates into one STT resulted because they were indeed closely related. The remaining 14 isolates represented small groups of 2 to 4 isolates belonging to six STM. All of the isolates in each of the six groups were differentiated identically by both TLST and MLST. In conclusion, 49 of the 50 isolates were typed similarly between MLST and TLST. The remaining isolate, TX4241 (which had an identical STT to that of TX4251), was only 1 bp different from TX4251 by MLST and thus was also closely related by MLST. Furthermore, there were no differences between Simpson's indexes of diversity for TLST (D = 99.0%; confidence interval, 0.981 to 1.0) and MLST (D = 99.1%; confidence interval, 0.981 to 1.0), suggesting that the two methods have very similar discriminatory abilities.
To compare the congruence between type assignments by MLST and TLST, the adjusted Rand (4, 16) and Wallace (4, 55) coefficients were also calculated. This showed that the probability that a pair of strains which were classified as the same type by MLST were also classified as the same type by TLST was 100% (based on the Wallace coefficient) and the probability of two strains having the same STT also sharing the same STM was 91.7% (as stated above, a single STT corresponded to two STM that were SLVs that differed by a single nucleotide). The adjusted Rand index (compares the clustering/grouping of isolates sharing similar characteristics according to a given method) for MLST and TLST in this study was 0.956, meaning that the concordance between the two methods was extremely good.
Next, since a previous study had suggested that virulence genes may evolve too rapidly to be useful for assessing the similarity of strains due to evolution from a common ancestor (41), we looked further at evolutionary relationships. It is important that accurate interpretations of clonal relationships between SLVs and double-locus variants (DLVs) by TLST cannot be assessed because of the limited number of loci in the scheme; therefore, we used the percent identities calculated from the concatenated sequences of all three genes (total sequence divergence as opposed to allelic differences) for this analysis. As shown in Fig. Fig.11 and Fig. S1 in the supplemental material, the evolutionary relationships (clustering and branching between isolates/ST) defined between two different ST were in agreement at times and differed substantially at other times between the two techniques. For example, STM-21/STT-21 and STM-5/STT-5 were separated by a single branch in the tree created via the unweighted-pair group method using average linkages (UPGMA) by both techniques (Fig. (Fig.1;1; see Fig. S1 in the supplemental material). STM-21 and STM-5 are SLVs by MLST (99.9% identical by concatenated sequence), and STT-21 and STT-5 are also highly identical by TLST (99.8%), differing by only one allele in both cases. On the other hand, STM-40 was several branches away from STM-27 in the MLST UPGMA tree (see Fig. S1 in the supplemental material) due to differences in five of the seven allele types (99.5% identical); however, STT-40 and STT-27 were separated by only a single branch (differing by one allele) in the TLST UPGMA tree (Fig. (Fig.1)1) and were 99.8% identical by the concatenated TLST genes. This may be indicative of homoplasy (similarity due to convergent evolution) of the genes in the TLST scheme, which implies that this scheme is more suitable for the study of hospital epidemiology than long-term epidemiology.
UPGMA tree generated from the TLST allelic profiles of 50 isolates. The phylogenetic tree was based on the matrix of pairwise differences in TLST allelic sequences, as determined by UPGMA, and was drawn using the sequence type analysis and recombinational tests, version 2 (START2), program (http://pubmlst.org/software/analysis/start2/).
Correlation between PFGE and TLST.
To compare the discriminatory abilities of TLST and PFGE, we studied 10 isolates belonging to four pulsotype groups (which differed by fewer than six bands in each group), as identified by published PFGE results (Table (Table2).2). Four different STT, corresponding to the different PFGE types, were identified, and the isolates within each PFGE group had identical STT (i.e., 100% identical concatenated sequences), thus suggesting that TLST's usefulness in the nosocomial setting is similar to that of PFGE (Table (Table2).2). In contrast, the sequence divergence between the four different PFGE/STT groups for the concatenated sequences ranged from 1.1 to 2.4% (27 to 60 bp different). Also, as previously noted, PFGE is often not suitable for determining long-term epidemiological relationships, which can be identified by sequence typing techniques. For example, isolates which were SLVs or DLVs by MLST were generally considered to be in the same clonal complex. However, as shown in Fig. Fig.2,2, isolates belonging to different STM but considered clonally linked by MLST had differences of more than six bands by PFGE, and consequently the clonal relationships were not widely discernible by this method. Thus, TLST appears to be a fitting alternative for use in hospital epidemiology.
Application of TLST to a hospital collection.
We next applied the TLST scheme to representative isolates of a recently published hospital collection of 48 vancomycin-resistant E. faecalis strains previously typed by PFGE and MLST (Table (Table3)3) (1). By PFGE, these isolates were all considered to be a related clone (seven PFGE subtypes with ≤4 band differences) which had disseminated in six hospitals in Bogota, Colombia. The TLST scheme was applied to a representative isolate of each PFGE subtype as well as to additional isolates to represent each isolation year (Table (Table3).3). By TLST, all isolates typed were STT-2, which matched the previously characterized isolate TX2486 (representing the index isolate for the HV1 clone), an outbreak VanB-type E. faecalis strain isolated in Houston, TX, in 1994. These results are in agreement with the published PFGE and MLST results, which suggested a link between the U.S. strain and this Colombian outbreak clone (1).
Conclusion.
The development and validation of TLST in comparison to the seven-gene MLST scheme found TLST to be as discriminatory as MLST, but utilizing less than half the number of alleles and thus reducing the cost; with recent advances in technology, the reduction could be to less than half the cost of standard MLST. Also, TLST successfully characterized all PFGE-defined clonally related isolates as having the same STT and those different by PFGE type as having different STT and therefore should prove effective for hospital epidemiology for the identification of pathogenic clones. Overall, the discriminatory abilities of TLST for all 76 isolates tested were similar to those of MLST and PFGE.
Acknowledgments
We thank K. V. Singh for his advice and for help with MIC determination. We thank L. S. Jensen (Denmark), T. Coque (Spain), R. del Campo (Spain), and E. Sadowy (Poland) for sending isolates.
This work was supported in part by an NIH grant (R37 AI 47923 to B.E.M.) from the Division of Microbiology and Infectious Diseases, National Institute of Allergy and Infectious Diseases (NIAID). C.A.A. is supported by a K99/R00 Pathway to Independence award (1K99-AI72961) from NIAID. R.J.L.W. is supported by the European Union Sixth Framework Program under contract LSHE-CT-2007-037410. This work was carried out in a facility supported by NIH (CTSA) grant UL1 RR024148.
Footnotes
▿Published ahead of print on 1 July 2009.
†Supplemental material for this article may be found at http://jcm.asm.org/.


