Vaccine. 2011 Jul 12; 29(31): 4923–4932.
PMCID: PMC3133685

Multistrain genome analysis identifies candidate vaccine antigens of Anaplasma marginale


Anaplasmosis in domestic livestock is an impediment to animal health and production worldwide, especially in developing countries in Africa, Asia, and South America. Vaccines have been developed and marketed against the causative organism, Anaplasma marginale; however, these have not been widely used because of breakthrough infections caused by heterologous strains and because of the risk of disease induced by live vaccine strains themselves. Recently, molecular studies have enabled progress to be made in understanding the causes for breakthrough infections and in defining new vaccine targets. A. marginale has a system for antigenic variation of the MSP2 and MSP3 outer membrane proteins which are members of the pfam01617 gene superfamily. In this study, we used high throughput genome sequencing to define conservation of different superfamily members in ten U.S. strains of A. marginale and also in the related live vaccine strain A. marginale subspecies centrale. The comparisons included the pseudogenes that contribute to antigenic variation and other superfamily-encoded outer membrane proteins. Additionally, we examined conservation of other proteins proposed previously as vaccine candidates. These data showed significantly increased numbers of SNPs in A. marginale subspecies centrale when compared to all U.S. A. marginale strains. We defined a catalog of 19 conserved candidate vaccine antigens that may be suitable for development of a multi-component recombinant vaccine. The methods described are rapid and may be suitable for other prokaryotes where repeats comprise a substantial portion of their genomes.

Abbreviations: omp, outer membrane protein; msp, major surface protein; SNP, single nucleotide polymorphism
Keywords: Anaplasma marginale, Genomics, Rickettsiae, Vaccine candidates, Bacteria

1. Introduction

Anaplasma marginale is a pathogen of cattle in the Order Rickettsiales, causing cyclic anemia and occasionally death. The organism causes severe economic losses in livestock production worldwide [1]. Various strategies have been implemented to develop a vaccine to mitigate the impact of this disease. The first attempt at a vaccine was in the early 1900s, with the isolation of A. marginale subspecies centrale, a less virulent strain that gives some cross-protection to fully virulent strains [2]. Other vaccine attempts have included a variety of subunit vaccines, none of which provided complete protection against heterologous challenge [3,4]. In addition, while infection with one strain of A. marginale sensu stricto typically precludes infection with another, multiple cases of superinfection have been described [5–7].

Vaccine failures are due to expression of variants of the major surface proteins MSP2 and MSP3. A. marginale creates a wide array of antigenic variants by substitution of whole or partial pseudogene cassettes into a single genomic expression site by segmental gene conversion [8–11], with increasing complexity of the expressed mosaic proteins [12]. Following persistent infection, the immune system has been exposed to a majority of the simple variants, which prevents another strain with similar variants from establishing concurrent infection. However, if the second strain has a unique pseudogene, novel variants generated by segmental gene conversion allow superinfection to take place [13].

In addition to MSP2 and MSP3, a variety of other variable surface antigens have been found in A. marginale; these have been called the msp2 superfamily [14]. Generally, these are all members of the pfam01617 (Surface Ag 2), which has related members in several other bacterial genera. Several of these have been found in cross-linked surface antigen complexes, and have been suggested as vaccine candidates [15]. A recent study by Agnes et al. used sera from cattle infected with A. marginale subspecies centrale to determine antigens that are cross-protective from sensu stricto challenge [16]. Several other studies have implicated components of the bacterial type 4 secretion system as vaccine candidates [17–19].

In this paper, we examine multiple strains of A. marginale sensu stricto, using high-throughput sequencing techniques to examine the members of the pfam01617 family and the other previously suggested vaccine components to determine their degree of conservation. Proteins that are widely conserved between all strains are candidates for inclusion in cross-protective vaccines. Further, the techniques described can be used to examine other organisms with significant numbers of repeats, allowing rapid determination of conserved proteins for diagnosis and vaccine development.

2. Materials and methods

2.1. DNA isolation and genome sequencing

A. marginale genomic DNA was isolated from highly infected bovine blood taken at the acute stage of infection. Organisms were purified from uninfected erythrocytes and white cells by passage through a cellulose column (C-6288, Sigma, St. Louis, MO) and frozen [20]. Genomic DNA was isolated from organisms using Qiagen genomic DNA kits according to manufacturer protocols. To compare genomes of Florida and Florida-relapse strains bovine #205 was infected with the Florida strain and experienced maximum acute stage parasitemia of 4% on day 37 post-infection and a minimum packed cell volume (pcv) of 18.5% which resolved to the carrier state, with pcv values returning to 35% and no microscopically detectable parasitemia. Bovine #205 was kept in isolation and splenectomized on day 104 post-infection to allow disease recrudescence. Infected blood from the Florida-relapse strain was obtained on day 129 post-infection at 22.5% parasitemia and 23% pcv. A. marginale strains analyzed in the present study were Puerto Rico, Mississippi, Virginia, Florida, Florida-relapse, Florida-Okeechobee, St. Maries-Idaho, South Idaho, Oklahoma and Washington-O. Isolated DNA was provided to the Interdisciplinary Center for Biotechnology Research (ICBR) core facilities, University of Florida for library construction and sequencing on the Roche/454 Genome Sequencer according to standard manufacturer protocols. The SFF format flow files were returned by ICBR for bioinformatics analysis.

2.2. Bioinformatics

MosaikAligner was used to align individual reads with the reference genome sequences [21]. The SFF flow files were first combined and converted to .fasta and .qual files using Roche/454 Genome Sequencer FLX System software, version 2.3. MosaikBuild ( was used to convert reads and the reference sequences to the Mosaik binary format (.dat files). The alignment parameters were: hash size (−hs), 11; maximum percentage of the read length allowed to be errors (−mmp), 0.05; alignment candidate threshold (−act), 20; alignment mode (−m), all. The reference genomes were A. marginale St. Maries, Idaho strain, GenBank CP000030; A. marginale Florida strain, CP001079 and A. marginale subspecies centrale Israel strain, CP001759. MosaikText was used to convert the aligned binary data file to the text-based BAM format (−bam) and samtools [22] to sort and index the BAM file for viewing in Artemis [23,24]. Artemis allows viewing of the alignment of individual reads either zoomed in to detect gaps in alignment with respect to the annotated reference sequence or zoomed out to show SNPs over large genome regions. For these analyses, two corrections were made to the GenBank annotations:

To define the sensitivity for detecting variant genes by Mosaik alignments, we extracted all variable regions for msp2 and msp3 pseudogenes from the three fully sequenced genomes and compared their sequence identities. This was done in an all-against-all analysis of the 22 total msp2 pseudogenes and 22 total msp3 pseudogenes in the three sequenced genomes using a MATGAT matrix [25]. From this analysis we determined that the closest matches for variable regions of msp2 pseudogenes in heterologous genomes ranged from 100 to 73% identity and was 100 to 52% identity for msp3 pseudogenes (see Table 1). We defined the variable regions as the sequence encoding LGKELAY to MAGNIN for msp2 pseudogenes and that encoding LETEEL to KNRG for msp3 pseudogenes. These sequences vary slightly between pseudogenes, for example is more typically LQAEEI to KNRG for msp3 pseudogenes from A. marginale subspecies centrale, but the locations can readily be identified by alignment. Comparing pyrosequencing data to all the known msp2 and msp3 genes showed that all msp2 pseudogenes with the best match in the heterologous strain below 92% variable region identity were detected as absent (−) and all msp3 pseudogenes with below 97% variable region identity were detected as absent (−) (Table 1). Since the Mosaik alignment parameter −mmp allows a 5% error in aligning reads, we conservatively estimate that variant genes are detected as absent if they have <90% identity, but may not be detected as absent if they have >90% identity. In this study we examined the presence or absence of the pfam01617 superfamily including genes encoding OMPs 1 through 15, OPAG1-3 and MSP4 [14,26]; proteins identified by surface cross-linking including their encoding genes AM366, 712, 779, 780, 854, 1011, 1051 [15]; and type 4 secretion system genes AM030, 097, 810, 811, 812, 813, 814, 815, 1053, 1312, 1313, 1314, 1315, 1316 [19]. Numbering refers to annotations of the St. Maries, Idaho strain, CP000030. To be defined as conserved in A. marginale in Table 4 no segment of the genes was detected as absent in any comparisons of pyrosequenced data from each of 10 U.S. strains of A. marginale with the fully sequenced genomes of Florida and St. Maries, Idaho strains. Pyrosequencing data was previously obtained for A. marginale strains Puerto Rico, Mississippi and Virginia and in the present study for A. marginale strains Florida, Florida-relapse, Florida-Okeechobee, St. Maries-Idaho, South Idaho, Oklahoma and Washington-O. The average genome coverages were 40×, 12×, 63×, 59×, 76×, 47×, 117×, 37×, 96×, and 108× for the ten strains, respectively, when compared to the completed genome from the Florida strain. Since we did not have current access to the Mississippi strain and coverage was lower for this strain, we also verified that no gene was determined as not conserved solely because of absence in this one strain.

Table 1
Comparison of msp2 and msp3 genes between pyrosequenced and Sanger-sequenced Anaplasma genomes. Genes detected as present (+) or absent (−) by pyrosequencing are shown together with their % sequence identity (of the best match, by MATGAT), in ...
Table 4
Mosaik/Artemis alignment analysis identifies conserved genes encoding candidate vaccine antigens.

The number of high confidence differences between strains (Table 3) was analyzed using Roche/454 gsMapper software to generate the 454HCDiffs.txt file. The base differences and their locations were extracted with the unix grep command and imported into Excel 2008 (Microsoft, Redmond, WA). The number of differences and their respective frequencies (the percentage of different reads versus total reads that fully span the difference location) were tabulated.

Table 3
Numbers of high confidence differences between Anaplasma strain genomes (gsMapper software). Total differences as well as non-polymorphic differences (found in all reads covering the respective regions, 100% frequency) are shown. Excluding homologous ...

Finally, for coverage and SNP analyses in Fig. 4 and Table 5, the BAM files generated by Mosaik were processed by samtools version 0.12 to generate pileups. Pileups for genes of interest were extracted to determine coverage for each nucleotide position comparing to both the Florida and St. Maries strains. Final coverages for each gene of interest were graphed using Excel 2008. For SNP analysis, raw SFF files were processed by Genomics Workbench (CLC Bio, Aarhus, Denmark), and the output of the SNP identification pipeline was placed into a MySQL database. To increase the stringency of SNP identification, the database was queried for SNPs identified by samtools, and only SNPs identified by both methods are included in the final analysis.

Fig. 4
Genome coverage of pyrosequencing reads over omp15 (A – Florida, B – St. Maries) and omp4 (C – Florida, D – St. Maries) genes from Florida and St. Maries, Idaho strains of A. marginale. Genome position is shown on the ...
Table 5
Total and non-synonymous SNPs in the pfam01617 superfamily and genes encoding candidate vaccine antigens. SNPs – the number of nucleotide positions in each gene with high-confidence changes in at least one other strain (excluding gene segments ...

3. Results

3.1. Comparison of pyrosequencing with Sanger sequencing

Two complete genome sequences of A. marginale strains from the United States (Florida and St. Maries, Idaho) and one of A. marginale subspecies centrale (Israel) are available [14,26,27]. We analyzed high-throughput sequencing data from the Roche/454 instrument on 10 U.S. A. marginale strains, including the previously genome-sequenced Florida and St. Maries strains as controls. Including Florida and St. Maries strains enables a comparison to be made between the new pyrosequencing data and data obtained using Sanger sequencing. We included in this comparison a second Florida strain (Okeechobee) and a second Idaho strain (South Idaho). We also included a Florida relapse strain derived from a persistently infected animal after 129 days of infection, to examine genome changes over a short time period.

The initial analyses compared the original genome sequences with the new pyrosequencing data. This was done by aligning individual pyrosequenced reads with the completed genomes using Mosaik, with visualization of the finished alignments using Artemis. To deal with the known problem of multiple repeats in these genomes, the alignment parameters were set to allow reads to align at multiple different positions in the genome, if this was necessary. A typical result showing alignments with msp2 and msp3 genes is shown in Fig. 1. The top panel shows alignment of Florida strain pyrosequencing data with a region of the Florida genome containing an msp2/msp3 gene pair (AMF_871/872). The reads align over the complete msp2 and msp3 regions, as expected. In the middle panel, a comparison is made between the same Florida strain pyrosequencing data but with a region of the St. Maries, Idaho strain genome encompassing the msp2/msp3 gene pair AM1344/1345. In this case, the previously obtained genome data shows that AM1344 has an exact match (100% identity) with an msp2 copy in the Florida strain genome, but the closest match of the St. Maries msp3 copy AM1345 is to an msp3 copy in the Florida strain with only 78% identity (Table 1). This is revealed by a gap in the aligning sequence reads over the central (hypervariable) region of AM1345, but no gap over AM1344. The lowest panel shows an extreme case where neither the msp2 (AMF_1018) nor the msp3 (AMF_1019) pseudogene from the Florida strain aligns with reads from St. Maries. Comparison of the two genome sequences reveals closest matches between the two genomes of 91% for AMF_1018 and 55% for AMF_1019. This analysis was conducted for all msp2 and msp3 copies in the three genomes, A. marginale (Florida strain), A. marginale (St. Maries, Idaho strain) and A. marginale subspecies centrale (Israel strain). The data revealed that all msp2 and msp3 differences with <90% identity were accurately detected (Table 1).

Fig. 1
Diversity in msp2/msp3 gene pairs between A. marginale strains. Top panel, pyrosequences of Florida strain compared with the msp2/msp3 gene pair AMF_871/872 of the Florida strain genome in Artemis. Both genes are defined as present. Middle panel, pyrosequences ...

3.2. Genome diversity of Anaplasma sp.

We then compared the msp2 and msp3 pseudogenes in all 10 U.S. strains of A. marginale and A. marginale subspecies centrale, by the same method (Table 2). The results showed that no msp2 or msp3 pseudogene from any of these strains of A. marginale from the United States was shared with A. marginale subspecies centrale. Indeed, there was substantial variation in the repertoire of the msp2 and msp3 pseudogenes even within U.S. A. marginale strains, with no msp2 or msp3 copy shared between Oklahoma and St. Maries, Idaho strains and only one of each shared between Oklahoma and Florida strains. Interestingly, there was substantial variation even between strains from the same state, with no msp3 pseudogene shared between the two strains from Idaho and only two msp3 pseudogenes shared between the two strains from Florida (Okeechobee and Florida). In contrast, there was no variation detected between Florida and Florida relapse strains, suggesting that the differences observed reflected evolutionary changes rather than, for example, continuous variation by gene conversion among pseudogenes. It is known from previous analyses that msp2 and msp3 expression site sequences are different in Florida and Florida-relapse strains [10,11]. The most conserved msp2 or msp3 pseudogene was AM1250, absent in only 2/10 strains examined (WA-O and OK).

Table 2
Shared msp2 and msp3 pseudogenes between U.S. A. marginale strains and A. marginale subspecies centrale (Israel strain). No pseudogenes are shared between any of ten U.S. marginale strains and centrale. The repertoire of both msp2 and msp3 pseudogenes ...

We examined whether the diversity observed in msp2 and msp3 genes was also reflected in differences in SNP profiles across the genome. High confidence differences between the genomes obtained using Roche/454 gsMapper software are shown in Table 3. Again, few differences were detected between the previous Sanger and current Roche/454 data. Only 38 differences (at 100% frequency) were detected in the Florida strain genome and 84 in the St. Maries, Idaho genome by the two sequencing strategies. Similarly, there were few differences in the Florida relapse strain compared to Florida. Therefore, pyrosequencing data correlated well with the previously reported sequences from traditional Sanger sequencing. Comparison of pyrosequencing of the Florida strain with the previously reported sequence (CP001079) shows high confidence differences, possibly due to true SNPs or error, of one base per 31,643 nucleotides (at 100% frequency), while comparison of pyrosequencing of the St. Maries strain with the previously reported genome sequence (CP000030) yields a difference of one base per 14,258 nucleotides (at 100% frequency). As seen in previous strain comparisons [27], the number of single nucleotide polymorphisms (SNPs) between U.S. strains of A. marginale is variable, from 0.20% to 0.58% of the genome. However, all strains of A. marginale sensu stricto have significantly increased numbers of SNPs when compared to the A. marginale subsp. centrale strain, ranging from 1.8% to 2.2%.

To visualize the location of differences at the entire genome level, we utilized the “show SNP marks” feature of Artemis for visualizing BAM alignments (Fig. 2). The figure shows the 1/3 of each genome immediately preceding the origin of replication, with SNPs in red. The data show that SNPs are distributed across the genome and agree with Table 3. For example, pyrosequencing data for Florida and Florida-relapse strains closely resemble the genome data derived by Sanger-based sequencing. Furthermore, comparison of Fig. 2 with Tables 2 and 3 clearly reveals the more closely related strains to Florida, i.e. Florida-relapse and Virginia and the more distantly related strains Oklahoma, Washington-O and South Idaho. These relationships are also seen in both SNP numbers and in shared msp2 and msp3 pseudogenes. A similar SNP comparison of U.S. strains of A. marginale with A. marginale subspecies centrale (Fig. 3) shows widely distributed SNPs and many gaps between marginale and centrale where there are no aligning reads. The locations of these gaps were largely identical for all the U.S. A. marginale strains, indicating a more distant sequence relationship between all these strains and the A. marginale subspecies centrale strain.

Fig. 2
Mosaik alignment of pyrosequencing reads with the fully Sanger-sequenced Florida strain genome to show SNPs. The approximately one third of the genome preceding the origin of replication in the Florida strain is shown on the x-axis and the individual ...
Fig. 3
A similar comparison to Fig. 2, but of pyrosequencing reads from multiple A. marginale strains with the fully Sanger-sequenced A. marginale subspecies centrale strain. In addition to SNPs throughout the genomes, there are many gaps in all A. marginale ...

3.3. Conservation of genes encoding proposed vaccine antigens

We next examined the conservation of proposed vaccine antigens from the pfam01617 family, or that have been identified by other strategies. These other strategies involved cross-linking of surface proteins on live organisms by bifunctional reagents, analysis of T-cell responses of immunized and protected animals and identification of components of the type 4 secretion system recognized by T cells [14,15,17–19,26]. The data identified several proteins in each class that were conserved among all 10 U.S. strains of A. marginale (Table 4). Interestingly, none were conserved with A. marginale subspecies centrale. This suggests that relying only on antigens shared between marginale and centrale may not be an optimal strategy for development of vaccines against U.S. strains of A. marginale. Additionally, comparison of the newly sequenced strains with the previously sequenced strains showed multiple genes that are variable in one or more strains; however, no candidate antigen gene was defined as absent in all the newly sequenced strains. Some genes, such as omps2, 7, 8 and 15 were more frequently detected as absent, whereas others, such as omps10 and 14, were detected as absent in only three comparisons between different A. marginale strains. An example of detailed coverage graphs for omp4 (conserved in all strains) and omp15 (variable) genes is shown in Fig. 4. Although omp15 coverage graphs suggest variability of this gene in most strains, the variability is localized to the C-terminus when all strains are compared to Florida and to the central region of omp15 when compared to St. Maries. It should also be recognized that despite their overall conservation and definition as present, non-synonymous SNPs are present in most of the candidate antigen genes (Table 5). There appears to be no trend towards increased numbers of SNPs or decreased conservation when comparing omps that are transcribed in either ticks or cattle [33].

4. Discussion

Development of vaccines against anaplasmosis has received considerable attention over the last 50 years and has resulted in several marketed live and inactivated whole-organism vaccines [28]. None are currently available in the U.S. because of varying efficacy against heterologous strains and/or side-effects such as isoerythrolysis due to contaminating erythrocyte proteins in the vaccines. This has stimulated the search for improved vaccines and also attempts to understand the reasons for the breaks in vaccine protection against heterologous strains [29–31].

The reason for breaks in protection appear to be due to a sophisticated system for antigenic variation, whereby the expressed MSP2 and MSP3 outer membrane proteins continually change in sequence [32]. This is caused by segmental gene conversion of genomic expression sites for MSP2 and MSP3 by genomic pseudogenes [10]. The repertoire of pseudogenes determines the ability of an incoming strain to superinfect a persistently infected carrier animal [13]. We show here that the pseudogene repertoire is extremely diverse for both MSP2 and MSP3 across the U.S., even within A. marginale strains from the same state. No msp2 or msp3 pseudogene was present in all U.S. strains. Therefore, it is unlikely that a vaccine could be developed by trying to include a full repertoire of potential MSP2/MSP3 variants in a vaccine. However, other members of pfam01617 (to which both msp2 and msp3 belong) encode conserved OMPs and are expressed in A. marginale [33] and, therefore, still remain viable vaccine candidates.

Two other vaccine strategies have also been proposed recently. The first [16] relies on the protection afforded by the less virulent strain A. marginale subspecies centrale. This strain has been extensively used in the field in Australia, South Africa, Argentina, Uruguay, Israel, Zimbabwe and Malawi. Recent research has found proteins with immunogenic epitopes shared between marginale and centrale, although the overall protein sequence identities were less than 90% [16], and these have been proposed for inclusion in a subunit vaccine. Although A. marginale subsp. centrale undoubtedly provides some protection against A. marginale strains [35], controlled trials have shown low efficacy of this vaccine against heterologous isolates from South America and Africa [36–39], and infection by A. marginale subspecies centrale does not prevent subsequent superinfection by A. marginale [40]. These data have stimulated the search for less virulent strains of A. marginale to potentially replace the A. marginale subspecies centrale vaccine, and such strains have been identified in Australia and Mexico [41,42].

The second, recently proposed, vaccine strategy relies on the observation that immunization with inactivated organisms or outer membranes can induce sterile protective immunity against challenge with A. marginale [3,43]. Two investigations are particularly noteworthy in this regard: firstly, the identification of the surface proteome of A. marginale [15,17] and secondly, the identification of type 4 secretion system components recognized by T and B cells from protected cattle [19]. However, while sterile immunity against homologous challenge has been achieved, these provide only partial immunity against heterologous challenge. This may be due to the immunodominant responses induced against the hypervariable MSP2 and MSP3 proteins. Compared to these, other antigens, such as the T4SS proteins and other surface proteome molecules, are considered subdominant antigens. These induce weaker and more inconsistent antibody and T cell responses, at least in the context of complex immunogens such as whole organism and membrane vaccines that also contain MSP2 and MSP3 [19]. However, while these responses may be less robust, these antigens appear to be less variable, making them important to include in a vaccine producing pan-strain immunity.

The body of previous research in A. marginale has resulted in a large catalog of potential vaccine candidates. We attempted here to reduce the number of candidate antigens by applying high throughput genome sequencing and bioinformatics analysis to 10 U.S. strains of A. marginale. The intent was to identify the most conserved proteins from all of the above vaccine strategies that may form the core components of a broadly protective vaccine.

We initially verified that pyrosequencing was capable of accurately determining the relationships among already fully sequenced strains and the variable msp2 and msp3 pseudogenes in those strains. We correctly identified the shared msp2 and msp3 pseudogenes and those having <90% identity. This method was then applied to all 10 U.S. strains of A. marginale. Extensive diversity was observed in the repertoire of both msp2 and msp3 pseudogenes among strains, with generally more diversity observed in the complement of msp3 pseudogenes when compared to msp2.

There was also extensive diversity in SNPs among strains, distributed over most of the genome, agreeing with previous observations on a smaller subset of strains [27]. However, the members of the pfam01617 family are relatively well conserved overall, with no protein having <90% identity between all the strains examined. All of these proteins have SNPs, and SNPs within strains have a similar distribution pattern to those described for the rest of the genome in terms of the numbers of strains with polymorphisms.

A surprising observation was the more extensive diversity in A. marginale subspecies centrale when compared to all 10 U.S. A. marginale strains. The taxonomic position of centrale compared to marginale has been debated previously, with some investigators proposing a separate species, Anaplasma centrale [44–46]. However, only a few strains of A. marginale subspecies centrale are available for analysis. We suggest that resolution of this question should await genomic data on non-U.S. strains of both marginale and centrale, particularly strains from Africa. This would resolve whether there is a continuum of strain diversity among marginale strains eventually reaching that of the single currently sequenced centrale strain, originally isolated by Theiler in South Africa. A recent study [47] comparing membrane proteins from a Brazilian strain of A. marginale with Florida and St. Maries determined amino acid sequence identities of 92–100% for all OMPs investigated except OMP7, compared to 40–70% identities with the A. marginale subspecies centrale orthologs. This suggests that the diversity observed here among U.S. strains of A. marginale may at least be representative of marginale strains in North and South America.

Finally, the data reveal the candidate vaccine antigens conserved among U.S. strains of A. marginale. The catalog includes conserved members of pfam01617, as well as components of the bacterial type 4 secretion system and proteins identified by surface cross-linking. Interestingly, it does include three proteins identified previously that contain epitopes shared with A. marginale subspecies centrale, namely OMP11 (AM1255), AM779 and AM854 [16]. However, overall the list is broader than just the antigens conserved between A. marginale sensu stricto and subspecies centrale. It also eliminates less conserved proteins and housekeeping genes which share epitopes between centrale and marginale. Additionally, although conserved, OMP6 and OPAG1 can probably be eliminated from consideration as vaccine candidates as no expressed peptides were detected from the encoding genes in any life cycle stages in prior studies [33,34]. This revised catalog of 19 antigens (see Table 4) would be readily approachable for synthesis by recombinant expression technology and inclusion in a multi-component vaccine for testing. The present genomic data and previous experimental data suggest that such a vaccine may be efficacious against U.S. strains of A. marginale.

These data also illustrate the utility of next-generation sequencing techniques for identification of antigens and epitopes conserved between multiple strains. While rapid sequencing has been used extensively, this study shows its utility in examination of repetitive genes. While these techniques cannot yet assemble a genome through extensive repetitive regions, they can show regions where there is genetic similarity or where homologous regions are missing in newly sequenced strains.


We thank Drs. Guy Palmer and Katherine Kocan for making available strains of A. marginale and Dr. Savita Shanker for supervision of library construction and pyrosequencing. We acknowledge the excellent technical assistance of Anna Lundgren and funding support from the College of Veterinary Medicine and Emerging Pathogens Institute, University of Florida and Wellcome Trust grant no. GR075800M.


1. Aubry P., Geale D.W. A review of bovine anaplasmosis. Transbound Emerg Dis. 2011;58:1–30. [PubMed]
2. Theiler A. Further investigation into anaplasmosis of South African cattle. South Africa; 1911. First report of the Director of Veterinary Research. Department of Agriculture of the Union of South Africa, p. 7–14.
3. Palmer G.H., Munodzana D., Tebele N., Ushe T., McElwain T.F. Heterologous strain challenge of cattle immunized with Anaplasma marginale outer membranes. Vet Immunol Immunopathol. 1994;42:265–273. [PubMed]
4. Palmer G.H., McElwain T.F. Molecular basis for vaccine development against anaplasmosis and babesiosis. Vet Parasitol. 1995;57:233–253. [PubMed]
5. Leverich C.K., Palmer G.H., Knowles D.P., Jr., Brayton K.A. Tick-borne transmission of two genetically distinct Anaplasma marginale strains following superinfection of the mammalian reservoir host. Infect Immun. 2008;76:4066–4070. [PMC free article] [PubMed]
6. Molad T., Fleidrovich L., Mazuz M., Fish L., Leibovitz B., Krigel Y. Genetic diversity of major surface protein 1a of Anaplasma marginale in beef cattle. Vet Microbiol. 2009;136:54–60. [PubMed]
7. Rodriguez J.L., Palmer G.H., Knowles D.P., Jr., Brayton K.A. Distinctly different msp2 pseudogene repertoires in Anaplasma marginale strains that are capable of superinfection. Gene. 2005;361:127–132. [PubMed]
8. Barbet A.F., Lundgren A., Yi J., Rurangirwa F.R., Palmer G.H. Antigenic variation of Anaplasma marginale by expression of MSP2 mosaics. Infect Immun. 2000;68:6133–6138. [PMC free article] [PubMed]
9. Barbet A.F., Yi J., Lundgren A., McEwen B.R., Blouin E.F., Kocan K.M. Antigenic variation of Anaplasma marginale: major surface protein 2 diversity during cyclic transmission between ticks and cattle. Infect Immun. 2001;69:3057–3066. [PMC free article] [PubMed]
10. Brayton K.A., Palmer G.H., Lundgren A., Yi J., Barbet A.F. Antigenic variation of Anaplasma marginale msp2 occurs by combinatorial gene conversion. Mol Microbiol. 2002;43:1151–1159. [PubMed]
11. Meeus P.F., Brayton K.A., Palmer G.H., Barbet A.F. Conservation of a gene conversion mechanism in two distantly related paralogues of Anaplasma marginale. Mol Microbiol. 2003;47:633–643. [PubMed]
12. Futse J.E., Brayton K.A., Knowles D.P., Jr., Palmer G.H. Structural basis for segmental gene conversion in generation of Anaplasma marginale outer membrane protein variants. Mol Microbiol. 2005;57:212–221. [PubMed]
13. Futse J.E., Brayton K.A., Dark M.J., Knowles D.P., Jr., Palmer G.H. Superinfection as a driver of genomic diversification in antigenically variant pathogens. Proc Natl Acad Sci USA. 2008;105:2123–2127. [PMC free article] [PubMed]
14. Brayton K.A., Kappmeyer L.S., Herndon D.R., Dark M.J., Tibbals D.L., Palmer G.H. Complete genome sequencing of Anaplasma marginale reveals that the surface is skewed to two superfamilies of outer membrane proteins. Proc Natl Acad Sci USA. 2005;102:844–849. [PMC free article] [PubMed]
15. Noh S.M., Brayton K.A., Brown W.C., Norimine J., Munske G.R., Davitt C.M. Composition of the surface proteome of Anaplasma marginale and its role in protective immunity induced by outer membrane immunization. Infect Immun. 2008;76:2219–2226. [PMC free article] [PubMed]
16. Agnes J.T., Brayton K.A., Lafollett M., Norimine J., Brown W.C., Palmer G.H. Identification of Anaplasma marginale outer membrane protein antigens conserved between A. marginale sensu stricto strains and the live A. marginale subsp. centrale Vaccine. Infect Immun. 2011;79:1311–1318. [PMC free article] [PubMed]
17. Lopez J.E., Siems W.F., Palmer G.H., Brayton K.A., McGuire T.C., Norimine J. Identification of novel antigenic proteins in a complex Anaplasma marginale outer membrane immunogen by mass spectrometry and genomic mapping. Infect Immun. 2005;73:8109–8118. [PMC free article] [PubMed]
18. Lopez J.E., Palmer G.H., Brayton K.A., Dark M.J., Leach S.E., Brown W.C. Immunogenicity of Anaplasma marginale type IV secretion system proteins in a protective outer membrane vaccine. Infect Immun. 2007;75:2333–2342. [PMC free article] [PubMed]
19. Sutten E.L., Norimine J., Beare P.A., Heinzen R.A., Lopez J.E., Morse K. Anaplasma marginale type IV secretion system proteins VirB2, VirB7 VirB11, and VirD4 are immunogenic components of a protective bacterial membrane vaccine. Infect Immun. 2010;78:1314–1325. [PMC free article] [PubMed]
20. Ambrosio R.E., Potgieter F.T., Nel N. A column purification procedure for the removal of leucocytes from parasite-infected bovine blood. Onderstepoort J vet Res. 1986;53:179–180. [PubMed]
21. Smith D.R., Quinlan A.R., Peckham H.E., Makowsky K., Tao W., Woolf B. Rapid whole-genome mutational profiling using next-generation sequencing technologies. Genome Res. 2008;18:1638–1642. [PMC free article] [PubMed]
22. Li H., Handsaker B., Wysoker A., Fennell T., Ruan J., Homer N. The sequence alignment/map format and SAMtools. Bioinformatics. 2009;25:2078–2079. [PMC free article] [PubMed]
23. Carver T., Berriman M., Tivey A., Patel C., Böhme U., Barrell B.G. Artemis and ACT: viewing, annotating and comparing sequences stored in a relational database. Bioinformatics. 2008;24:2672–2676. [PMC free article] [PubMed]
24. Rutherford K., Parkhill J., Crook J., Horsnell T., Rice P., Rajandream M.A. Artemis: sequence visualization and annotation. Bioinformatics. 2000;16:944–945. [PubMed]
25. Campanella J.J., Bitincka L., Smalley J. MatGAT: an application that generates similarity/identity matrices using protein or DNA sequences. BMC Bioinformatics. 2003;4:29–32. [PMC free article] [PubMed]
26. Herndon D.R., Palmer G.H., Shkap V., Knowles D.P., Jr., Brayton K.A. Complete genome sequence of Anaplasma marginale subsp. centrale. J Bacteriol. 2010;192:379–380. [PMC free article] [PubMed]
27. Dark M.J., Herndon D.R., Kappmeyer L.S., Gonzales M.P., Nordeen E., Palmer G.H. Conservation in the face of diversity: multistrain analysis of an intracellular bacterium. BMC Genomics. 2009;10:16. [PMC free article] [PubMed]
28. Kocan K.M., Blouin E.F., Barbet A.F. Anaplasmosis control past, present, and future. Ann N Y Acad Sci. 2000;916:501–509. [PubMed]
29. Musoke A.J., Palmer G.H., McElwain T.F., Nene V., McKeever D. Prospects for subunit vaccines against tick-borne diseases. Br Vet J. 1996;152:621–639. [PubMed]
30. Musoke A.J., McKeever D., Nene V. Subunit vaccines for the control of tick-borne diseases: implications for the future. Parassitologia. 1997;39:131–137. [PubMed]
31. Palmer G.H., Brown W.C., Rurangirwa F.R. Antigenic variation in the persistence and transmission of the ehrlichia Anaplasma marginale. Microbes Infect. 2000;2:167–176. [PubMed]
32. Palmer G.H., Bankhead T., Lukehart S.A. ‘Nothing is permanent but change’—antigenic variation in persistent bacterial pathogens. Cell Microbiol. 2009;11:1697–1705. [PMC free article] [PubMed]
33. Noh S.M., Brayton K.A., Knowles D.P., Agnes J.T., Dark M.J., Brown W.C. Differential expression and sequence conservation of the Anaplasma marginale msp2 gene superfamily outer membrane proteins. Infect Immun. 2006;74:3471–3479. [PMC free article] [PubMed]
34. Löhr C.V., Brayton K.A., Shkap V., Molad T., Barbet A.F., Brown W.C. Expression of Anaplasma marginale major surface protein 2 operon-associated proteins during mammalian and arthropod infection. Infect Immun. 2002;70:6005–6012. [PMC free article] [PubMed]
35. Anziani O.S., Tarabla H.D., Ford C.A., Galleto C. Vaccination with Anaplasma centrale: response after an experimental challenge with Anaplasma marginale. Trop Anim Health Prod. 1987;19:83–87. [PubMed]
36. Bock R.E., de Vos A.J. Immunity following use of Australian tick fever vaccine: a review of the evidence. Aust Vet J. 2001;79:832–839. [PubMed]
37. Brizuela C.M., Ortellado C.A., Sanabria E., Torres O., Ortigosa D. The safety and efficacy of Australian tick-borne disease vaccine strains in cattle in Paraguay. Vet Parasitol. 1998;76:27–41. [PubMed]
38. Potgieter F.T., van Rensburg L. Infectivity virulence and immunogenicity of Anaplasma centrale live blood vaccine. Onderstepoort J Vet Res. 1983;50:29–31. [PubMed]
39. Turton J.A., Katsande T.C., Matingo M.B., Jorgensen W.K., Ushewokunze-Obatolu U., Dalgliesh R.J. Observations on the use of Anaplasma centrale for immunization of cattle against anaplasmosis in Zimbabwe. Onderstepoort J Vet Res. 1998;65:81–86. [PubMed]
40. Shkap V., Leibovitz B., Krigel Y., Molad T., Fish L., Mazuz M. Concomitant infection of cattle with the vaccine strain Anaplasma marginale ss centrale and field strains of A. marginale. Vet Microbiol. 2008;130:277–284. [PubMed]
41. Osorno M.B., Solana P.M., Perez J.M., Lopez T.R. Study of an attenuated Anaplasma marginale vaccine in Mexico natural challenge of immunity in an enzootic area. Am J Vet Res. 1975;36:631–633. [PubMed]
42. Rodríguez Camarillo S.D., García Ortiz M.A., Rojas Ramírez E.E., Cantó Alarcón G.J., Preciado de la Torre J.F., Rosario Cruz R. Anaplasma marginale Yucatan (Mexico) strain: assessment of low virulence and potential use as a live vaccine. Ann N Y Acad Sci. 2008;1149:98–102. [PubMed]
43. Montenegro-James S., James M.A., Benitez M.T., Leon E., Baek B.K., Guillen A.T. Efficacy of purified Anaplasma marginale initial bodies as a vaccine against anaplasmosis. Parasitol Res. 1991;77:93–101. [PubMed]
44. Inokuma H., Terada Y., Kamio T., Raoult D., Brouqui P. Analysis of the 16S rRNA gene sequence of Anaplasma centrale and its phylogenetic relatedness to other ehrlichiae. Clin Diagn Lab Immunol. 2001;8:241–244. [PMC free article] [PubMed]
45. Lew A.E., Gale K.R., Minchin C.M., Shkap V., De Waal D.T. Phylogenetic analysis of the erythrocytic Anaplasma species based on 16S rDNA and GroEL (HSP60) sequences of A. marginale, A. centrale, and A. ovis and the specific detection of A. centrale vaccine strain. Vet Microbiol. 2003;92:145–160. [PubMed]
46. Liu Z., Luo J., Bai Q., Ma M., Guan G., Yin H. Amplification of 16S rRNA genes of Anaplasma species in China for phylogenetic analysis. Vet Microbiol. 2005;107:145–148. [PubMed]
47. Junior D.S., Araújo F.R., Almeida Junior N.F., Adi S.S., Cheung L.M., Fragoso S.P. Analysis of membrane protein genes in a Brazilian isolate of Anaplasma marginale. Mem Inst Oswaldo Cruz. 2010;105:843–849. [PubMed]

Save items

Related citations in PubMed

See reviews...See all...

Cited by other articles in PMC

See all...


  • BioProject
    BioProject links
  • Gene (nucleotide)
    Gene (nucleotide)
    Records in Gene identified from shared sequence and PMC links.
  • MedGen
    Related information in MedGen
  • Nucleotide
    Primary database (GenBank) nucleotide records reported in the current articles as well as Reference Sequences (RefSeqs) that include the articles as references.
  • PubMed
    PubMed citations for these articles
  • Substance
    PubChem chemical substance records that cite the current articles. These references are taken from those provided on submitted PubChem chemical substance records.
  • Taxonomy
    Taxonomy records associated with the current articles through taxonomic information on related molecular database records (Nucleotide, Protein, Gene, SNP, Structure).
  • Taxonomy Tree
    Taxonomy Tree

Recent Activity

Your browsing activity is empty.

Activity recording is turned off.

Turn recording back on

See more...