Nucleic Acids Res. 2014 Jun;42(11):7113-31.
doi: 10.1093/nar/gku322. Epub 2014 May 5.

The evolutionary dynamics of variant antigen genes in Babesia reveal a history of genomic innovation underlying host-parasite interaction

Andrew P Jackson et al. Nucleic Acids Res. 2014 Jun.

Abstract

Babesia spp. are tick-borne, intraerythrocytic hemoparasites that use antigenic variation to resist host immunity, through sequential modification of the parasite-derived variant erythrocyte surface antigen (VESA) expressed on the infected red blood cell surface. We identified the genomic processes driving antigenic diversity in genes encoding VESA (ves1) through comparative analysis within and between three Babesia species (B. bigemina, B. divergens and B. bovis). Ves1 structure diverges rapidly after speciation, notably through the evolution of shortened forms (ves2) from the 5' ends of canonical ves1 genes. Phylogenetic analyses show that ves1 genes are transposed between loci routinely, whereas ves2 genes are not. Similarly, analysis of sequence mosaicism shows that recombination drives variation in ves1 sequences, but less so in ves2, indicating that the two families have adopted different mechanisms for generating variation. Proteomic analysis of the B. bigemina PR isolate shows that two dominant VESA1 proteins are expressed in the population, whereas numerous VESA2 proteins are co-expressed, consistent with differential transcriptional regulation of each family. Hence, VESA2 proteins are abundant and previously unrecognized elements of Babesia biology, with evolutionary dynamics consistently different to those of VESA1, suggesting that their functions are distinct.

Figures

Figure 1.
Pie charts showing the classification of predicted coding sequences in three Babesia genomes, based on three-way OrthoMCL analysis. Genes with a 1:1:1 distribution are termed ‘conserved’. Genes present in all three species with variable copy number are called ‘semi-conserved’. ves1 genes in Babesia bovis and full-length homologs in other species are represented in yellow. SmORF in B. bovis and ves-like short genes (ves2) in other species are represented in orange. The remaining species-specific genes (either single or multi-copy) are represented by green, blue and purple for Babesia bigemina, B. bovis and Babesia divergens respectively.
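The 'conserved' and 'semi-conserved' categories in this caption can be read as a simple rule on per-species copy numbers within an ortholog group. The following is a minimal sketch of that rule only; the function name, the species labels used as dictionary keys, and the catch-all label for groups missing from at least one species are assumptions for illustration, and the sketch does not reproduce the OrthoMCL clustering itself or the separate assignment of ves1, ves2 and SmORF genes.

```python
# Species labels are illustrative keys, not identifiers from the paper's data files.
SPECIES = ("B. bigemina", "B. bovis", "B. divergens")


def classify_group(copies):
    """Classify one ortholog group from its per-species gene copy numbers.

    copies: dict mapping a species label to the number of genes that
    species contributes to the group (missing species count as 0).
    """
    counts = [copies.get(species, 0) for species in SPECIES]
    if all(count == 1 for count in counts):
        # Exactly one gene per species: the 1:1:1 distribution.
        return "conserved"
    if all(count >= 1 for count in counts):
        # Present in all three species, but copy number varies.
        return "semi-conserved"
    # Hypothetical catch-all label: absent from at least one species.
    return "not shared by all three"


print(classify_group({"B. bigemina": 1, "B. bovis": 1, "B. divergens": 1}))
print(classify_group({"B. bigemina": 3, "B. bovis": 1, "B. divergens": 2}))
```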
Figure 2.
Comparison of gene order at three regions of chromosomal rearrangement. Forward and reverse strands are represented by horizontal bars, colour-coded by species (purple: Babesia divergens 1802A; green: Babesia bigemina BOND; blue: Babesia bovis T2Bo). Genes are indicated by boxes within reading frames. ves-like gene models are colour-coded as indicated by the key. Vertical grey bars between genomes represent significant BLASTn hits as calculated in ACT. (A) The region spanning 1154–1238 kb of chromosome 1 in B. bigemina, which corresponds to chromosomal breakpoints in both other species. (B) The region spanning 302–364 kb of chromosome 2 in B. bigemina, which corresponds to a chromosomal breakpoint in B. divergens. (C) The region spanning 975–1335 kb of chromosome 3 in B. bigemina that is conserved in both other species but which has experienced numerous B. bigemina-specific insertions of BbigVes1b genes (shaded blue). The genomic locations of regions A–C are shown in Supplementary Figure S1A/B.
Figure 3.
Ves gene repertoire in Babesia genome sequences. Gene models are drawn to scale (average lengths are shown) and are represented by shaded boxes (exons) and lines (introns). The presence of low complexity regions (typically repetitive and with variable length) and the conserved C-terminal domain (containing a single transmembrane helix) are indicated.
Figure 4.
(A) Unrooted maximum likelihood phylogeny of ves1 genes from Babesia spp. based on a multiple nucleotide sequence alignment corresponding to the conserved C-terminal domain of VESA1 only (840 characters). A GTR+Γ model was applied. Support for principal nodes is indicated by non-parametric bootstraps and posterior probabilities from a Bayesian analysis using the same model. (B) Sequence similarity network based on FASTA scores from pair-wise comparisons of VESA1 and VESA2 amino acid sequences, generated using BioLayout Express v3.0. Individual sequences are represented by spheres, shaded by gene family, connected by lines that represent sequence homology. The network was organized such that edge length is minimized and spheres are positioned nearest to their closest relatives. A lower threshold has been applied to exclude poor sequence matches, leaving only the strongest similarities as determined by FASTA. SmORF sequences were included, but no FASTA scores exceeded the threshold. Ves-like gene families are labelled as described in the text; a single Babesia bovis sequence that clusters close to BbigVes2 (BBOV_III002580) is shown with a red circle.
Figure 5.
Frequency histogram showing ranked abundance of peptides detected in proteomic analysis of Babesia bigemina PR. The positions of VESA1 (light green) and VESA2 (dark green) predicted proteins are shown with filled circles. Frequency histograms of VESA-like proteins only are shown in the insets.
Figure 6.
Comparison of event costs required to reconcile ves1 and ves2 gene phylogenies. For each ves1 and ves2 gene family, phylogenies were estimated for positionally-conserved genes, i.e. loci conserved in both the reference strain and one other strain. In the absence of recombination after the strains diverge, such trees should have the same topology. Significance of topological congruence is assessed through phylogenetic reconciliation using the programme Jane 4, whereby evolutionary events are posited to explain topological disparities between the trees. Each histogram shows the frequency distribution of event costs for 100 randomized trees generated by permuting the reference strain phylogeny, compared to the observed event cost (vertical dashed line). Where the observed event cost falls within the distribution of randomized event costs, there is no significant agreement between the trees, which we interpret as evidence for recombination. P-values represent the probability of obtaining the observed cost in randomized co-phylogenies (i.e. of observed tree similarity being due to chance), and are means taken over all cost combinations.
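The comparison of an observed event cost against costs from randomized trees is a permutation test. The sketch below shows one common way to turn such a comparison into an empirical P-value, assuming the P-value is the fraction of randomized costs that are at or below the observed cost; that definition, the function name and the toy numbers are illustrative assumptions, not values or code from the study or from Jane 4.

```python
def empirical_p_value(observed_cost, randomized_costs):
    """Fraction of randomized reconciliation costs at or below the observed cost.

    A small value means few randomized trees reconcile as cheaply as the
    observed pair, i.e. the observed topological agreement is unlikely to
    arise by chance. A large value means the observed cost sits inside the
    randomized distribution.
    """
    if not randomized_costs:
        raise ValueError("at least one randomized cost is required")
    at_or_below = sum(1 for cost in randomized_costs if cost <= observed_cost)
    return at_or_below / len(randomized_costs)


# Toy costs for illustration only (not data from the paper).
toy_randomized = [12, 15, 9, 20]
print(empirical_p_value(10, toy_randomized))
print(empirical_p_value(5, toy_randomized))
```

Under this reading, a family whose observed cost overlaps the randomized distribution yields a large P-value, which the caption interprets as evidence for recombination.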
Figure 7.
Evidence for recombination among gene copies for ves-like gene families in Babesia spp. using two programs: 3seq (A) and PhiPack (B). The proportion of sub-alignments showing significant phylogenetic incompatibility (Ppi) is shown for ves1 and ves2 gene families, with bars shaded by species as previously.
Figure 8.
Comparison of P-values in tests for phylogenetic incompatibility (using 3seq). Mean P-value is shown for each ves1 and ves2 gene family ± one standard deviation. The values are converted to a negative natural log scale for ease of comparison. Gene families are shaded as previously. The results of pair-wise t-tests between all P-values returned in each analysis are indicated (n.s. = not significant). Note that most P-values for BdivVes1b tests were zero, although a minority returned non-zero (but highly significant) log values. This circumstance is responsible for the apparently large standard deviation around this mean. In fact, all BdivVes1b sequence triplets returned highly significant tests for phylogenetic incompatibility. *** P < 0.001, * P < 0.05.
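The negative natural log conversion mentioned in this caption can be sketched as follows. The function name is hypothetical, and returning infinity for a P-value of exactly zero is an assumption made here for illustration; the caption does not state how zero P-values were handled in the plotted means.

```python
import math


def neg_log_p(p_value):
    """Convert a P-value to the negative natural log scale.

    Smaller P-values map to larger positive numbers. A P-value of exactly
    zero has no finite logarithm, so this sketch returns infinity for it
    (an assumption, not the study's documented treatment).
    """
    if not 0.0 <= p_value <= 1.0:
        raise ValueError("P-value must lie in [0, 1]")
    if p_value == 0.0:
        return math.inf
    return -math.log(p_value)


print(neg_log_p(0.05))
print(neg_log_p(0.001))
print(neg_log_p(0.0))
```

A mixture of zero and very small non-zero P-values therefore spans a wide range on this scale, which is consistent with the caption's remark about the large standard deviation for BdivVes1b.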
