Genome-wide diversity and gene expression profiling of Babesia microti isolates identify polymorphic genes that mediate host-pathogen interactions

Silva, Joana C.; Cornillot, Emmanuel; McCracken, Carrie; Usmani-Brown, Sahar; Dwivedi, Ankit; Ifeonu, Olukemi O.; Crabtree, Jonathan; Gotia, Hanzel T.; Virji, Azan Z.; Reynes, Christelle; Colinge, Jacques; Kumar, Vidya; Lawres, Lauren; Pazzi, Joseph E.; Pablo, Jozelyn V.; Hung, Chris; Brancato, Jana; Kumari, Priti; Orvis, Joshua; Tretina, Kyle; Chibucos, Marcus; Ott, Sandy; Sadzewicz, Lisa; Sengamalay, Naomi; Shetty, Amol C.; Su, Qi; Tallon, Luke; Fraser, Claire M.; Frutos, Roger; Molina, Douglas M.; Krause, Peter J.; Ben Mamoun, Choukri

doi:10.1038/srep35284

Download PDF

Article
Open access
Published: 18 October 2016

Genome-wide diversity and gene expression profiling of Babesia microti isolates identify polymorphic genes that mediate host-pathogen interactions

Joana C. Silva^1,2,
Emmanuel Cornillot^3,4,
Carrie McCracken¹,
Sahar Usmani-Brown^5,6,
Ankit Dwivedi^3,4,
Olukemi O. Ifeonu¹,
Jonathan Crabtree¹,
Hanzel T. Gotia¹,
Azan Z. Virji⁵,
Christelle Reynes⁷,
Jacques Colinge⁴,
Vidya Kumar⁵,
Lauren Lawres⁵,
Joseph E. Pazzi⁸,
Jozelyn V. Pablo⁸,
Chris Hung⁸,
Jana Brancato⁶,
Priti Kumari¹,
Joshua Orvis¹,
Kyle Tretina¹,
Marcus Chibucos^1,2,
Sandy Ott¹,
Lisa Sadzewicz¹,
Naomi Sengamalay¹,
Amol C. Shetty¹,
Qi Su¹,
Luke Tallon¹,
Claire M. Fraser¹,
Roger Frutos^9,10,
Douglas M. Molina⁸,
Peter J. Krause⁶ &
…
Choukri Ben Mamoun⁵

Scientific Reports volume 6, Article number: 35284 (2016) Cite this article

3851 Accesses
63 Citations
7 Altmetric
Metrics details

Subjects

Abstract

Babesia microti, a tick-transmitted, intraerythrocytic protozoan parasite circulating mainly among small mammals, is the primary cause of human babesiosis. While most cases are transmitted by Ixodes ticks, the disease may also be transmitted through blood transfusion and perinatally. A comprehensive analysis of genome composition, genetic diversity, and gene expression profiling of seven B. microti isolates revealed that genetic variation in isolates from the Northeast United States is almost exclusively associated with genes encoding the surface proteome and secretome of the parasite. Furthermore, we found that polymorphism is restricted to a small number of genes, which are highly expressed during infection. In order to identify pathogen-encoded factors involved in host-parasite interactions, we screened a proteome array comprised of 174 B. microti proteins, including several predicted members of the parasite secretome. Using this immuno-proteomic approach we identified several novel antigens that trigger strong host immune responses during the onset of infection. The genomic and immunological data presented herein provide the first insights into the determinants of B. microti interaction with its mammalian hosts and their relevance for understanding the selective pressures acting on parasite evolution.

Babesia duncani multi-omics identifies virulence factors and drug targets

Article Open access 13 April 2023

Antigen Discovery, Bioinformatics and Biological Characterization of Novel Immunodominant Babesia microti Antigens

Article Open access 12 June 2020

A high-quality Ixodes scapularis genome advances tick science

Article 19 January 2023

Introduction

Babesia microti, the primary etiologic agent of human babesiosis, is an emerging health threat worldwide and particularly in the United States. It circulates in a tick vector – mammalian reservoir host cycle, with humans as dead-end hosts. Transmission to humans is primarily effected by ticks in the genus Ixodes, but can also occur through blood transfusion and, rarely, through transplacental transmission¹. The first documented case of human babesiosis attributed to B. microti in the United States was reported in Nantucket Island, MA in 1969². Over the past decade, there has been a significant increase in the number of babesiosis cases among both immunocompromised and immunocompetent patients^1,2,3,4. Patients with asplenia, HIV infection, cancer, hemoglobinopathy, organ transplantation, or those on immunosuppressive drugs or who acquire the infection through blood transfusion, manifest particularly severe disease, sometimes requiring hospital admission and occasionally ending in death or prolonged relapsing illness^1,4.

Current therapies for the treatment of human babesiosis consist of combinations of atovaquone plus azithromycin or quinine plus clindamycin^1,5. Although these drugs have been extensively used in recent years, quinine and clindamycin combination is associated with major side effects, and drug failure has been reported with atovaquone and azithromycin. In some cases treatment can be achieved with higher drug doses, longer treatment duration and/or exchange transfusion, while in others the use of alternative drugs is needed due to microbial resistance^{3,5,6,7,8,9,10}. Furthermore, the mechanism by which these drugs exert their anti-Babesia activity has only recently begun to be elucidated. Recent studies using a short-term in vitro culture as well as immunocompromised mice have shown that, of the four drugs used for treatment of human babesiosis, only atovaquone shows efficacy against the parasite in mouse red blood cells both in vitro and in vivo¹¹. These results, along with the shortcomings of available diagnostic tools to distinguish between past and active infection to prevent transfusion-transmitted babesiosis, have stimulated efforts to improve therapies and diagnostics^{11,12,13,14,15,16,17,18,19}.

There is limited knowledge of B. microti diversity in the context of pathogenesis and host-pathogen interactions^20,21,22. Similarly, it is uncertain how parasite variability and host adaptation may impact its virulence, its successful transmission to humans, and disease diagnosis and therapy. The paucity of information about this parasite is due in part to the lack of data on genetic variation among isolates, lack of a continuous in vitro culture system for propagation of the parasite in human or mouse red blood cells, as well as the absence of tools and resources to manipulate its genome in order to characterize gene function in microbial development and virulence.

Efforts aimed at understanding B. microti population diversity and structure, and differentiate between parasite genotypes, have in the past relied on the use of PCR amplification of the 18S rDNA, β-tubulin and the chaperonin-containing t-complex polypeptide 1 (CCT7) (reviewed in ref. 23). More recently, Goethert and colleagues used a genotyping approach based on variable number of tandem repeat loci and identified at least two major populations and shed new light on the mode of expansion of the parasite in southern New England²⁴.

The first B. microti genome sequencing and analysis were conducted on an isolate named the R1 strain, which provided initial information about genome composition, structure, metabolism, and the phylogenetic placement of the species^25,26. The availability of new sequencing technologies has made it possible to perform genome-wide profiling of genetic polymorphisms in a large number of species including several protozoan parasites^{27,28,29,30,31}. These analyses have significantly improved our understanding of the diversity and evolution of these pathogens, provided insights into their virulence, host modulation and the genetic basis of drug resistance phenotypes, and helped to develop novel approaches for disease control and prevention, and informed public health policy^{32,33,34,35,36,37}.

Herein we report a systematic and comprehensive study of the genetic and transcriptional diversity of seven B. microti isolates. Using data from genomic and transcriptomic analyses, we have re-annotated the entire genome of the reference B. microti R1 strain, characterized the genomic diversity among isolates, and identified the full complement of genes encoding the parasite’s secretome and surface proteome. We show that polymorphism in genes that encode secreted and surface proteins accounts for most of the variations found among the B. microti isolates analyzed. We generated and screened a protein array consisting of 174 B. microti proteins using sera from mice infected with B. microti and sera of uninfected mouse controls. We identified 30 new parasite antigens that trigger a strong host response and determined that some of these antigens are encoded by the genes that are most polymorphic among these isolates. The fact that the most highly expressed genes in B. microti are unique to this species suggests that it has evolved novel mechanisms for survival within human red blood cells and to interact with its mammalian host. The genetic patterns reported here are valuable tools for a better understanding of B. microti pathogenesis and mode of transmission, and may contribute to improve disease diagnosis and control.

Results

Genome variability among B. microti isolates

To advance our understanding of B. microti genetic diversity and adaptation to the host environment, we performed high-throughput, whole genome sequencing of seven B. microti isolates (Supplementary Table S1), re-sequenced the reference R1 strain, and generated RNAseq data from six of these isolates (Supplementary Table S2). All nucleic acid material was obtained from the intra-erythrocytic life cycle stage of parasites propagated in rodents. Nearly all (98.3%) of the 3,567 protein-coding genes in the B. microti genome were found to be expressed, as defined by an average RPKM ≥ 10 and ≥4X coverage. Our analyses revealed that gene expression is highly correlated among isolates (0.81 < r < 0.94; Fig. 1), with intron-exon splice sites clearly defined in the vast majority of genes (http://jbrowse.igs.umaryland.edu/b_microti). This information provided exceptional transcript resolution and made possible extensive manual curation and validation of the structure of nearly all protein-coding genes, resulting in improved structural characterization of 52% of all these genes (Supplementary Tables S3–S5). The updated nuclear genome annotation consists of 3,615 genes, 3,567 of which encode proteins, making it one of the most gene-dense genomes identified so far among the Apicomplexa (Supplementary Table S3). The updated re-annotation identified 70 new genes, with changes to previously predicted genes being relatively minor. Based on RNAseq analysis, ~64% of all exons in the new annotation were correctly predicted in the original annotation and ~34% were re-annotated. Only 4% of all nucleotides that are part of protein-coding sequences had been incorrectly assigned to introns or intergenic regions in the original annotation (Supplementary Table S4). Remarkably, B. microti genes are characterized by a preponderance of unusually small introns ranging in size between 18 and 21 nucleotides (some of which are in frame), a rare occurrence among eukaryotes (Supplementary Fig. S1). We also performed deep sequencing of the B. microti R1 isolate in order to validate the sequence of the reference genome. Only 36 differences were found (Tables 1 and 2), mostly associated with chromosome ends (78%). Of these, 35 are insertions/deletions (indels), with 25 being in intergenic regions. Nine of these differences (one SNP and eight indels) may correspond to sequencing errors in the original assembly, as the alternate variant was found not only in the R1 re-sequencing data but also in all seven newly sequenced B. microti isolates. The remaining 27 differences either also correspond to sequencing errors in the original assembly or else may have accumulated during passaging of the R1 isolate in gerbils.

Table 1 Distribution of SNPs relative to R1, in seven B. microti isolates and in the re-sequenced R1 isolate.

Full size table

Table 2 Distribution of positions with indels relative to R1, in at least one of the seven B. microti isolates and in the re-sequenced R1 isolate^a.

Full size table

Analysis of genetic variation among genomes revealed a remarkable dearth of genetic diversity (Tables 1 and 2, Fig. 2), despite the fact that the isolates were collected from different geographic areas, across several decades, and represent recent clinical infections as well as long-established lab strains (Supplementary Table S1). A total of 889 variable positions, defined by either single nucleotide polymorphisms (SNPs) or short indels, were found in the 6,395,281 bp B. microti R1 genome assembly (Tables 1 and 2, Supplementary Table S8). The average pairwise difference between each isolate and the reference R1 is 588 SNPs (Table 1), corresponding to 0.9 SNPs/10Kb, a frequency over one order of magnitude lower than current SNP density estimates for the human malaria parasite Plasmodium falciparum³⁸. The majority of the variable positions were found to be R1-specific, with 515 SNPs and 103 indels unique to this isolate (Tables 1 and 2, Supplementary Table S7, and Fig. 3A). On the other hand, only 262 variable sites (with SNP or indels) were found among the other seven genomes, with an average of 14 mutations being unique to each isolate, and 150 shared by two or more isolates (Supplementary Table S6). SNPs were enriched in non-coding segments (Fig. 3B), which correspond to 26% of the genome but accumulated ~38% of all SNPs (Chi-square, P < 0.0001). The distribution of indels was even more skewed, with only 20% of all indels found in coding sequences (CDSs) and nearly half of them result in in-frame mutations. Of the 3568 protein-coding genes, only 205 carried a combined total of 257 amino acid-altering mutations, including indels of length not a multiple of three, and non-synonymous, read-through or non-sense SNPs.

Twenty seven genes contain nearly one third of all non-synonymous mutations (31 SNPs) and indels (48 indels) (Figs 2 and 4). More than half of these 27 genes encode surface or secreted antigens, including five members of the BMN2 gene family, which accumulated 23 of the 257 non-synonymous mutations (Figs 2 and 4). The most variable gene identified in this analysis is BBM_04g09980, with 11 mutations, and is located in the sub-telomeric region of chromosome IV. This gene is also differentially expressed between isolates (Fig. 5). Interestingly, the intergenic regions flanking BBM_04g09980 are also highly polymorphic. Overall, our analyses revealed that chromosome ends account for 9.3% of genome variations observed in the genome of B. microti (Fig. 6A). The genes in this region of the chromosomes are variable between strains (Fig. 6A) and are associated with the presence of indels. SNPs in these regions were often below the quality threshold applied for calling because of the presence of several sequences that are repeated multiple times. Sequencing of PCR products confirmed the scarcity of SNPs at chromosome ends.

Analysis of microsatellites in the genome of the B. microti isolates revealed 336 micro- and mini-satellites ranging in length between 2 and 375 bp. Among these, 12 are variable among strains with 8 found in coding sequences, 3 in intergenic regions, including previously described BMV4²⁴, and 1 in an intron. Clustering of the seven isolates based on these microsatellites showed that they form three major clusters one comprised of G1 and PRA-99, the second of Naushon, N11–50 and GreenwichYale_Lab_Strain_1 (LabS1), with the ATCC-30222 and the R1 isolate forming a sister group to the other isolates (Fig. 6B).

Identification of B. microti genes encoding components of the secretome and potentially under immune selection

To identify proteins of B. microti that might play a role in host-parasite interaction and immune modulation, we curated the proteome for all possible members of the B. microti secretome, including GPI-anchored, secreted and transmembrane proteins, based on primary sequence attributes as well as on homology to members of the malarial secretome (Supplementary Table S8). B. microti proteins predicted to localize to intracellular organelles were excluded from this set (Fig. 7). The B. microti secretome consists of 420 proteins, 19 of which are GPI-anchored (GPI), 196 are predicted to be soluble secreted proteins (SEC) and 205 associated with a membrane (TM). This set encompasses all previously described antigens such as the BmP94 antigen (BBM_04g08155), the maltese-cross seroactive antigen (BBM_04g07535), most of the BMN genes, and several small multigene families (Tpr, Vesa, Rhomboid, CRMP, PSOP, LCCL and CPW-WPC)^{26,39,40,41,42,43,44,45}. More than half of the genes encoding components of the B. microti secretome are unique to this species. 299 of which encode hypothetical proteins, including 82 that have homologs in P. falciparum. Whereas approximately 60% of all B. microti proteins have homologs in P. falciparum, that proportion is only 36% among the 420 proteins of the B. microti secretome^{46,47,48,49,50}. Conversely, of the >500 P. falciparum proteins predicted to be secreted (http://mpmp.huji.ac.il/), only 151 have homologs in B. microti. As evidence of major differences between B. microti and P. falciparum in their interaction with the host cell and its remodeling, none of P. falciparum red blood cell (RBC)-targeted proteins are found in B. microti (Supplementary Table S8) and no homologs to the components of the P. falciparum PTEX translocon are found in this parasite⁵¹. Furthermore, of all the known microneme and rhoptry proteins of P. falciparum, only 16 are found in B. microti, including nine homologs of rhoptry-associated proteins (BmRAP1–9), homologs of components of the moving junction BmAMA1, BmRON2, BmRON4 and BmRON5, two homologs of the rhoptry bulb constituents (BmRhopH2 and BmARO) and another microneme protein BmPLP1⁵² (Fig. 7C). Interestingly, a homolog of the endoplasmic reticulum protease Plasmepsin V (BBM_04g05270), proposed to play a role in the processing of some secreted proteins in P. falciparum⁵³, is found in the B. microti proteome, albeit with little sequence homology in the C-terminus part of the aspartyl protease A1 domain (Supplementary Fig. 2). No protein homologs of the targets of Plasmepsin V in P. falciparum^46,48,49, T. gondii⁵⁴ or B. bovis⁵⁵ could be found in B. microti. New studies are needed to identify possible targets of the Plasmespin V–like peptidase from B. microti.

RNAseq analysis showed that while some of the genes encoding members of the B. microti secretome are highly expressed during blood stage infection, others are either not expressed or expressed at very low levels during this phase of the parasite life cycle (Fig. 1). Members of the sub-telomeric multigene families, including the Tpr-like genes, are expressed but at different levels, suggesting that they are independently regulated.

Differential gene expression among B. microti isolates

Transcriptional analysis showed a few major differences in expression levels among B. microti genes, with secretome protein classes being among the most variable (Figs 1 and 8A–I). The three most expressed genes in B. microti in both mice and hamsters are those encoding the GPI-anchored proteins BmGPI12 and BmGPI13 and the sugar:H+ symporter BmHT1. Comparison of gene expression between different isolates shows that the vast majority of the genes are similarly expressed in all isolates, with correlation of gene expression between each pair of isolates ranging from 81% and 94%. However, there are some noticeable exceptions, with 410 genes (including 33 rRNA and tRNA genes) that are differentially expressed among strains (defined as RPKM differing by more than 3 fold from the median RPKM; Supplementary Table S8), with differences between isolates surpassing 30 fold. The threshold was benchmarked using several housekeeping genes including the 18S rDNA, and the genes encoding B. microti translation elongation factor EF1α and EF1β, glyceraldehyde-3-phosphate dehydrogenase, succinate dehydrogenase subunits and lactate dehydrogenase (Fig. 8A–C). Thirty nine genes showed differential expression with levels of expression at least 10X higher or lower than the median. These include members of the putative parasite secretome as well as a neck kinase 4 ortholog (BmNEK4: Bm_03g00715), which was highly expressed only in the B. microti ATCC-30222 isolate (Fig. 8F). Other genes showed differential expression in at least 2 isolates, and include six encoding hypothetical proteins and members of the parasite secretome.

Different B. microti isolates showed different host specificity, and therefore we have also compared host-dependent expression differences between isolates. Using EdgeR and DEseq2 methods to correlate gene expression to host specificity, 59 genes were identified in both analyses (Fig. 8G–I); 47 were up-regulated in isolates grown in hamsters, and 12 genes were up-regulated in isolates propagated in SCID mice. Of the 59 genes, 50 were found to be differentially expressed between isolates grown in different hosts with at least 3-fold increase or decrease in expression (Supplementary Fig. 3). Twenty-seven of these genes encode proteins with unknown function, whereas twelve are members of the secretome and six are involved in the regulation of protein expression, including the E3-ubiquitin ligase subunit, elongation factor eIF1B subunit and the mitochondrial subunit S8. Trafficking and cytoskeleton-related functions were attributed to three and two proteins, respectively. The secretome group includes three BmS48/45 genes, encoding the GPI-anchored antigen homolog of the P. falciparum sexual stage antigen Pfs48/45, which are highly expressed in parasites grown in hamsters.

The high prevalence of candidate antigen-encoding genes among differentially expressed B. microti genes, and the fact that these genes are twice as likely to be polymorphic as other parasite genes, suggest a possible role for these antigens in immune modulation.

Immunoproteomic analysis of B. microti major antigens

In order to identify B. microti proteins that trigger a humoral immune response, we developed a reverse phase, antigen down, protein array consisting of 174 predicted proteins. We screened the array using pre-immune as well as immune sera collected from wild type Swiss Webster mice at days 4, 8, 12, and 16 following inoculation with B. microti Lab Strain 1. Whereas no antibodies could be detected with naïve, pre-immune sera, analysis of the kinetics of the humoral immune response associated with B. microti infection phase identified several new antigens, 62% of which were constituents of the B. microti secretome (Figs 9 and 10). Detectable levels of IgM antibodies were measured as early as day 4 post-infection and increased significantly over time, peaking at day 8 and remaining high until day 16 post-infection (Figs 9 and 10). In contrast, IgG antibodies were very low at day 4 and increased over time reaching their peak at day 16 (Figs 9 and 10). The immune signature of the top 20 IgG or IgM most highly antigenic proteins identified 30 proteins (Fig. 9). Nearly half (14/30) are part of the secretome (5 GPI, 6 SEC and 3 TM). Only four genes are polymorphic, with one variable site each (SNP or indel). Interestingly, all US isolates outside R1 encode the same allele in each of those loci. Analysis of the protein array data resulted in the identification of three subsets of 10 proteins each. The first set includes proteins that trigger strong IgM and IgG responses starting at day 4 for the former and at day 8 for the latter, and remain high until day 16. This subset includes BmGPI12/BmSA1 (BBM_01g00985), a secreted S1/P1 nuclease (BBM_02g03140), BmRON2 (BBM_03g04695) and two secreted hypothetical proteins (BBM_01g00985 and BBM_03g00947). With the exception of BmGPI6 and BmGPI17 (BBM_02g00896 and BBM_03g03430 respectively), all genes encoding antigens in this subgroup are among the top 10% most expressed genes in B. microti. The strongest immunogenic responses were obtained against BBM_01g00985- and BBM_03g00947-encoded peptides, both of which are part of the secretome. Both genes contain non-synonymous polymorphisms (Supplementary Table S8), including a variable microsattelite in BBM_03g00947 which supports the three groupings shown in Fig. 6B. In addition, BBM_03g00947 is downregulated in the Naushon strain relative to the other isolates. The second subset consists of proteins that trigger a significant IgG response that increases over time, and peaks between days 12 and 16, but induced only a moderate to weak IgM response over the 16-day period. Most notable among these are three GPI-anchored proteins (BmGPI9 and 10) and the N1–15 maltese-cross seroactive antigen orthologue (BBM_04g07535). Seven of the proteins in this set are members of the secretome. The third subset includes proteins that trigger a strong immune IgM response and a low to weak IgG response. Half of the antigens in this group are members of the secretome including BmRON5 and two members of the BMN2 family.

Discussion

In this study we have combined genomic sequencing of seven B. microti isolates with transcriptomic analyses to systematically characterize the diversity of this emerging pathogen. Our sequencing of seven new isolates and re-sequencing of the R1 reference genome confirmed the previous genome analysis, which indicated that B. microti has the smallest apicomplexan genome available to date, and is among the most gene-dense. Draft genome assemblies generated for the different isolates confirmed a genome size around 6.5 Mb (Table S2), approximately one-fourth the size of Plasmodium genomes. Despite the significant difference in genome size, a careful manual curation of gene models, facilitated by RNAseq data, showed the total number of B. microti-encoded genes to be 3567, almost two thirds the number in the P. falciparum genome, which consists of 5324 genes. The parasite secretome consists of 420 proteins, over 10% of its proteome. Secretion of these proteins to the host membrane or environment to remodel the host cell, acquire nutrients, or modulate the host immune response most likely involve the standard secretion pathway. Our analysis showed that no components of the Plasmodium translocon exist in B. microti, and that no homologues of proteins secreted through the translocon pathway are found in B. microti. Analyses based on sequence similarity failed also to suggest the use of other known secretion pathways, such as those associated with dense granules in Toxoplasma gondii⁵⁴ or spherical bodies in Babesia bovis⁵⁵. The role of the Plasmepsin V–like peptidase found in B. microti remains to be clarified in the absence of large multigene families.

By comparing the sequences of seven new B. microti isolates with the genome of the reference R1 isolate, we have identified only a total of close to 900 variable sites, including 588 SNPs and 301 indels. An analysis of the distribution of SNP-associated parameter values for each parameter considered, together with amplicon sequencing-based validation, was critical for the accurate identification of SNPs and in particular the elimination of false positive variants. The extraordinarily low sequence polymorphism found among these isolates, which originate primarily (but not uniquely) from the Northeast region of the United States, suggest that they all share a very recent common ancestor, possibly in the hundreds of years, assuming a mutation rate similar to other eukaryotes⁵⁶. However, this finding needs to be confirmed with a more extensive population survey, accurately identified sequence variants, and coalescent modeling. Quite surprising is the lack of sequence divergence between the isolate ATCC 30222, thought to be originally from Zaire, and the remaining isolates, all of which are believed to originate from the Northeast United States. This issue that might require additional investigation to ensure the provenance of this ATCC isolate.

Three major variation-associated patterns were found among the B. microti isolates examined in this study. First, the R1 isolate appears significantly different from all other isolates with R1-specific mutations representing 90% of all microsatellites and nearly 70% of all SNPs and small indels. Re-sequencing of the R1 isolate further validated the uniqueness of the R1 genome. Interestingly, R1 was isolated from a babesiosis patient who likely contracted the disease in Nantucket, MA. It is possible that the R1 isolate represents a different B. microti lineage from all other isolates. A recent study by Lemieux and colleagues⁵⁶, released while this article was under review, suggests that all non-R1 isolates sequenced here likely belong to a New England lineage of B. microti separate from that containing the R1 reference. Second, non-R1 specific mutations, and differences in gene expression among isolates, are significantly associated with chromosome ends, a pattern similar to the accumulation of new mutations documented in other apicomplexans^57,58. Finally, much of the non-synonymous variation identified among isolates falls disproportionately in a small number of genes, including many members of the secretome, suggestive of immune system-related selective pressure.

Immuno-proteomic analyses show that few members of the secretome induce IgM and/or IgG responses (Fig. 9). Among them, the BmSA1 antigen (BBM_03g00785) has already been placed among the most promising proteins for the development of a detection assay for B. microti in blood samples¹⁹. Two other proteins from the secretome provide even stronger signal by reverse phase analysis in mice: BBM_01g00985 and BBM_03g00947. Combined analysis of the genome-wide variation, transcriptome and immuno-proteome further confirmed the relevance of the GPI-anchored protein set in parasite-host interaction. The GPI-anchored proteome of B. microti is composed of only 19 proteins¹⁹, but six were found to be among the 20 proteins inducing the strongest IgG or IgM responses. We also found that nine of GPI-anchored proteins were among the top ten most highly expressed genes, among the set of genes harboring non-synonymous mutations and/or among the set of differentially expressed genes (Fig. 5). All genes from the BMN2 family are among these proteins. Nine of these were members of the secretome but they show little immunogenicity, suggesting a possible role in antigenic variation.

It remains unknown whether B. microti host preference can be linked to specific genetic determinants. However, two new lines of evidence generated in this study support this possibility. First, RNAseq analysis revealed 59 parasite genes with significantly different expression levels between isolates grown in different rodent host systems. In addition, our attempts to propagate these isolates in small rodents revealed clear host preferences.

In conclusion, our genomic and transcriptomic analyses of B. microti isolates provides initial evidence that B. microti strains from the Northeast region of the U.S. are not highly diverse and that most polymorphisms in this parasite are found in genes encoding proteins likely to be involved in host-pathogen interactions. Several antigens might prove useful in the development of a specific and sensitive assay for rapid detection of B. microti infection as well as for antibody-based targeted therapy.

Material and Methods

Ethics statement

All animal experimental protocols followed Yale University institutional guidelines for care and use of laboratory animals and were approved by the Institutional Animal Care and Use Committees (IACUC) at Yale University (Protocol #2010-07689). Yale University is accredited by the American Association for Accreditation of Laboratory Animal Care (AAALAC Number 101), and has an approved Animal Welfare Assurance (#A3230-01, effective until 5/31/2011) on file with the NIH Office for Protection from Research Risks. Rules for ending experiments in animals were to be enacted if animals showed any signs of distress or appeared moribund. This, however, was not the case for any animals in the study.

Animals

CB17/Icr-Prkdc^scid/IcrIcoCrl mice and Golder Syrian Hamsters were purchased from Charles River, Inc. Animals were inoculated with infected blood via i.p. injection and monitored for infection. Parasitemia was determined using standard methods for collecting a drop of blood from the tail vein and using this blood to perform Giemsa staining.

B. microti Isolates

Babesia microti isolates used in this study are: GreenwichYale_Lab_Strain_1 (Lab_Strain_1), a tick isolate propagated in mice and kindly provided by Dr. Durland Fish. Two isolates obtained from BEI Resources: ATCC-30222, and ATCC-PRA99. Two isolates kindly provided by Dr. Sam Telford: GI and Naushon. Two clinical isolates obtained from blood collected from babesiosis patients in 2011 and 2014, respectively: Nan_Hs_2011_N11-50 (N11-50) and Bm1438. These isolates were injected into SCID mice and/or hamsters and infection was monitored for at least 2 months by microscopy analysis of Giemsa-stained blood (Supplemental Table 1).

Serum collection

Mouse sera were collected as follows. Five 6-week old female Swiss Webster mice were used to collect blood on day 0 (pre-immune sera) prior to infection with the B. microti LabS1 strain. Infection was achieved by IP injecting of 10⁷ iRBCs previously collected from an infected SCID mouse. Blood was then collected from the five mice on days 4, 8, 12 and 16 in microcentrifuge tubes and left at room temperature for 3 hours. After centrifugation at 4 °C for 10 minutes at 13,000 rpm, the serum fraction was collected in microcentrifuge tubes and stored at −80 °C until used. All animal experimental protocols followed Yale University institutional guidelines for care and use of laboratory animals and were approved by the Institutional Animal Care and Use Committee (IACUC) at Yale University.

Genome and RNA sequencing, assembly, structural and functional annotation, and differential gene expression analyses

Detailed protocols for genome and RNA sequencing of B. microti isolates are provided in Supplemental Methods.

Identification of variable mini- and microsatellites, single nucleotide polymorphisms (SNPs) and Nsmall indels

Tandem Repeat Finder (TRF)⁵⁹ was used to identify all micro-satellites and mini-satellites (mx-satellites) in the reference B. microti R1genome assembly²⁵. In house Perl scripts were used to extract unique sequences flanking the identified mx-satellites. Then, the BLAST⁶⁰ aligner was used to locate each of the unique flanking sequences in other B. microti isolate, hence revealing the presence and copy number of mx-satellites homologous to those in the reference R1 genome. In-house Perl scripts were used to determine length and copy number variability of mx-satellites in each isolate compared to reference R1. Bedtools⁶¹ was used to determine if a microsatellite was present in the exonic, intergenic or intronic region in the genome.

To identify SNPs and small indels, whole genome shotgun sequencing data for each of the strains was aligned to the reference B. microti R1 genome²⁵ using BWA⁶². Data was formatted using SAM tools⁶³ and Picard tools v.1.79 (http://broadinstitute.github.io/picard), and SNP variant calling and filtering using the Genome Analysis Toolkit GATK, UnifiedGenotyper, v2.2.5⁶⁴. In order to remove potential false positives. identified variants were filtered according to the following parameters values: (DP < 12) || (QUAL < 50) || (SB > −0.10) || (MQ0 > = 2 && (MQ0/(1.0 * DP)) > 0.1). SNPs that passed filter were attributed to non-coding or coding regions using VCFannotator (http://sourceforge.net/projects/vcfannotator) in the context of the reference genome re-annotation annotation.

Two approaches were then used to define true variations in the set of B. microti genomes. The first variant approach calling using parameters described above provide a list of 1490 possible variation sites where more than 95% were single point mutations. Indels were analyzed differently from SNP. All indels were kept for further analysis whereas the choice of GATK parameters was trained for the choice of the correct filtering threshold. Sanger sequencing of PCR products was performed for six loci: BBM_01g00985, BBM_02g04060, BBM_02g04280, BBM_03g00885, BBM_03g04060, BBM_04g09150. None of the variation described in the vcf files in these regions could confirmed at experimental level. Analysis was done on the 8 strains for loci BBM_02g04280 and BBM_03g04060 and on R1 only for the four other loci. Sequencing confirms the need for a specific and stringent variant calling method. All GATK parameters computed using default option were tested for the signal they provide and correspondence with training mutations. The histogram of several parameter including the ABHom, DP and MQ value was constructed per isolate for each SNP positions. The ABHom parameter evaluating homozygosis at a locus provide valid information over the threshold of 0.85. MQ and DP parameters were also analyzed. We keep SNP position where MQ was over 58. DP had no impact after these two threshold were chosen but in some isolates, DP could be equal to zero in some isolate and this information was identified as uncovered region. We found 889 (588 single point mutations and 301 insertions/deletions) highly reliable variants in the nuclear genome after analysis of eight NGS sequence.

Strain clustering Analyses based on sequence variation

Unsupervised hierarchical clustering was performed for 7 samples based on 12 variable microsatellites. The pairwise distance between the samples was calculated as the proportion of base substitutions between them over the number of variable microsatellites, i.e. for pair of isolates (number of pairwise differences)/(total number of variable sites). Unsupervised hierarchical clustering was also performed based on RNAseq data for 146 differentially expressed genes. Pairwise distance among 6 isolates was calculated as the Euclidean distance. The Ward minimum variance method was used as a metric to build the dendrogram in R for both approaches. Conserved nodes were identified between the two clustering results.

Immunoproteomic analyses

Detailed protocols for cloning of B. microti cDNAs, microarray fabrication and immunoproteomic analyses are provided in Supplemental Methods.

Statistical analysis of antibody binding intensity

The data matrix of the compiled intensity data, or “raw” immuno-proteomic data files, were imported in the statistical programing environment R (https://www.r-project.org/). The normalization procedure was as follows: (1) Peak intensity was normalized to the sum of all signals on the array for B. microti spots, and (2) intensity of each spot was calibrated to the maximum signal detected in the array. The normalized data (range between 0 and 100%) provide a relative measure of the B. microti antigenic response over time compared to day 16 where samples show maximum signal intensity. The data were grouped by time point and sorted by reactivity, and visualized using the RColorbrewer R package to create the color scheme and the gplots R package to generate the heatmap.

Data Access

Accession numbers for WGS read alignments on reference genome bam files, de novo assemblies and RNAseq reads are given in Additional File 2: Table S2. The updated annotation of nuclear chromosomes 1–4 will be associated with features with accession number FO082871, FO082872, LN871598 and LN871598, respectively.

Additional Information

How to cite this article: Silva, J. C. et al. Genome-wide diversity and gene expression profiling of Babesia microti isolates identify polymorphic genes that mediate host-pathogen interactions. Sci. Rep. 6, 35284; doi: 10.1038/srep35284 (2016).

References

Vannier, E. G., Diuk-Wasser, M. A., Ben Mamoun, C. & Krause, P. J. Babesiosis. Infectious disease clinics of North America 29, 357–370, 10.1016/j.idc.2015.02.008 (2015).
Article PubMed PubMed Central Google Scholar
Western, K. A., Benson, G. D., Gleason, N. N., Healy, G. R. & Schultz, M. G. Babesiosis in a Massachusetts resident. N Engl J Med 283, 854–856, 10.1056/NEJM197010152831607 (1970).
Article CAS PubMed Google Scholar
Krause, P. J. et al. Persistent and relapsing babesiosis in immunocompromised patients. Clinical Infectious Diseases 46, 370–376 (2008).
Article PubMed Google Scholar
Vannier, E. & Krause, P. J. Human babesiosis. N Engl J Med 366, 2397–2407, 10.1056/NEJMra1202018 (2012).
Article CAS PubMed Google Scholar
Krause, P. J. et al. Atovaquone and azithromycin for the treatment of babesiosis. N Engl J Med 343, 1454–1458, 10.1056/NEJM200011163432004 (2000).
Article CAS PubMed Google Scholar
Krause, P. J. et al. Persistent parasitemia after acute babesiosis. N Engl J Med 339, 160–165 (1998).
Article CAS PubMed Google Scholar
Sharma, D., Mudduluru, B., Moussaly, E., Mobarakai, N. & Hurford, M. Babesia in a Nonsplenectomized Patient Requiring Exchange Transfusion. Case Rep Infect Dis 2015, 405263, 10.1155/2015/405263 (2015).
Article PubMed PubMed Central Google Scholar
Vyas, J. M., Telford, S. R. & Robbins, G. K. Treatment of refractory Babesia microti infection with atovaquone-proguanil in an HIV-infected patient: case report. Clinical infectious diseases: an official publication of the Infectious Diseases Society of America 45, 1588–1590, 10.1086/523731 (2007).
Article CAS Google Scholar
Yager, P. H., Luginbuhl, L. M. & Dekker, J. P. Case records of the Massachusetts General Hospital. Case 6–2014. A 35-day-old boy with fever, vomiting, mottled skin, and severe anemia. N Engl J Med 370, 753–762, 10.1056/NEJMcpc1208155 (2014).
Article CAS PubMed Google Scholar
Wormser, G. P. et al. Emergence of resistance to azithromycin-atovaquone in immunocompromised patients with Babesia microti infection. Clinical infectious diseases: an official publication of the Infectious Diseases Society of America 50, 381–386, 10.1086/649859 (2010).
Article CAS Google Scholar
Lawres, L. A. et al. Radical cure of experimental babesiosis in immunodeficient mice using a combination of an endochin-like quinolone and atovaquone. J Exp Med 213, 1307–1318, 10.1084/jem.20151519 (2016).
Article CAS PubMed PubMed Central Google Scholar
Bloch, E. M. et al. Development of a real-time polymerase chain reaction assay for sensitive detection and quantitation of Babesia microti infection. Transfusion 53, 2299–2306, 10.1111/trf.12098 (2013).
Article CAS PubMed Google Scholar
Johnson, S. T. et al. Babesia microti real-time polymerase chain reaction testing of Connecticut blood donors: potential implications for screening algorithms. Transfusion 53, 2644–2649, 10.1111/trf.12125 (2013).
Article CAS PubMed Google Scholar
Levin, A. E. et al. Determination of Babesia microti seroprevalence in blood donor populations using an investigational enzyme immunoassay. Transfusion 54, 2237–2244, 10.1111/trf.12763 (2014).
Article CAS PubMed PubMed Central Google Scholar
Priest, J. W. et al. Multiplex assay detection of immunoglobulin G antibodies that recognize Babesia microti antigens. Clinical and vaccine immunology: CVI 19, 1539–1548, 10.1128/CVI.00313–12 (2012).
Article CAS PubMed PubMed Central Google Scholar
Rollend, L. et al. Quantitative PCR for detection of Babesia microti in Ixodes scapularis ticks and in human blood. Vector borne and zoonotic diseases 13, 784–790, 10.1089/vbz.2011.0935 (2013).
Article PubMed PubMed Central Google Scholar
Wang, G., Villafuerte, P., Zhuge, J., Visintainer, P. & Wormser, G. P. Comparison of a quantitative PCR assay with peripheral blood smear examination for detection and quantitation of Babesia microti infection in humans. Diagnostic microbiology and infectious disease 82, 109–113, 10.1016/j.diagmicrobio.2015.03.010 (2015).
Article CAS PubMed Google Scholar
Wang, G. et al. Utilization of a real-time PCR assay for diagnosis of Babesia microti infection in clinical practice. Ticks and tick-borne diseases 6, 376–382, 10.1016/j.ttbdis.2015.03.001 (2015).
Article PubMed Google Scholar
Cornillot, E. et al. A targeted immunomic approach identifies diagnostic antigens in the human pathogen Babesia microti. Transfusion 56, 2085–2099, 10.1111/trf.13640 (2016).
Article CAS PubMed PubMed Central Google Scholar
Rudzinska, M. A. Ultrastructure of intraerythrocytic Babesia microti with emphasis on the feeding mechanism. J Protozool 23, 224–233 (1976).
Article CAS PubMed Google Scholar
Krause, P. J. et al. Shared features in the pathobiology of babesiosis and malaria. Trends Parasitol 23, 605–610, 10.1016/j.pt.2007.09.005 (2007).
Article CAS PubMed Google Scholar
Clark, I. A. et al. Absence of erythrocyte sequestration in a case of babesiosis in a splenectomized human patient. Malar J 5, 69, 10.1186/1475–2875–5–69 (2006).
Article PubMed PubMed Central Google Scholar
Schnittger, L., Rodriguez, A. E., Florin-Christensen, M. & Morrison, D. A. Babesia: A world emerging. Infection, Genetics and Evolution 12, 1788–1809, 10.1016/j.meegid.2012.07.004 (2012).
Article PubMed Google Scholar
Goethert, H. K. & Telford, S. R., 3rd . Not “out of Nantucket”: Babesia microti in southern New England comprises at least two major populations. Parasit Vectors 7, 546, 10.1186/s13071-014-0546-y (2014).
Article CAS PubMed PubMed Central Google Scholar
Cornillot, E. et al. Whole genome mapping and re-organization of the nuclear and mitochondrial genomes of Babesia microti isolates. PloS one 8, e72657, 10.1371/journal.pone.0072657 (2013).
Article CAS ADS PubMed PubMed Central Google Scholar
Cornillot, E. et al. Sequencing of the smallest Apicomplexan genome from the human pathogen Babesia microti. Nucleic acids research 40, 9102–9114, 10.1093/nar/gks700 (2012).
Article CAS PubMed PubMed Central Google Scholar
Assefa, S. et al. Population genomic structure and adaptation in the zoonotic malaria parasite Plasmodium knowlesi. Proc Natl Acad Sci USA 112, 13027–13032, 10.1073/pnas.1509534112 (2015).
Article CAS ADS PubMed PubMed Central Google Scholar
Hayashida, K. et al. Whole-genome sequencing of Theileria parva strains provides insight into parasite migration and diversification in the African continent. DNA Res 20, 209–220, 10.1093/dnares/dst003 (2013).
Article CAS PubMed PubMed Central Google Scholar
Manske, M. et al. Analysis of Plasmodium falciparum diversity in natural infections by deep sequencing. Nature 487, 375–379, 10.1038/nature11174 (2012).
Article CAS ADS PubMed PubMed Central Google Scholar
Hupalo, D. N. et al. Population genomics studies identify signatures of global dispersal and drug resistance in Plasmodium vivax. Nat Genet 48, 953–958, 10.1038/ng.3588 (2016).
Article CAS PubMed PubMed Central Google Scholar
Ifeonu, O. O. et al. Annotated draft genome sequences of three species of Cryptosporidium: C. meleagridis isolate UKMEL1, C. baileyi isolate TAMU-09Q1, and C. hominis isolates TU502_2012 and UKH1. Pathog Dis, 10.1093/femspd/ftw080 (2016).
Malaria, G. E. N. P. f. C. P. Genomic epidemiology of artemisinin resistant malaria. Elife 5, 10.7554/eLife.08714 (2016).
Flannery, E. L. et al. Next-Generation Sequencing of Patient Samples Shows Evidence of Direct Evolution in Drug-Resistance Genes. ACS Infect Dis 1, 367–379, 10.1021/acsinfecdis.5b00049 (2015).
Article CAS PubMed PubMed Central Google Scholar
Carlton, J. M. et al. Population Genetics, Evolutionary Genomics, and Genome-Wide Studies of Malaria: A View Across the International Centers of Excellence for Malaria Research. Am J Trop Med Hyg 93, 87–98, 10.4269/ajtmh.15-0049 (2015).
Article CAS PubMed PubMed Central Google Scholar
Takala-Harrison, S. & Laufer, M. K. Antimalarial drug resistance in Africa: key lessons for the future. Ann N Y Acad Sci 1342, 62–67, 10.1111/nyas.12766 (2015).
Article CAS ADS PubMed PubMed Central Google Scholar
Miotto, O. et al. Genetic architecture of artemisinin-resistant Plasmodium falciparum. Nat Genet 47, 226–234, 10.1038/ng.3189 (2015).
Article CAS PubMed PubMed Central Google Scholar
Brown, T. S. et al. Plasmodium falciparum field isolates from areas of repeated emergence of drug resistant malaria show no evidence of hypermutator phenotype. Infect Genet Evol 30, 318–322, 10.1016/j.meegid.2014.12.010 (2015).
Article PubMed Google Scholar
Neafsey, D. E. et al. The malaria parasite Plasmodium vivax exhibits greater genetic diversity than Plasmodium falciparum. Nat Genet 44, 1046–1050, 10.1038/ng.2373 (2012).
Article CAS PubMed PubMed Central Google Scholar
Lodes, M. J. et al. Serological expression cloning of novel immunoreactive antigens of Babesia microti. Infect Immun 68, 2783–2790 (2000).
Article CAS PubMed PubMed Central Google Scholar
Yokoyama, N. et al. Roles of the Maltese cross form in the development of parasitemia and protection against Babesia microti infection in mice. Infect Immun 71, 411–417 (2003).
Article CAS PubMed PubMed Central Google Scholar
Homer, M. J. et al. Identification and characterization of putative secreted antigens from Babesia microti. Journal of clinical microbiology 41, 723–729 (2003).
Article CAS PubMed PubMed Central Google Scholar
Ooka, H. et al. Molecular and immunological characterization of a novel 32-kDa secreted protein of Babesia microti. J Parasitol 98, 1045–1048, 10.1645/GE-2999.1 (2012).
Article CAS PubMed Google Scholar
Ooka, H. et al. Babesia microti: molecular and antigenic characterizations of a novel 94-kDa protein (BmP94). Exp Parasitol 127, 287–293, 10.1016/j.exppara.2010.06.018 (2011).
Article CAS PubMed Google Scholar
Luo, Y. et al. Identification and characterization of a novel secreted antigen 1 of Babesia microti and evaluation of its potential use in enzyme-linked immunosorbent assay and immunochromatographic test. Parasitology international 60, 119–125, 10.1016/j.parint.2010.11.001 (2011).
Article CAS PubMed Google Scholar
Cao, S. et al. Identification and characterization of an interspersed repeat antigen of Babesia microti (BmIRA). Exp Parasitol 133, 346–352, 10.1016/j.exppara.2012.12.015 (2013).
Article CAS PubMed Google Scholar
Sargeant, T. J. et al. Lineage-specific expansion of proteins exported to erythrocytes in malaria parasites. Genome Biol 7, R12, 10.1186/gb-2006–7–2-r12 (2006).
Article PubMed PubMed Central Google Scholar
Anantharaman, V., Iyer, L. M., Balaji, S. & Aravind, L. Adhesion molecules and other secreted host-interaction determinants in Apicomplexa: insights from comparative genomics. Int Rev Cytol 262, 1–74, 10.1016/S0074-7696(07)62001-4 (2007).
Article CAS PubMed Google Scholar
Hiller, N. L. et al. A host-targeting signal in virulence proteins reveals a secretome in malarial infection. Science 306, 1934–1937, 10.1126/science.1102737 (2004).
Article CAS ADS PubMed Google Scholar
Marti, M., Good, R. T., Rug, M., Knuepfer, E. & Cowman, A. F. Targeting malaria virulence and remodeling proteins to the host erythrocyte. Science 306, 1930–1933, 10.1126/science.1102452 (2004).
Article CAS ADS PubMed Google Scholar
Ginsburg, H. Malaria Parasite Metabolic Pathways, http://mpmp.huji.ac.il/ (2014).
Elsworth, B. et al. PTEX is an essential nexus for protein export in malaria parasites. Nature 511, 587–591, 10.1038/nature13555 (2014).
Article CAS ADS PubMed Google Scholar
Besteiro, S., Dubremetz, J. F. & Lebrun, M. The moving junction of apicomplexan parasites: a key structure for invasion. Cell Microbiol 13, 797–805, 10.1111/j.1462-5822.2011.01597.x (2011).
Article CAS PubMed Google Scholar
Sedwick, C. & Plasmepsin V, a secret weapon against malaria. PLoS Biol 12, e1001898, 10.1371/journal.pbio.1001898 (2014).
Article CAS PubMed Google Scholar
Hsiao, C. H., Luisa Hiller, N., Haldar, K. & Knoll, L. J. A HT/PEXEL motif in Toxoplasma dense granule proteins is a signal for protein cleavage but not export into the host cell. Traffic 14, 519–531, 10.1111/tra.12049 (2013).
Article CAS PubMed PubMed Central Google Scholar
Pelle, K. G. et al. Shared elements of host-targeting pathways among apicomplexan parasites of differing lifestyles. Cell Microbiol 17, 1618–1639, 10.1111/cmi.12460 (2015).
Article CAS PubMed Google Scholar
Lemieux, J. E. et al. A global map of genetic diversity in Babesia microti reveals strong population structure and identifies variants associated with clinical relapse. Nat Microbiol 1, 16079, 10.1038/nmicrobiol.2016.79 (2016).
Article CAS PubMed PubMed Central Google Scholar
Bopp, S. E. et al. Mitotic evolution of Plasmodium falciparum shows a stable core genome but recombination in antigen families. PLoS Genet 9, e1003293, 10.1371/journal.pgen.1003293 (2013).
Article CAS PubMed PubMed Central Google Scholar
Norling, M. et al. The genomes of three stocks comprising the most widely utilized live sporozoite Theileria parva vaccine exhibit very different degrees and patterns of sequence divergence. BMC Genomics 16, 729, 10.1186/s12864-015-1910-9 (2015).
Article CAS PubMed PubMed Central Google Scholar
Benson, G. Tandem repeats finder: a program to analyze DNA sequences. Nucleic acids research 27, 573–580 (1999).
Article CAS PubMed PubMed Central Google Scholar
Altschul, S. F., Gish, W., Miller, W., Myers, E. W. & Lipman, D. J. Basic local alignment search tool. Journal of molecular biology 215, 403–410, 10.1016/S0022-2836(05)80360-2 (1990).
Article CAS PubMed Google Scholar
Quinlan, A. R. & Hall, I. M. BEDTools: a flexible suite of utilities for comparing genomic features. Bioinformatics 26, 841–842, 10.1093/bioinformatics/btq033 (2010).
Article CAS PubMed PubMed Central Google Scholar
Li, H. & Durbin, R. Fast and accurate short read alignment with Burrows-Wheeler transform. Bioinformatics 25, 1754–1760, 10.1093/bioinformatics/btp324 (2009).
Article CAS PubMed PubMed Central Google Scholar
Li, H. et al. The Sequence Alignment/Map format and SAMtools. Bioinformatics 25, 2078–2079, 10.1093/bioinformatics/btp352 (2009).
Article CAS PubMed PubMed Central Google Scholar
McKenna, A. et al. The Genome Analysis Toolkit: a MapReduce framework for analyzing next-generation DNA sequencing data. Genome research 20, 1297–1303, 10.1101/gr.107524.110 (2010).
Article CAS PubMed PubMed Central Google Scholar

Download references

Acknowledgements

We thank Dr. Sam Telford, Dr. Laura Tonnetti and Dr. Tim Lepore for providing cryopreserved mouse and human blood infected with B. microti isolates for propagation in mice and hamsters. We thank Dr. Aprajita Garg for assistance with SNP validation. This work was supported in part with federal funds from the National Institute of Allergy and Infectious Diseases, National Institutes of Health, Department of Health and Human Services under contract number HHSN272200900009C (CMF, JCS, CBM, and PJK). CBM research was also supported by NIH grant AI116930 and AI1021571, and the Bill and Melinda Gates Foundation (OPP1086229 and OPP1069779) grants. EC is supported by the ANR (Investissements d’avenir/Bioinformatique): ANR-11-BINF-0002 (Institut de Biologie Computationnelle). PJK was supported in part for this work from a grant from the Gordon and Llura Gund Foundation.

Author information

Authors and Affiliations

Institute for Genome Sciences, University of Maryland School of Medicine, Baltimore, 21201, MD, USA
Joana C. Silva, Carrie McCracken, Olukemi O. Ifeonu, Jonathan Crabtree, Hanzel T. Gotia, Priti Kumari, Joshua Orvis, Kyle Tretina, Marcus Chibucos, Sandy Ott, Lisa Sadzewicz, Naomi Sengamalay, Amol C. Shetty, Qi Su, Luke Tallon & Claire M. Fraser
Department of Microbiology and Immunology, University of Maryland School of Medicine, Baltimore, 21201, MD, USA
Joana C. Silva & Marcus Chibucos
Institut de Biologie Computationnelle, IBC, Université de Montpellier, 860 rue St Priest, Bat 5 - CC05019, Montpellier, 34095, Cedex 5, France
Emmanuel Cornillot & Ankit Dwivedi
Institut de Recherche en Cancérologie de Montpellier, IRCM - INSERM U896 & Université de Montpellier & ICM, Institut régional du Cancer Montpellier, Campus Val d’Aurelle, Montpellier, 34298, France, Cedex 5
Emmanuel Cornillot, Ankit Dwivedi & Jacques Colinge
Department of Internal Medicine, Section of Infectious Diseases, Yale School of Medicine, 15 York St., New Haven, CT 06520, Connecticut, USA
Sahar Usmani-Brown, Azan Z. Virji, Vidya Kumar, Lauren Lawres & Choukri Ben Mamoun
Yale School of Public Health and Yale School of Medicine, 60 College St., New Haven, CT 06520, Connecticut, USA
Sahar Usmani-Brown, Jana Brancato & Peter J. Krause
Institut de Genomique Fonctionnelle, IGF - CNRS UMR 5203, 141 rue de la cardonille, Montpellier, 34094, Cedex 05, France
Christelle Reynes
Antigen Discovery Inc., Irvine, 92618, CA, USA
Joseph E. Pazzi, Jozelyn V. Pablo, Chris Hung & Douglas M. Molina
Université de Montpellier, IES, UMR 5214, 860 rue de St Priest, Bt5, Montpellier, 34095, France
Roger Frutos
CIRAD, UMR 17, Cirad-Ird, TA-A17/G, Campus International de Baillarguet, Montpellier, 34398, France
Roger Frutos

Authors

Joana C. Silva
View author publications
You can also search for this author in PubMed Google Scholar
Emmanuel Cornillot
View author publications
You can also search for this author in PubMed Google Scholar
Carrie McCracken
View author publications
You can also search for this author in PubMed Google Scholar
Sahar Usmani-Brown
View author publications
You can also search for this author in PubMed Google Scholar
Ankit Dwivedi
View author publications
You can also search for this author in PubMed Google Scholar
Olukemi O. Ifeonu
View author publications
You can also search for this author in PubMed Google Scholar
Jonathan Crabtree
View author publications
You can also search for this author in PubMed Google Scholar
Hanzel T. Gotia
View author publications
You can also search for this author in PubMed Google Scholar
Azan Z. Virji
View author publications
You can also search for this author in PubMed Google Scholar
Christelle Reynes
View author publications
You can also search for this author in PubMed Google Scholar
Jacques Colinge
View author publications
You can also search for this author in PubMed Google Scholar
Vidya Kumar
View author publications
You can also search for this author in PubMed Google Scholar
Lauren Lawres
View author publications
You can also search for this author in PubMed Google Scholar
Joseph E. Pazzi
View author publications
You can also search for this author in PubMed Google Scholar
Jozelyn V. Pablo
View author publications
You can also search for this author in PubMed Google Scholar
Chris Hung
View author publications
You can also search for this author in PubMed Google Scholar
Jana Brancato
View author publications
You can also search for this author in PubMed Google Scholar
Priti Kumari
View author publications
You can also search for this author in PubMed Google Scholar
Joshua Orvis
View author publications
You can also search for this author in PubMed Google Scholar
Kyle Tretina
View author publications
You can also search for this author in PubMed Google Scholar
Marcus Chibucos
View author publications
You can also search for this author in PubMed Google Scholar
Sandy Ott
View author publications
You can also search for this author in PubMed Google Scholar
Lisa Sadzewicz
View author publications
You can also search for this author in PubMed Google Scholar
Naomi Sengamalay
View author publications
You can also search for this author in PubMed Google Scholar
Amol C. Shetty
View author publications
You can also search for this author in PubMed Google Scholar
Qi Su
View author publications
You can also search for this author in PubMed Google Scholar
Luke Tallon
View author publications
You can also search for this author in PubMed Google Scholar
Claire M. Fraser
View author publications
You can also search for this author in PubMed Google Scholar
Roger Frutos
View author publications
You can also search for this author in PubMed Google Scholar
Douglas M. Molina
View author publications
You can also search for this author in PubMed Google Scholar
Peter J. Krause
View author publications
You can also search for this author in PubMed Google Scholar
Choukri Ben Mamoun
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

Conceived and designed the experiments: C.B.M., P.J.K., J.C.S. and C.M.F. Generated nucleic acid samples: S.U.-B. and E.C. Generated nucleic acid data: S.O., L.S., N.S., L.T., J.C.S., C.B.M. and P.J.K. Generated genome assemblies: Q.S., L.T. and J.C.S. Generated structural and functional annotation: E.C., C.M., H.T.G., J.O., O.I., M.C., K.T. and J.C.S. Generated informatics tools and figures: E.C., J.C., J.O., P.K., J.C.S., C.R. and A.Z.V. Analyzed sequence data: E.C., A.D., C.M., O.I., J.M., P.K., A.C.S., R.F. and J.C.S. Generated protein array, ran serological assays and performed immunoproteomic analyses: D.M.M., J.E.P., J.P. and C.H. Performed S.N.P. validation: L.L., J.B. and V.K. Contributed strains: P.J.K. Interpreted the data: E.C., J.C.S. and C.B.M. Wrote the manuscript: C.B.M., J.C.S., E.C. and P.J.K.

Ethics declarations

Competing interests

The authors declare no competing financial interests.

Electronic supplementary material

Supplementary Information

Supplementary Dataset 1

Supplementary Dataset 2

Rights and permissions

This work is licensed under a Creative Commons Attribution 4.0 International License. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in the credit line; if the material is not included under the Creative Commons license, users will need to obtain permission from the license holder to reproduce the material. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/

Reprints and permissions

About this article

Cite this article

Silva, J., Cornillot, E., McCracken, C. et al. Genome-wide diversity and gene expression profiling of Babesia microti isolates identify polymorphic genes that mediate host-pathogen interactions. Sci Rep 6, 35284 (2016). https://doi.org/10.1038/srep35284

Download citation

Received: 06 July 2016
Accepted: 26 September 2016
Published: 18 October 2016
DOI: https://doi.org/10.1038/srep35284

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.