Balamuthia mandrillaris, a pathogenic free-living amoeba, causes cutaneous skin lesions as well as granulomatous amoebic encephalitis, a ‘brain-eating’ disease. As with the other known pathogenic free-living amoebas (Naegleria fowleri and Acanthamoeba species), drug discovery efforts to combat Balamuthia infections of the central nervous system are sparse; few targets have been validated or characterized at the molecular level, and little is known about the biochemical pathways necessary for parasite survival. Current treatments of encephalitis due to B. mandrillaris lack efficacy, leading to case fatality rates above 90%. Using our recently published methodology to discover potential drugs against pathogenic amoebas, we screened a collection of 85 compounds with known antiparasitic activity and identified 59 compounds that impacted the growth of Balamuthia trophozoites at concentrations below 220 µM. Since there is no fully annotated genome or proteome of B. mandrillaris, we sequenced and assembled its transcriptome from a high-throughput RNA-sequencing (RNA-Seq) experiment and located the coding sequences of the genes potentially targeted by the growth inhibitors from our compound screens. We determined the sequence of 17 of these target genes and obtained expression clones for 15 that we validated by direct sequencing. These will be used in the future in combination with the identified hits in structure guided drug discovery campaigns to develop new approaches for the treatment of Balamuthia infections.
Balamuthia mandrillaris is a ubiquitous soil-dwelling amoeba that is the causative agent of granulomatous amoebic encephalitis (GAE)1,2,3,4. Similar to the other two major pathogenic free-living amoebas, Naegleria fowleri and Acanthamoeba castellanii, B. mandrillaris infections, though uncommon, have > 90% case fatality rate5. In the United States, 109 Balamuthia cases in both immunocompetent and immunocompromised individuals have been reported with at least twice that many cases worldwide, but these rates of GAE are likely to be underestimated due to historically poor diagnosis6. Nonetheless, awareness of potential cases has been on the rise7,8. In addition to diagnostic awareness, increasing rates of amoebic infections in northern regions of the United States could also be early indicators that recent emergence of these diseases might be associated with global warming6. In contrast to Naegleria infections that present and progress extremely rapidly after exposure, Balamuthia incubation times might be as long as several weeks to months and disease progression more subacute or chronic, increasing the opportunity for therapeutic intervention9. However, treatment options remain very limited and not very efficacious, leading to poor outcomes even with the correct diagnosis.
The recent development of an inexpensive and easily prepared media, as well as increasing interest in B. mandrillaris as a public health concern, has facilitated the development of robust high-throughput drug screening methods. Where low throughput methods restricted screening to ~ 10–20 drugs at a time, the new high-throughput methods allow rapid screening of hundreds to thousands of drugs simultaneously and direct comparisons of activity. In this study, we identify FDA approved drugs that could potentially be repurposed for therapy alone or in combination against B. mandrillaris. These drugs can also be used as leads for further structure-guided drug discovery (SGDD) exploration and in vivo efficacy studies.
Structure based drug discovery (SBDD) was originally devised in the mid 1980’s10. Advances in protein structure determination, less expensive and faster computer processing, and better prediction software have reduced the timeline to solve target structures and design specific and selective drugs10. SBDD has been mentioned in the amoeba literature as an attractive method to design selective enzymatic inhibitors that specifically target the parasite over the human host11, but only a small number of laboratories have actually applied this methodology to pathogenic free-living amoebas. Sterol biosynthesis has been the most attractive target since parasites utilize ergosterol over cholesterol for making cell plasma membranes with distinct host biosynthetic differences that could be selectively targeted12,13,14. Glucose metabolism is essential for parasitic cell viability. Milanes et al., recently targeted glucokinase in N. fowleri, and described NfGlck specific inhibitors with minimal activity against human glucokinase in recombinant enzymatic functional studies15. Other studies have looked at targeting histidine or shikimate essential amino acid biosynthetic pathways in Acanthamoeba species, which the hosts cannot synthesize de novo, as parasite specific targets for drug intervention16,17.
The development of new compounds against B. mandrillaris in particular has been hampered by the paucity of genomic information. Though draft genomes have been published, no structural and functional annotation is currently available18,19. This information is essential for the design of new drugs by SGDD/SBDD, as that methodology requires information about the molecular structure of the target protein. Once the protein coding sequences are annotated on the genome, rapid selection of multiple drug targets can be performed, for example by homology searches with known drug targets, thus paving the way for combinational therapy, a broadly established strategy to minimize the risk of drug resistance. This study presents the first proteome of B. mandrillaris reconstructed from RNA sequencing of logarithmic growing trophozoites, the infective form of the amoeba. Potential drug targets identified through phenotypic screening were selected specifically from the trophozoite transcriptome and PCR amplified. The clones were further validated by direct sequencing, providing the first step for recombinant expression and crystallization by the Seattle Structural Genomics Center for Infectious Disease (SSGCID) high-throughput gene-to-structure pipeline20.
We performed a drug susceptibility screen of 85 known anti-parasitic compounds against the trophozoite stage of B. mandrillaris and discovered that 59 of these compounds had 50% inhibitory concentration (IC50) efficacy at ≤ 220 µM concentration (Table 1). Our results indicated that only dequalinium chloride possessed nanomolar potency and that 43% of the currently recommended drugs by the CDC for treating Balamuthia infections appeared to be only moderately to slightly efficacious, with IC50 values ranging from 18.35 µM (pentamidine) to > 163.25 µM (fluconazole). Miltefosine, the newest drug addition to the amoebae chemotherapy cocktail, was inactive at the maximum screening concentration of 122.68 µM. Thus, all currently used therapeutics fall into the unacceptable activity range for today’s standards of hit to lead drug discovery and development. Macrolides (azithromycin, clarithromycin, roxithromycin, spiramycin A) have previously demonstrated potent activity against Naegleria or Acanthamoeba but did not show activity against Balamuthia at the concentrations we tested (IC50 values > 59 µM). Although we identified other ribosomal protein inhibitors for 50S (valnemulin, clindamycin, and solithromycin), 23S of the 50S [mirincamycin (cis- and trans-)], and the 16S ribosomal subunits (paromomycin) suggesting protein synthesis would be a useful target for drug therapy against Balamuthia infections. As pentamidine is currently used within the treatment regimen, we screened several anti-protozoal amidines, identifying hexamidine, octamidine and propamidine (IC50 values 4.46–6.5 µM) as being active against the amoeba. Although the exact mechanisms are still unknown, it is suggested that these compounds interfere with glyceraldehyde 3-phosphate dehydrogenase or interfere with nucleases. We further identified several 3-hydroxy-3-methylglutaryl-Coenzyme A reductase (HMGR) inhibitors (fluvastatin, atorvastatin, and simvastatin), with activity ranging from 1.18 to 3.03 µM.
Transcriptome sequencing, assembly and functional annotation
Transcriptome assembly and proteome prediction
The sequencing of RNA isolated from an axenic laboratory culture of B. mandrillaris trophozoites yielded 30,473,902 paired-end reads (2 × 75 bp). We combined three de-novo assemblies, obtained from different assemblers and multiple k-mers, with a genome-based assembly based on the existing B. mandrillaris genome LFUI0119 and then predicted protein coding sequences (CDSs) with EvidentialGene (EviGene)21 (Fig. 1).
Of the 37 K transcripts with predicted CDSs, just over half (53%) translated to complete proteins. The top 1000 longest complete proteins averaged 1550 ± 423 amino acids in length, a number indicative of assembly quality that is roughly comparable to the AmoebaDBv44 A. castellanii proteome (1688 ± 539)22,23. The EviGene “main” sequences, representing the haploid proteome, contained 14 K proteins; though only 63% translated to complete proteins, they contain 90% of complete eukaryotic Benchmarking Universal Single-Copy Orthologs (BUSCOs), a standard measure to quantify the accuracy and completeness of assemblies (Table 2)24. Table 2 shows comparable levels of completeness for the transcriptome and the unannotated genome (LFUI01), but the EviGene “main” proteins stand out for containing only 2 duplicates among the complete BUSCOs.
We compared the EviGene ‘main’ proteins to other species in the UniProt database with AAI-profiler; it retrieved A. castellanii strain Neff as the closest sequenced proteome with 29% of matched fraction (Fig. 2A). Note that the AAI-profiler does partial sampling as it relies on SANSparallel, a fast homology search that is only as sensitive as BLAST above ca. 50% sequence identity25. We confirmed this result with a BLASTP search of the EviGene ‘main’ proteins against A. castellanii: it returned hits for 65% of the Balamuthia sequences, with an average identity of 44%, but matched 29% with at least 50% identity.
To gain further insight on how closely the two proteomes are related, we performed orthologous cluster analysis with Dictyostelium discoideum as the outgroup. Results show that 38% of the Balamuthia proteins cluster with Acanthamoeba, of which 21% are shared between the three species (Fig. 2B). This result and the high proportion of singletons (48%) highlights the divergence of Balamuthia from Acanthamoeba.
To place the Balamuthia proteome in an evolutionary context, we constructed a neighbor-joining tree from an alignment-free comparison of complete proteomes from selected Amoebas, with the non-Amoebozoa Naegleria as outgroup (Fig. 2C). As detected by AAI-profiler, the Discosea genera Balamuthia and Acanthamoeba are in a separate group from the Variosea genus Planoprotostelium and the Eumycetozoa Dictyostelids Cavenderia, Polysphondylium, Tieghemostelium and Dictyostelium. The Evosea genus Entamoeba is in a separate branch from the other Amoebozoa in the tree.
Functional annotation of the draft proteome
To characterize the proteome further and expand the pool of potential targets, we conducted preliminary functional annotations of the draft proteome dataset. Functional annotation of the EviGene “main” protein sequences with PANNZER2, one of the top-10 rapid methods in the CAFA2 NK-full benchmark, provided 26% of the sequences with a description and 25% with a lower level GO molecular function term. A plot of high-level GO terms compared with those obtained for A. castellanii and D. discoideum, one of the most thoroughly annotated amoebas in UniProt, shows a similar profile for the three species, with differences limited to smaller gene families representing less than 1% of the genes (Fig. 3). In terms of potential targets for SBDD, the Balamuthia GO annotations classified 284 (2%) of proteins as having kinase activity, of which over half (163) were classified as protein kinases, but further analysis is needed to target the kinome with accuracy26.
Target identification and validation
For this study, putative protein kinase targets from the phenotypic screens were left out, leaving a total of 25 potential targets (out of the original list of 52), to which we added 6 potential drug targets requested by the amoeba community via the gene-to-structure service portal of the SSGCID website. From this list of 31 protein names, 19 could be assigned to specific human protein sequences. A total of 14 Balamuthia sequences for 13 targets (there are 2 copies of topoisomerase II) were identified from a BLASTP search with the human sequences: 12 from the phenotypic screens, and 2 community targets. Average pairwise identity was 49% with 77% coverage. Another three of the community targets were not detected by BLASTP searches of the Balamuthia proteome using the human sequences, therefore Acanthamoeba sequences were used instead. This yielded a total of 17 Balamuthia sequences that were entered into the SSGCID gene-to-structure pipeline. Truncations around putative catalytic domains were designed for 9 of the 17 sequences to increase crystallization likelihood, leading to 23 constructs as cloning candidates. PCR amplification produced clones for 18 constructs. Direct sequencing was successful for 15 of these and sequence comparison with the “main” proteins from the EviGene assembly showed excellent matches with over 99% average amino acid identity, corresponding to 2 amino acid variations on average per sequence, and 100% coverage for all but the two largest proteins (84% coverage and 100% identity for the 1068 amino-acid long Exportin-1, 81% coverage and 99% identity for the 784-residue primase and C-term domains of topoisomerase II).
The identity between the 15 validated protein sequences and their closest A. castellanii homologue ranged between 56 to 88%, with 3 notable exceptions: exportin-1 (21% identity), lanosterol 14-alpha demethylase (CYP51A) (28% identity) and glucokinase (51% identity). In the case of exportin-1, a multiple sequence alignment indicated that the Balamuthia protein was over 50% identical to the N. fowleri and Planoprotostelium fungivorum proteins, suggesting potential mis-assembly in the A. castellanii genome. Similarly, the Acanthamoeba glucokinase sequence appears to have a large deletion of over 30 residues compared to the Balamuthia and Naegleria sequences. This region corresponds to a double-stranded beta-sheet that lies in the glucose binding pocket in the Naegleria structure, and we would expect it to be conserved in Acanthamoeba (Fig. 4)15.
Of the 13 targets that were also found in humans, five shared over 55% sequence identity overall to their human counterpart and might potentially have similar active sites: S-adenosyl-homocysteinase (SAHH), 3-hydroxy-3-methylglutaryl-coenzyme A reductase (HMGR), heat-shock protein 90-alpha (HSP90ɑ), histone deacetylase 1 (HDAC1) and exportin-1 (XPO1) (Table 3). As a consequence, SBDD for these targets will likely require exploration of potential alternate binding sites that are specific to the Balamuthia protein. We would expect selectivity to be more readily achievable for the other targets, with the topoisomerase ATPases as borderline cases. One promising example of a Balamuthia target that can be selectively targeted is the GARTFase domain of trifunctional purine biosynthetic protein adenosine-3 (GART). Balamuthia GARTFase has a low sequence identity to the human enzyme (37%) and has a different domain arrangement than in human GART. Whereas GARTFase is the C-terminal domain of human GART, it is the middle domain in Balamuthia (Fig. 5). This domain arrangement, confirmed by direct sequencing and conserved in Acanthamoeba, leads us to postulate that targeting double domains in GART may offer a promising avenue to develop drugs against those pathogenic amoebas.
Based on previously determined in vitro activity and the few surviving cases of Balamuthia GAE infections, the Centers for Disease Control and Prevention (CDC) recommends that the drug cocktail regimen for treating disease include a combination of pentamidine, sulfadiazine, flucytosine, fluconazole, azithromycin or clarithromycin, and miltefosine6.
We thus proceeded to test these compounds first in order to confirm inhibition and assess if they have sufficient potency (< 10 µM) for hit-to-lead phenotypic drug screening. According to our screens, none of the recommended compounds passed this criterion. We therefore expanded our drug screening to other compounds that previously displayed anti-Acanthamoeba, anti-Naegleria, or anti-malarial activity, starting with the macrolides.
Our screening results consistently indicate that the compounds belonging to the macrolide drug class (azithromycin, clarithromycin, roxithromycin, and spiramycin) are inactive, in agreement with previous results28. Interestingly, solithromycin, a known ketolide antibiotic against macrolide-resistant Streptococcal species29, appeared to show moderate activity against B. mandrillaris (29.66 µm). We found that polyene antimycotics such as amphotericin B and natamycin, that target ergosterol within fungal cell membranes30, also were also inactive against B. mandrillaris. We then tested the azole compounds. CDC studies reported that fluconazole was inactive at concentrations lower than 10 µg/mL31. We were able to confirm the fluconazole result; however, other antifungal azoles (ketoconazole, tioconazole, difenoconazole, itraconazole, posaconazole, clotrimazole, climbazole, and sulconazole) displayed better, though still moderate, activity (29.89–81.51 µm). We tested flucytosine and miltefosine and both displayed moderate to poor activity against B. mandrillaris with an IC50 of 86.34 and > 122 µm, respectively. Flucytosine was previously described to inhibit 61% Balamuthia cytopathogenicity at 10 µg/mL (77 µm). Combined with our own results, this suggests flucytosine and miltefosine are equipotent32. Although miltefosine has been reported to have moderate activity at concentrations of 63–100 µm32,33, another study34 agrees with our data that miltefosine is less potent with a higher IC50 value > 122 µm. As for pentamidine, our results are consistent with prior studies that obtained comparable IC50 values between 9 and 29 µm31,35,36.
Thus, of the combination drug cocktail recommended by the CDC, only pentamidine and flucytosine appear to have in vitro activity against B. mandrillaris. It is possible that the recommended drug therapy is active in combination, and not when tested in isolation as in this study. It is also possible that the drugs are biologically activated in vivo or are only active against an in vivo form that we have not assayed. We therefore cannot rule out activity for the recommended drugs based solely on our in vitro sensitivity screens. Known inhibitors of 50S and 16S ribosomal proteins, such as macrolides and lincosamides, showed no or very poor activity in our screens, but we can only speculate that translation in Balamuthia evolved independently of prokaryotic endosymbiosis. Though Parachlamydia acanthamoebae was shown to infect B. mandrillaris37, we could find no evidence of P. acanthamoebae 50S ribosomal proteins in our de-novo transcriptome assembly.
We previously identified statins, which target 3-hydroxy-3-methylglutaryl-coenzyme A reductase (HMGR), as active compounds against B. mandrillaris36. As part of this study, we tested three additional statins: fluvastatin, atorvastatin and simvastatin and observed that they were active against B. mandrillaris at 1.18–3.03 µM concentration. Simvastatin and fluvastatin in particular have been shown to have better brain penetration compared to other statins38, indicating potential for drug repurposing. These results not only validate this drug class as promising lead molecules for the potential treatment of B. mandrillaris GAE, but also HMGR as a priority target of interest for future studies.
From a drug-repurposing standpoint, the compounds described in this study yielded a plethora of potentially useful drugs that act on B. mandrillaris. Available chemical inference data associated only few of the compounds with a specific protein target. Given our low success rate of structure determination in parasites, we increased the pool of potential protein targets by including results from our previous drug discovery studies. These include screening the MMV Malaria and Pathogen boxes that identified 11 compounds with equipotency to nitroxolone (8-Hydroxy-5-nitroquinoline) IC50 of 2.84 µM32, indicating these molecules could be used as an initial starting point for medicinal chemistry structure activity relationship (SAR) studies. More recently, we identified 63 compounds through screening the Calibr ReFRAME drug repurposing library, with activities ranging from 40 nM to 4.8 µM36.
Chemical inference from our previous drug screening efforts and the results from this study identified a total of 52 potential protein targets in B. mandrillaris. After excluding kinases, our target list was reduced to 25 proteins. Of these, 15 were identified solely from this study and the remainder was identified in several of our screening efforts. These 25 protein targets and an additional 6 community targets of interest were submitted for structural determination to the Seattle Structural Genomics Center for Infectious Diseases (SSGCID) gene-to-structure pipeline.
The first step of the SSGCID pipeline involves cloning of the B. mandrillaris sequences encoding the protein targets identified in the screens. However, lack of annotation of the Balamuthia genome hampered these efforts: although we were able to locate some sequences using BLAST searches of the Balamuthia genome using the human and Acanthamoeba homologues, PCR amplification from B. mandrillaris cDNA (or gDNA) was not successful. Therefore, transcriptome sequencing of Balamuthia was performed.
We reconstructed a draft B. mandrillaris proteome from a transcriptome that we obtained by combining 4 different assemblies from a single RNA-seq experiment. According to the BUSCO standard benchmark, our set of Evigene ‘main’ proteins is as complete (90%) as the reference genome, but with only a fraction of duplicates. We have thus based our subsequent analysis on this set as a representative of the haploid proteome of B. mandrillaris trophozoites. From our search of UniProt complete proteomes, A. castellanii stood out as the closest species with no other close relative, making the Acanthamoeba and Balamuthia proteomes the sole representatives of the phylum Discosea. As a third of the B. mandrillaris proteins appeared to be truncated, our phylogenetic analysis is based on an alignment-free method suitable for computing evolutionary distances between complete or incomplete proteomes39. The evolutionary tree placed the Discosea in a separate group as expected, highlighting their degree of divergence to the other pathogenic amoeba N. fowleri. Although they belong to the same phylum, Acanthamoeba and Balamuthia appear to be distantly related according to our alignment-based analyses-less than half of the proteins clustered together, and just above a quarter shared over 50% sequence identity.
We obtained functional annotation for above a quarter of the proteins using a conservative method. Annotation with the less stringent blast2go still leaves 60% of sequences with no annotation, a 20% higher proportion than the Acanthamoeba proteome. Much work remains to be done to characterize the complete proteome. Nonetheless, validation through direct sequencing showed that for our chosen targets at least, our assembly is accurate. In addition, alignment of the validated sequences to the genome revealed frameshifts in the reference assembly, which was further supported by analyzing the genome annotation we obtained with GENSAS40. Our attempts to correct the genome by re-assembling the publicly available PACBIO reads with modern assemblers and correcting exons with the transcripts were unsuccessful. Future efforts might succeed once the 454 Illumina reads used in the published genomic assembly will be made accessible.
The majority of the protein targets that we identified in Balamuthia had closely related sequence homologues in Acanthamoeba, a likely indication that they perform essential function in both organisms. From the high level of sequence identity, we would expect similar pattern of inhibition with the drugs tested in this report. However, our preliminary studies indicate that this is not the case. For example, the heat shock protein HSP90 is highly conserved in Balamuthia and Acanthamoeba (78% pairwise sequence identity), and yet we measured up to 5.56-fold differences in IC50 values in response to the same drug. Another example is 3-hydroxy-3-methylglutaryl-CoA reductase; the sequence identity is still high at 70%, but when we compared responses to nine different statins, we observed large differences ranging from > 2.51 to > 112-fold differences. The final example is lanosterol 14-alpha demethylase which displayed the least pairwise identity to A. castellanii protein sequence with 28.5% similarity. B. mandrillaris was found to be less susceptible to the 11 different azoles tested with 0.79 to 646-fold difference compared to the response of A. castellanii. The implication for drug discovery is that we cannot assume that orthologous sequences in Balamuthia and Acanthamoeba will be close enough to permit simultaneous targeting of these two pathogenic amoebas.
Through drug susceptibility screening with known antiparasitic compounds against B. mandrillaris, we identified protein targets with potential for treating Balamuthia granulomatous amoebic encephalitis. The reconstruction and annotation of the draft proteome from RNA-seq allowed us to amplify, clone and validate the B. mandrillaris targets by direct sequencing.
The haploid proteome appears distantly related to that of its closest relative A. castellanii, and thus provides an essential resource for further drug discovery and biological investigation. Even in cases where the target sequences are conserved, our preliminary results indicate that potent inhibitors against Acanthamoeba or Naegleria failed to inhibit Balamuthia growth, suggesting that the quest for a broad-spectrum drug against all three pathogenic amoebas might prove elusive. Drugs will need to be specifically developed for each amoeba.
This study illustrates how the combination of phenotypic drug screening and a single RNA-seq experiment with short reads are enabling structure-based drug design against a eukaryotic pathogen with no prior proteome information. As there are currently no genetic tools available for B. mandrillaris, the results presented here will enable future studies to validate specific drug targets. Validation strategies may include co-crystallization of protein and inhibitors; development of drug resistant B. mandrillaris clones followed by whole genome sequencing; specifically targeting genes through RNAi (which has been used successfully in other free-living amoebae) or, in the era of CRISPR/Cas9-mediated gene editing, the generation of conditional knock outs, would be some promising ways of target validation for future studies within this pathogen.
Materials and methods
Maintenance of Balamuthia mandrillaris
The pathogenic B. mandrillaris (CDC:V039; ATCC 50209), a GAE isolate, isolated from a pregnant Baboon at the San Diego Zoo in 1986 was donated by Luis Fernando Lares-Jiménez ITSON University, Mexico35. Trophozoites were routinely grown axenically in BMI media at 37 °C, 5% CO2 in vented 75 cm2 tissue culture flasks (Olympus), until the cells were 80–90% confluent. For sub-culturing, 0.25% Trypsin–EDTA (Gibco) cell detachment reagent was used to detach the cells from the culture flasks. The cells were collected by centrifugation at 4000 rpm at 4 °C. Complete BMI media is produced by the addition of 10% fetal bovine serum and 125 μg of penicillin/streptomycin antibiotics. All experiments were performed using logarithmic phase trophozoites.
We previously developed and standardized robust high-throughput screening methods for the discovery of active compounds against B. mandrillaris trophozoites35. The trophocidal activity of compounds were assessed using the CellTiter-Glo 2.0 luminescent cell viability assay (Promega, Madison, WI). In brief, B. mandrillaris trophozoites cultured in BMI-complete media were seeded at 16,000 cells/well into 96-well plates (Thermo Fisher 136102) with various compounds diluted in twofold serial dilutions to determine the 50% inhibitory concentration (IC50). The highest percentage of DMSO diluted in the highest screening drug concentration was 1%. Control wells were supplemented with 1% DMSO or 12.5 μM of chlorhexidine, as negative and positive controls, respectively. All assays were incubated at 37 °C for 72 h. At the end time point, 25 μL of CellTiter-Glo reagent was added to all wells. The plates were shaken using an orbital shaker at 300 rpm at room temperature for 2 min to induce cell lysis. After shaking, the plates were equilibrated at room temperature for 10 min to stabilize the luminescent signal. The ATP luminescent signal (relative light units; RLUs) were measured at 490 nm by using a SpectraMax i3X (Molecular Devices, Sunnyvale, CA). Drug inhibitory concentration (IC50) curves were generated using total ATP RLUs where controls were calculated as the average of replicates using the Levenberg–Marquardt algorithm, using DMSO as the normalization control, as defined in CDD Vault (Burlingame, CA, USA). Values reported are from a minimum of two biological replicates with standard error of the mean.
Selection of target genes
The protein names for verified potential targets were retrieved through Calibr at Scripps Research (https://reframedb.org/). The corresponding human protein sequences were downloaded from UniProt and queried against the B. mandrillaris assemblies using BLAST sequence similarity searches. Candidate targets were confirmed by comparing their protein sequences with closest sequence homologues in Acanthamoeba and Naegleria species and checking the B. mandrillaris functional annotation, where available. ORFs were selected for cloning from the Trinity de-novo assembly. Manual correction of putative start sites from multiple sequence alignments was performed with Geneious Prime 2019.1.1 (https://www.geneious.com).
RNA extraction, library preparation and sequencing
Balamuthia mandrillaris were cultured and harvested as described above; the cells were counted and adjusted to 2 million cells for each extraction. Total RNA isolated using the RNA extraction kit (Agilent) as per manufacturing instructions. In brief, to the pellet of Balamuthia cells, 350 μL of lysis buffer and 2.5 μL of β-mercaptoethanol were added and homogenized. This was transferred into a prefilter spin cup and centrifuged at maximum speed, 14,000×g, for 5 min. The filtrate was retained and an equal volume of 70% ethanol was added to the filtrate and vortexed until the filtrate and ethanol were mixed thoroughly. This mixture was then transferred into an RNA binding spin cup and receptacle tube and centrifuged at maximum speed for 1 min. The filtrate was discarded and 600 μL of 1 × low salt buffer was added and centrifuged at maximum speed for 1 min. The filtrate was removed and centrifuged at maximum speed for 2 min. DNase solution was added and incubated for 15 min at 37 °C. After incubation 600 μL of 1 × high salt buffer (contains guanidine thiocyanate) was added and centrifuged at maximum speed for 1 min. The filtrate was discarded and 300 μL of 1 × low salt buffer was added and centrifuged at maximum speed for 2 min. 100 μL of elution buffer was added and incubated at room temperature for 2 min. Final elution was into a sterile 1.5 mL microcentrifuge tube at maximum speed for 1 min.
Extracted RNA was stored at − 80 °C until further required. The integrity and purity of the RNA was assessed via RT-PCR and gel electrophoresis on a 2% agarose gel. The concentration was determined by measuring 280 nM absorbance on a nanodrop (Nanodrop 1000, Thermo Scientific).
RNA quality was reassessed after a freeze thaw cycle using the Bioanalyzer RNA 6000 pico chip (Agilent, 5067-1513) and quantity was assessed using the Qubit RNA Broad Range Assay (Invitrogen, Q10210). The mRNA was isolated using the NEB Poly(A) mRNA Magnetic Isolation Module (NEB, E7490S) and prepared using a version of the Stranded RNA-seq protocol that was modified for Leishmania41,42. Only the negative stranded RNA-seq library preparation portion was performed. Library quantity and quality was assessed using the Qubit dsDNA High Sensitivity Assay (Invitrogen, Q32851), Bioanalyzer High Sensitivity DNA Chip (Agilent, 5067-4627) and the KAPA library quantification kit (Roche, KK4824). Libraries were sequenced on the Illumina Hiseq 4000, yielding 2 × 75 bp paired end reads.
Transcripts assembly and annotation
Reads were quality filtered with Trimmomatic and assembled de-novo with Trinity v2.8 (k-mer = 25) and Spades v3.13 (k-mer = 29 and 33) after clipping of the adaptor sequences43,44,45. Further, quality-filtered reads were aligned to the published B. mandrillaris genome LFUI01 with STAR v2.6 and assembled with Trinity46. The three assemblies thus obtained were combined with EvidentialGene v19jan01 (EviGene) with BUSCO homology scores as input for the classifier21. Throughout the analysis, BUSCO v3 analysis was performed on either the European or Australian Galaxy mirrors47,48. Assemblies were screened for vector and common eukaryotic contaminants with the EvidentialGene utilities asmrna_vecscreen and asmrna_trimvec. Functional descriptions and gene ontology (GO) annotations of the EviGene ‘main’ proteins were predicted with PANNZER249. GO annotations that were highest ranked by PANNZER2 were visualized with WEGO 2.050.
Comparison to other species and phylogenetic analysis
Proteome comparisons to other species in the UniProt database were obtained from the AAI-profiler server51. Cluster analysis and Venn diagrams of orthologous clusters were generated with OrthoVenn252 (e-value: 1e−5, inflation value: 1.5). Unless otherwise specified, all BLAST searches were performed with BLAST + v2.8.1 and an expectation value of 0.00153. Pairwise distances for alignment-free phylogeny reconstruction were calculated with Prot-SpaM39. Input sequences included the Balamuthia EviGene ‘main’ proteins (this study), AmoebaDB A. castellanii strain Neff, N. fowleri ATCC 30894 and N. lovaniensis Braker1 predicted proteins54,55, and UniProt complete reference proteomes (C. fasciculata UP000007797, N. gruberi UP000006671, P. pallidum UP000001396, D. discoideum UP000002195, P. fungivorum UP000241769 and T. lacteum UP000076078). Phylogenetic relationships were inferred by constructing a neighbor-joining tree from the word match-based Prot-SpaM distance matrix using MEGA X56,57.
PCR and sequence validation
All B. mandrillaris constructs were cloned, expressed, and purified using SSGCID established protocols58,59. The genes selected were PCR-amplified using cDNA template and purchased primers (Integrated DNA Technologies, Inc., Coralville, IA) (Supplementary Table S1). The amplicons were extracted, purified and cloned into a ligation-independent cloning pET-14b derived, N-terminal His tag expression vector pBG1861 with a T7 promoter60. The cloned inserts were then transformed into purchased GC-5 cells (Genesee Scientific, El Cajon, CA) for ORF incorporation. Plasmid DNA was purified from the subsequent colonies and further transformed in chemically competent E. coli BL-21(DE3)R3 Rosetta cells with a chloramphenicol restriction.
Each B. mandrillaris construct was sequenced from both 5′- and 3′-ends with a custom forward primer (5′-GCGTCCGGCGTAGAGGATC-3′, 40nt upstream from the T7 promoter customary forward primer) and the T7 terminator reverse primer (5′-GCTAGTTATTGCTCAGCGG-3′) at GeneWiz (South Plainfield, NJ). The reads were assembled and matched to the expected sequences with the phrap assembler and cross_match61. Translations of the longest ORF in all six frames of the consensus sequence (or the forward read if unassembled) were then aligned using MUSCLE62 with the SSGCID target protein sequence to determine the best translated protein sequence and its alignment, percent identity and percent coverage. Manual examination of the sequences and alignments was performed in Geneious.
Illumina raw reads have been deposited at the National Center for Biotechnology Information (NCBI) BioProject repository with the accession number SRR12006108 under project PRJNA638697. This Transcriptome Shotgun Assembly project has been deposited at DDBJ/EMBL/GenBank under the accession GISS00000000. The version described in this paper is the first version, GISS01000000. The annotated protein sequences from the EviGene ‘main’ assembly are available on NIH Figshare under https://doi.org/10.35092/yhjc.12478733.v1. All data that are associated with the drug susceptibility study are archived using the database from Collaborative Drug Discovery (CDD; http://www.collaborativedrug.com/). The CDD database accommodates both compound chemistry data and results from phenotypic or target activity, cytotoxicity screening, and computed properties. The CDD database is becoming an established standard for the sharing of data within this community, and we are eager to facilitate the distribution of our results in a similar manner.
Visvesvara, G. S., Schuster, F. L. & Martinez, A. J. Balamuthia mandrillaris, N. G., N. Sp., agent of amebic meningoencephalitis in humans and other animals. J. Eukaryot. Microbiol. 40, 504–514 (1993).
Schuster, F. L. et al. Environmental isolation of Balamuthia mandrillaris associated with a case of amebic encephalitis. J. Clin. Microbiol. 41, 3175 (2003).
Cabello-Vílchez, A. M. et al. The isolation of Balamuthia mandrillaris from environmental sources from Peru. Parasitol. Res. 113, 2509–2513 (2014).
Niyyati, M., Karamati, S. A., Lorenzo Morales, J. & Lasjerdi, Z. Isolation of Balamuthia mandrillaris from soil samples in North-Western Iran. Parasitol. Res. 115, 541–545 (2016).
Gompf, S. G. & Garcia, C. Lethal encounters: The evolving spectrum of amoebic meningoencephalitis. IDCases 15, e00524 (2019).
Cope, J. R. et al. The epidemiology and clinical features of Balamuthia mandrillaris disease in the United States, 1974–2016. Clin. Infect. Dis. 68, 1815–1822 (2019).
Shehab, K. W., Aboul-Nasr, K. & Elliott, S. P. Balamuthia mandrillaris granulomatous amebic encephalitis with renal dissemination in a previously healthy child: Case report and review of the pediatric literature. J. Pediatr. Infect. Dis. Soc. 7, e163–e168 (2018).
Yang, Y., Hu, X., Min, L., Dong, X. & Guan, Y. Balamuthia mandrillaris-related primary amoebic encephalitis in China diagnosed by next generation sequencing and a review of the literature. Lab. Med. https://doi.org/10.1093/labmed/lmz079 (2019).
Pritzker, A. S., Kim, B. K., Agrawal, D., Southern, P. M. & Pandya, A. G. Fatal granulomatous amebic encephalitis caused by Balamuthia mandrillaris presenting as a skin lesion. J. Am. Acad. Dermatol. 50, S38-41 (2004).
Anderson, A. C. The process of structure-based drug design. Chem. Biol. 10, 787–797 (2003).
Byington, C. L., Dunbrack, R. L., Whitby, F. G., Cohen, F. E. & Agabian, N. Entamoeba histolytica: Computer-assisted modeling of phosphofructokinase for the prediction of broad-spectrum antiparasitic agents. Exp. Parasitol. 87, 194–202 (1997).
Thomson, S. et al. Characterisation of sterol biosynthesis and validation of 14α-demethylase as a drug target in Acanthamoeba. Sci. Rep. 7, 8247 (2017).
Debnath, A. et al. CYP51 is an essential drug target for the treatment of primary amoebic meningoencephalitis (PAM). PLoS Negl. Trop. Dis. 11, e0006104 (2017).
Kidane, M. E. et al. Sterol methyltransferase a target for anti-amoeba therapy: Towards transition state analog and suicide substrate drug design. J. Lipid Res. 58, 2310–2323 (2017).
Milanes, J. E. et al. Enzymatic and structural characterization of the Naegleria fowleri glucokinase. Antimicrob. Agents Chemother. 63, e02410-18 (2019).
Rice, C. A. et al. Structural and functional studies of histidine biosynthesis in Acanthamoeba spp. demonstrates a novel molecular arrangement and target for antimicrobials. PLoS ONE 13, e0198827 (2018).
Henriquez, F. L. et al. The Acanthamoeba shikimate pathway has a unique molecular arrangement and is essential for aromatic amino acid biosynthesis. Protist 166, 93–105 (2015).
Greninger, A. L. et al. Clinical metagenomic identification of Balamuthia mandrillaris encephalitis and assembly of the draft genome: the continuing case for reference genome sequencing. Genome Med. 7, 113 (2015).
Detering, H. et al. First draft genome sequence of Balamuthia mandrillaris, the causative agent of amoebic encephalitis. Genome Announc. 3, e01013-15 (2015).
Myler, P. J. et al. The Seattle Structural Genomics Center for infectious disease (SSGCID). Infect. Disord. Drug Targets 9, 493–506 (2009).
Gilbert, D. G. Genes of the pig, Sus scrofa, reconstructed with EvidentialGene. PeerJ 7, e6374 (2019).
Clarke, M. et al. Genome of Acanthamoeba castellanii highlights extensive lateral gene transfer and early evolution of tyrosine kinase signaling. Genome Biol. 14, R11 (2013).
Shabardina, V., Kischka, T., Kmita, H., Suzuki, Y. & Makałowski, W. Environmental adaptation of Acanthamoeba castellanii and Entamoeba histolytica at genome level as seen by comparative genomic analysis. Int. J. Biol. Sci. 14, 306–320 (2018).
Waterhouse, R. M. et al. BUSCO applications from quality assessments to gene prediction and phylogenomics. Mol. Biol. Evol. https://doi.org/10.1093/molbev/msx319 (2017).
Somervuo, P. & Holm, L. SANSparallel: Interactive homology search against Uniprot. Nucleic Acids Res. 43, W24-29 (2015).
Stroehlein, A. J., Young, N. D. & Gasser, R. B. Improved strategy for the curation and classification of kinases, with broad applicability to other eukaryotic protein groups. Sci. Rep. 8, 6808 (2018).
Di Tommaso, P. et al. T-Coffee: A web server for the multiple sequence alignment of protein and RNA sequences using structural information and homology extension. Nucleic Acids Res. 39, W13-17 (2011).
Schuster, F. L. & Visvesvara, G. S. Efficacy of novel antimicrobials against clinical isolates of opportunistic amebas. J. Eukaryot. Microbiol. 45, 612–618 (1998).
McGhee, P. et al. In vitro activity of CEM-101 against Streptococcus pneumoniae and Streptococcus pyogenes with defined macrolide resistance mechanisms. Antimicrob. Agents Chemother. 54, 230–238 (2010).
Hamilton-Miller, J. M. Chemistry and biology of the polyene macrolide antibiotics. Bacteriol. Rev. 37, 166–196 (1973).
Schuster, F. L. & Visvesvara, G. S. Axenic growth and drug sensitivity studies of Balamuthia mandrillaris, an agent of amebic meningoencephalitis in humans and other animals. J. Clin. Microbiol. 34, 385–388 (1996).
Laurie, M. T. et al. Functional assessment of 2,177 U.S. and international drugs identifies the quinoline nitroxoline as a potent amoebicidal agent against the pathogen Balamuthia mandrillaris. MBio 9, e02051-18 (2018).
Schuster, F. L., Guglielmo, B. J. & Visvesvara, G. S. In-vitro activity of miltefosine and voriconazole on clinical isolates of free-living amebas: Balamuthia mandrillaris, Acanthamoeba spp., and Naegleria fowleri. J. Eukaryot. Microbiol. 53, 121–126 (2006).
Ahmad, A. F., Heaselgrave, W., Andrew, P. W. & Kilvington, S. The in vitro efficacy of antimicrobial agents against the pathogenic free-living amoeba Balamuthia mandrillaris. J. Eukaryot. Microbiol. 60, 539–543 (2013).
Rice, C. A., Lares-Jiménez, L. F., Lares-Villa, F. & Kyle, D. E. In vitro screening of the open source MMV Malaria and pathogen boxes to discover novel compounds with activity against Balamuthia mandrillaris. Antimicrob. Agents Chemother. https://doi.org/10.1128/AAC.02233-19 (2020).
Rice, C. A., Colon, B. L., Chen, E., Hull, M. V. & Kyle, D. E. Discovery of repurposing drug candidates for the treatment of diseases caused by pathogenic free-living amoebae. PLoS Negl. Trop. Dis. 14, e0008353 (2020).
Michel, R., Müller, K. D. & Hoffmann, R. Enlarged Chlamydia-like organisms as spontaneous infection of Acanthamoeba castellanii. Parasitol. Res. 87, 248–251 (2001).
Sierra, S. et al. Statins as neuroprotectants: A comparative in vitro study of lipophilicity, blood–brain-barrier penetration, lowering of brain cholesterol, and decrease of neuron cell death. J. Alzheimers Dis. 23, 307–318 (2011).
Leimeister, C.-A. et al. Prot-SpaM: Fast alignment-free phylogeny reconstruction based on whole-proteome sequences. Gigascience 8, giy148 (2019).
Humann, J. L., Lee, T., Ficklin, S. & Main, D. Structural and functional annotation of eukaryotic genomes with GenSAS. Methods Mol. Biol. 1962, 29–51 (2019).
Hunt, A. G. A rapid, simple, and inexpensive method for the preparation of strand-specific RNA-Seq libraries. Methods Mol. Biol. 1255, 195–207 (2015).
Myler, P. J., McDonald, J. A., Alcolea, P. J. & Sur, A. Quantitative RNA analysis using RNA-Seq. Methods Mol. Biol. 1971, 95–108 (2019).
Bolger, A. M., Lohse, M. & Usadel, B. Trimmomatic: A flexible trimmer for Illumina sequence data. Bioinformatics 30, 2114–2120 (2014).
Haas, B. J. et al. De novo transcript sequence reconstruction from RNA-seq using the Trinity platform for reference generation and analysis. Nat. Protoc. 8, 1494–1512 (2013).
Bushmanova, E., Antipov, D., Lapidus, A. & Prjibelski, A. D. rnaSPAdes: A de novo transcriptome assembler and its application to RNA-Seq data. Gigascience 8, giz100 (2019).
Dobin, A. et al. STAR: Ultrafast universal RNA-seq aligner. Bioinformatics 29, 15–21 (2013).
Seppey, M., Manni, M. & Zdobnov, E. M. BUSCO: Assessing genome assembly and annotation completeness. Methods Mol. Biol. 1962, 227–245 (2019).
Afgan, E. et al. Genomics virtual laboratory: A practical bioinformatics workbench for the cloud. PLoS One 10, e0140829 (2015).
Törönen, P., Medlar, A. & Holm, L. PANNZER2: A rapid functional annotation web server. Nucleic Acids Res. 46, W84–W88 (2018).
Ye, J. et al. WEGO 2.0: a web tool for analyzing and plotting GO annotations, 2018 update. Nucleic Acids Res. 46, W71–W75 (2018).
Medlar, A. J., Törönen, P. & Holm, L. AAI-profiler: Fast proteome-wide exploratory analysis reveals taxonomic identity, misclassification and contamination. Nucleic Acids Res. 46, W479–W485 (2018).
Xu, L. et al. OrthoVenn2: A web server for whole-genome comparison and annotation of orthologous clusters across multiple species. Nucleic Acids Res. 47, W52–W58 (2019).
Camacho, C. et al. BLAST+: Architecture and applications. BMC Bioinform. 10, 421 (2009).
Liechti, N., Schürch, N., Bruggmann, R. & Wittwer, M. Nanopore sequencing improves the draft genome of the human pathogenic amoeba Naegleria fowleri. Sci. Rep. 9, 16040 (2019).
Liechti, N., Schürch, N., Bruggmann, R. & Wittwer, M. The genome of Naegleria lovaniensis, the basis for a comparative approach to unravel pathogenicity factors of the human pathogenic amoeba N. fowleri. BMC Genom. 19, 654 (2018).
Kumar, S., Stecher, G., Li, M., Knyaz, C. & Tamura, K. MEGA X: Molecular evolutionary genetics analysis across computing platforms. Mol. Biol. Evol. 35, 1547–1549 (2018).
Stecher, G., Tamura, K. & Kumar, S. Molecular evolutionary genetics analysis (MEGA) for macOS. Mol. Biol. Evol. https://doi.org/10.1093/molbev/msz312 (2020).
Stacy, R. et al. Structural genomics of infectious disease drug targets: The SSGCID. Acta Crystallogr. Sect. F. Struct. Biol. Cryst. Commun. 67, 979–984 (2011).
Aslanidis, C. & de Jong, P. J. Ligation-independent cloning of PCR products (LIC-PCR). Nucleic Acids Res. 18, 6069–6074 (1990).
Studier, F. W. Protein production by auto-induction in high density shaking cultures. Protein Expr. Purif. 41, 207–234 (2005).
Ewing, B. & Green, P. Base-calling of automated sequencer traces using phred. II. Error probabilities. Genome Res. 8, 186–194 (1998).
Edgar, R. C. MUSCLE: Multiple sequence alignment with high accuracy and high throughput. Nucleic Acids Res. 32, 1792–1797 (2004).
We thank Drs. Luis Fernando Lares-Jiménez & Fernando Lares-Villa (Instituto Tecnológico de Sonora, Ciudad Obregón, Sonora, Mexico) for the pathogenic isolate of B. mandrillaris used in this study, and Aakash Sur for insightful discussions. The authors acknowledge the support of the Freiburg Galaxy Team led by Prof. Rolf Backofen, Bioinformatics, University of Freiburg, Germany funded by Collaborative Research Centre 992 Medical Epigenetics (DFG grant SFB 992/1 2012) and German Federal Ministry of Education and Research (BMBF grant 031 A538A de.NBI-RBC). Rooksana Noorai was supported by an Institutional Development Award (IDeA) from the National Institute of General Medical Sciences of the National Institutes of Health under Grant number P20GM109094. This project has been funded in part with Federal funds from the National Institute of Allergy and Infectious Diseases, National Institutes of Health, Department of Health and Human Services, under Contract No.: HHSN272201700059C and the Georgia Research Alliance (GRA) also supported this work.
W.C. Van Voorhis is a co-owner of ParaTheraTech, Inc, a small biotech that aims to market compounds for the therapy of parasitic diseases in Animal Health. All other authors declare no competing interests.
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Phan, I.Q., Rice, C.A., Craig, J. et al. The transcriptome of Balamuthia mandrillaris trophozoites for structure-guided drug design. Sci Rep 11, 21664 (2021). https://doi.org/10.1038/s41598-021-99903-8
By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.