Large-scale computational drug repositioning to find treatments for rare diseases

Govindaraj, Rajiv Gandhi; Naderi, Misagh; Singha, Manali; Lemoine, Jeffrey; Brylinski, Michal

doi:10.1038/s41540-018-0050-7

Technology Feature
Open access
Published: 13 March 2018

npj Systems Biology and Applications volume 4, Article number: 13 (2018)

Abstract

Rare, or orphan, diseases are conditions afflicting a small subset of people in a population. Although these disorders collectively pose significant health care problems, drug companies require government incentives to develop drugs for rare diseases due to extremely limited individual markets. Computer-aided drug repositioning, i.e., finding new indications for existing drugs, is a cheaper and faster alternative to traditional drug discovery, offering a promising avenue for orphan drug research. Structure-based matching of drug-binding pockets is among the most promising computational techniques to inform drug repositioning. In order to find new targets for known drugs ultimately leading to drug repositioning, we recently developed eMatchSite, a new computer program to compare drug-binding sites. In this study, eMatchSite is combined with virtual screening to systematically explore opportunities to reposition known drugs to proteins associated with rare diseases. The effectiveness of this integrated approach is demonstrated for a kinase inhibitor, which is a confirmed candidate for repositioning to synapsin Ia. The resulting dataset comprises 31,142 putative drug-target complexes linked to 980 orphan diseases. The modeling accuracy is evaluated against the structural data recently released for tyrosine-protein kinase HCK. To illustrate how potential therapeutics for rare diseases can be identified, we discuss the possibility of repurposing a steroidal aromatase inhibitor to treat Niemann-Pick disease type C. Overall, the exhaustive exploration of the drug repositioning space exposes new opportunities to combat orphan diseases with existing drugs. DrugBank/Orphanet repositioning data are freely available to the research community at https://osf.io/qdjup/.

Introduction

Repositioning drugs to treat conditions for which they were not originally intended is an emerging strategy offering a faster and cheaper route to develop new treatments compared to traditional drug discovery.¹ Since repurposed molecules not only have optimized pharmacokinetics, pharmacodynamics, and toxicity profiles, but are also already approved by the U.S. Food and Drug Administration (FDA), this approach speeds up the evaluation of drug candidates in clinical trials at a reduced risk of failure. Drug repositioning is expected to play a major role in the development of treatments for rare, or orphan, diseases, defined as those disorders afflicting <200,000 patients in the United States. Even though rare diseases collectively affect more than 350 million people worldwide (https://globalgenes.org/rare-diseases-facts-statistics/), developing new therapeutics for their small individual markets is not profitable enough to warrant commercial interest.² On that account, many countries passed orphan drug legislation, such as the Orphan Drug Act of 1983 in the U.S., in order to provide financial inducements in the form of market exclusivity and reduced development costs. Legislators work on the Orphan Product Extensions Now Accelerating Cures and Treatments (OPEN) Act to extend the market exclusivity for repurposing already approved drugs to treat rare diseases,³ signifying the importance of drug repositioning to orphan disease research.

It is noteworthy that most repurposed drugs are the result of serendipitous observations made either in the lab or during clinical tests. Sildenafil is perhaps the most recognized example of a repositioned compound. Originally developed to treat hypertension and angina pectoris in the 1980s, it was later repurposed to erectile dysfunction and pulmonary arterial hypertension.⁴ Other examples are amantadine and memantine. The former was introduced in the 1960s as a prophylactic agent in respiratory infections.⁵ A few years later, a patient with Parkinson’s disease experienced a dramatic improvement in her symptoms during the daily administration of amantadine for influenza prophylaxis.⁶ This anecdotal observation stimulated research on using amantadine and other members of the aminoadamantane class of molecules to treat neurological diseases. Indeed, amantadine is presently approved by the FDA as both an antiviral and an antiparkinsonian drug. Structurally similar to amantadine, memantine was also synthesized in the 1960s as a putative hypoglycemic agent, though it was found to be devoid of such activity. It was later discovered that memantine is an uncompetitive antagonist of glutamatergic N-methyl-D-aspartate (NMDA) receptors⁷ and currently, memantine is used to treat moderate to severe Alzheimer-type dementia.⁸

A clear necessity for rational approaches to find alternative indications for existing therapeutics has stimulated the development of computational methods for drug repositioning.⁹ Many currently available algorithms exploit the fact that proteins with similar pockets tend to have similar functions and recognize similar molecules.¹⁰ For instance, the sequence-order independent profile-profile alignment (SOIPPA) program employs Delaunay tessellation of Cα atoms and geometric potentials to compare binding pockets.¹¹ Further, SiteAlign measures distances between druggable pockets with cavity fingerprints constructed by projecting eight topological and physicochemical properties onto a multidimensional, discretized space.¹² Both SOIPPA and SiteAlign have been used in drug repurposing. For example, SOIPPA helped reveal new targets for entacapone and tolcapone,¹³ whereas SiteAlign detected the cross-reaction of protein kinase inhibitors with a protein regulating neurotransmitter release in the synapse.¹⁴

Notwithstanding the success of existing methods to recognize similar pockets, many of these algorithms perform well only against the experimental structures of proteins complexed with small molecules. Utilizing datasets of target structures with predicted binding sites poses a formidable challenge for pocket matching programs because of inevitable inaccuracies in the annotation of binding residues. To alleviate this issue, we recently developed eMatchSite, which offers a high tolerance to residue misannotations and, to some extent, structure imperfections in ligand-binding regions.¹⁵,¹⁶ In this communication, we combine eMatchSite and structure-based virtual screening (VS) with AutoDock Vina¹⁷ in order to enhance the accuracy of binding site matching. Subsequently, we demonstrate the effectiveness of eMatchSite/VS for a kinase inhibitor, which is a confirmed candidate for repositioning to synapsin Ia. Next, this methodology is employed to explore new opportunities to combat orphan conditions through a large-scale repositioning of existing drugs to proteins linked to rare diseases.¹⁸ The results are discussed with respect to the structural data recently released in the Protein Data Bank (PDB)¹⁹ for tyrosine-protein kinase HCK, as well as the possibility of repurposing a steroidal aromatase inhibitor to treat Niemann-Pick disease type C. Overall, the protocol combining protein structure modeling, binding site prediction and matching, and structure-based virtual screening holds significant promise to systematically explore the drug repositioning space at the systems level.

Results and discussion

Integrating binding site matching with virtual screening

Although the primary application of VS is to identify potentially bioactive molecules, it can also be used to indirectly measure the similarity between binding sites.²⁰ Specifically, VS is conducted against a pair of target pockets and then the statistical dependence between the two rankings of library compounds is evaluated by Spearman’s ρ rank correlation coefficient. A high positive Spearman’s ρ indicates that two binding sites are chemically similar, i.e., tend to bind similar compounds (Supplementary Text S1). Here, we employ structure-based VS with Vina to increase the accuracy with which eMatchSite detects similar binding sites. Since many proteins associated with rare diseases are yet to be experimentally annotated, repositioning drugs to orphan proteins is generally dependent on the accuracy of pocket matching conducted against computationally predicted binding sites. On that account, we run both eMatchSite and Vina on pockets predicted by eFindSite for ligand-bound and unbound structures in the Huang dataset.²¹
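The rank-correlation step described above can be illustrated with a minimal sketch. The docking scores below are hypothetical and the function names are ours; the actual protocol is given in Supplementary Text S1, so this is only an illustration of Spearman’s ρ computed on two score lists for the same compound library.

```python
from math import sqrt


def average_ranks(values):
    """Return 1-based ranks; tied values share the mean of their positions."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    start = 0
    while start < len(order):
        end = start
        # Extend the block while the next value ties with the current one.
        while end + 1 < len(order) and values[order[end + 1]] == values[order[start]]:
            end += 1
        mean_rank = (start + end) / 2.0 + 1.0
        for k in range(start, end + 1):
            ranks[order[k]] = mean_rank
        start = end + 1
    return ranks


def pearson(x, y):
    """Pearson correlation coefficient of two equally long sequences."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    var_x = sum((a - mx) ** 2 for a in x)
    var_y = sum((b - my) ** 2 for b in y)
    return cov / sqrt(var_x * var_y)


def spearman_rho(scores_a, scores_b):
    """Spearman's rho is the Pearson correlation of the rank vectors."""
    return pearson(average_ranks(scores_a), average_ranks(scores_b))


# Hypothetical docking scores (kcal/mol) of the same five library compounds
# against two pockets; lower scores rank first.
pocket_a = [-9.1, -7.4, -8.2, -6.0, -5.5]
pocket_b = [-8.8, -7.9, -8.0, -5.1, -6.3]
rho = spearman_rho(pocket_a, pocket_b)
print(f"Spearman's rho = {rho:.2f}")
```

In this toy case only the two lowest-ranked compounds swap places between the pockets, so ρ remains high.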

The performance is assessed in Fig. 1 with the Boltzmann-Enhanced Discrimination of Receiver Operating Characteristic (BEDROC, Supplementary Text S2) devised to statistically evaluate the early recognition capabilities of binary classifiers.²² Using the bound structures, the median BEDROC score for eMatchSite alone is 0.66, which increases to 0.77 when it is combined with VS. As expected, the performance for unbound structures is somewhat lower compared to bound structures; however, including VS brings about similar improvements. The median BEDROC for unbound structures increases from 0.60 for eMatchSite to 0.71 for eMatchSite/VS. For comparison, the performance of a random classifier is notably lower, with median BEDROC values of 0.18 for bound and 0.21 for unbound structures. Overall, combining pocket matching with methodologically orthogonal structure-based VS is an effective strategy to increase the accuracy of detecting pockets binding similar ligands regardless of the conformational state of target proteins.
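For readers unfamiliar with BEDROC, the following is a minimal sketch of the commonly used formulation, written as a rescaled robust initial enhancement (RIE). The weighting parameter alpha = 20 is an assumption on our part; the value used in this study is specified in Supplementary Text S2, not here.

```python
from math import exp


def bedroc(is_active_ranked, alpha=20.0):
    """BEDROC for a best-first ranked list of booleans (True = active).

    alpha controls how strongly early ranks are weighted; 20.0 is an
    assumed default, not a value taken from this article.
    """
    n_total = len(is_active_ranked)
    n_active = sum(is_active_ranked)
    if n_active == 0 or n_active == n_total:
        raise ValueError("need at least one active and one inactive item")
    ratio = n_active / n_total
    # Exponentially weighted sum over the 1-based ranks of the actives.
    weighted = sum(
        exp(-alpha * rank / n_total)
        for rank, active in enumerate(is_active_ranked, start=1)
        if active
    )
    # Expected weight of a single active under a uniformly random ranking.
    random_mean = (1.0 - exp(-alpha)) / (n_total * (exp(alpha / n_total) - 1.0))
    rie = weighted / (n_active * random_mean)
    # RIE of the best and worst possible rankings, used to rescale to [0, 1].
    rie_max = (1.0 - exp(-alpha * ratio)) / (ratio * (1.0 - exp(-alpha)))
    rie_min = (1.0 - exp(alpha * ratio)) / (ratio * (1.0 - exp(alpha)))
    return (rie - rie_min) / (rie_max - rie_min)


# Toy rankings of 20 pocket pairs, 3 of which bind similar ligands.
best_case = [True] * 3 + [False] * 17
worst_case = [False] * 17 + [True] * 3
mixed_case = [True, False, True] + [False] * 10 + [True] + [False] * 6
for name, ranking in [("best", best_case), ("mixed", mixed_case), ("worst", worst_case)]:
    print(name, round(bedroc(ranking), 3))
```

By construction the score is 1 when all similar pairs are ranked first and 0 when they are all ranked last, which is why it is suited to assessing early recognition.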

Example of a validated candidate for repositioning

The capability of eMatchSite to correctly recognize similar ligand-binding sites in primary and off-targets is demonstrated for a confirmed candidate for repositioning. Here, we conduct pocket matching followed by VS against weakly homologous protein models constructed by eThread with binding sites predicted by eFindSite. This procedure is essentially the same as that employed to reposition existing drugs to proteins linked to rare diseases.¹⁸ The example shown in Fig. 2 is the cross-reaction of staurosporine, a pan-kinase inhibitor,²³ with synapsin Ia, an ATP-binding protein regulating neurotransmitter release in the synapse.¹⁴ A weakly homologous model of the primary target for staurosporine, the human proto-oncogene serine/threonine-protein kinase Pim-1, was built based on the structure of the murine AMP-activated protein kinase (PDB-ID: 5ufu, chain A, 31.3% sequence identity to Pim-1).²⁴ This model has a Template Modeling (TM)-score²⁵ of 0.86 and a Cα-root-mean-square deviation (RMSD)²⁶ of 5.47 Å against the crystal structure of Pim-1 (PDB-ID: 1yhs, chain A),²⁷ with a Matthews correlation coefficient (MCC)²⁸ between predicted and staurosporine-binding residues of 0.67. The TM-score and Cα-RMSD are described in the Supplementary Text S3. The model of synapsin Ia constructed based on a remote template α-aminoadipate-LysW ligase LysX (PDB-ID: 3vpd, chain A, 22.8% sequence identity to synapsin Ia)²⁹ has a TM-score of 0.67 and a Cα-RMSD of 9.03 Å against the crystal structure of synapsin Ia (PDB-ID: 1aux, chain A).³⁰
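The MCC quoted above compares predicted binding residues with those observed in a complex structure. A minimal sketch of that calculation follows; the residue numbers and chain length are hypothetical and serve only to show how the four confusion-matrix counts enter the formula.

```python
from math import sqrt


def binding_residue_mcc(predicted, observed, n_residues):
    """MCC between predicted and observed binding residues.

    predicted, observed: sets of residue numbers; n_residues: chain length.
    """
    tp = len(predicted & observed)   # predicted and truly binding
    fp = len(predicted - observed)   # predicted but not binding
    fn = len(observed - predicted)   # binding but missed
    tn = n_residues - tp - fp - fn   # correctly left out
    denom = sqrt((tp + fp) * (tp + fn) * (tn + fp) * (tn + fn))
    # By convention return 0 when any marginal count is zero.
    return (tp * tn - fp * fn) / denom if denom else 0.0


# Hypothetical 100-residue protein with six predicted and six observed residues.
predicted_site = {10, 11, 12, 13, 14, 20}
observed_site = {10, 11, 12, 13, 30, 31}
mcc = binding_residue_mcc(predicted_site, observed_site, n_residues=100)
print(f"MCC = {mcc:.2f}")
```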

Although Pim-1 and synapsin Ia have globally different sequences (a sequence identity of 15.5%) and structures (a TM-score of 0.32), eMatchSite predicted that pockets in both models are in fact highly similar with an eMS-score (Supplementary Text S4) of 0.97. Figure 2a presents the local superposition of binding sites in Pim-1 (purple ribbons and spheres) and synapsin Ia (gold ribbons and spheres) models resulting in a Cα-RMSD of 2.47 Å over 13 aligned residues. In addition to staurosporine repositioned to synapsin Ia (solid gold sticks), Fig. 2a includes an ATP-γS molecule bound to the active site of synapsin Ia (transparent teal sticks). Encouragingly, staurosporine transferred according to the local Pim-1→synapsin Ia alignment adopts an orientation closely resembling those of typical ATP-competitive inhibitors bound to protein kinases.³¹ Further, Spearman’s ρ calculated for ranks assigned by Vina is as high as 0.86 (Fig. 2b) strongly indicating that these pockets are in fact chemically similar. Indeed, competition experiments of staurosporine against ATP-γS confirmed its nanomolar binding to synapsin Ia¹⁴ corroborating the model constructed by eMatchSite.

Repositioning of DrugBank compounds to Orphanet proteins

The repositioning procedure was developed in our previous study¹⁸ and it is summarized in Fig. 3. Full-chain structures of proteins from DrugBank³² and Orphanet (Fig. 3a) are modeled with eThread³³ (Fig. 3b) followed by the annotation of ligand-binding sites with eFindSite³⁴ (Fig. 3c). Next, drug-target complexes are constructed for DrugBank proteins with a two-step similarity-based docking procedure employing Fr-TM-align³⁵ and KCOMBU³⁶ (Fig. 3d). This protocol generates drug-bound structures for DrugBank and unbound structures for Orphanet proteins (Fig. 3e). Subsequently, all DrugBank pockets are compared against all Orphanet pockets with eMatchSite¹⁵,¹⁶ and drugs are transferred from DrugBank to Orphanet proteins for significant matches (Fig. 3f). Finally, Orphanet drug-target complexes are refined with Modeller³⁷ (Fig. 3g) and subjected to quality assessment with Distance-scaled Finite Ideal-gas REference (DFIRE)³⁸ and VS¹⁷ (Fig. 3h).

All-against-all pocket matching conducted with eMatchSite for DrugBank and Orphanet proteins produced 320,856 binding site alignments, 5.6% of which yield a statistically significant eMS-score. It is noteworthy that the average TM-score between matched DrugBank and Orphanet targets is as low as 0.27 ± 0.10, indicating that in the majority of cases, existing drugs are repositioned from proteins having globally unrelated structures. Based on 18,145 confident local alignments reported by eMatchSite, 31,142 unique putative complexes between DrugBank compounds and Orphanet proteins have been modeled. An analysis of the DrugBank→Orphanet repositioning data reveals that 381 existing drugs could be repurposed to target as many as 761 Orphanet proteins. These proteins link to 980 orphan diseases representing 32 classes, the ten most common of which are 923 genetic, 428 neurological, 377 inborn errors of metabolism, 266 developmental anomalies during embryogenesis, 170 eye, 117 skin, 102 bone, 93 neoplastic, 92 endocrine, and 85 hematological disorders.

Repositioning multiple drugs through a single alignment

Drug repositioning conducted in this study includes two kinds of special cases. Figure 4 illustrates the first situation, in which complexes between multiple drugs (Fig. 4a) and an Orphanet target (Fig. 4b) are modeled based on a single pocket alignment. Employing this approach generates a series of structure models of drugs transferred from a DrugBank target to the binding site of an Orphanet protein (Fig. 4c). For instance, catechol O-methyltransferase (COMT) produces a significant local alignment with guanine nucleotide-binding protein subunit alpha-11 (GNA11), associated with a rare disease, autosomal dominant hypocalcemia (ADH) or hypoparathyroidism³⁹ (ORPHA:428, GARD:2877). This condition is characterized by low levels of calcium in the blood and an imbalance of other molecules, such as phosphate and magnesium, leading to a variety of symptoms, although about half of affected individuals have no associated health problems.⁴⁰ ADH is primarily caused by mutations of a gene encoding the calcium-sensing receptor; however, activating mutations in GNA11 have also been reported.⁴¹,⁴² A binding site predicted in GNA11 by eFindSite aligns well to a pocket binding tolcapone and entacapone in COMT with an eMS-score of 0.97 and a Cα-RMSD of 4.5 Å calculated over 14 aligned binding residues. Based on this single alignment, tolcapone and entacapone, COMT inhibitors used as adjuncts to levodopa/carbidopa medication in the treatment of Parkinson’s disease,⁴³,⁴⁴ could be repositioned to GNA11. Figure 4d shows the putative binding poses of both compounds in the binding pocket of GNA11 modeled based on the local COMT→GNA11 alignment reported by eMatchSite. Interaction energies with GNA11 reported by DFIRE for tolcapone and entacapone are −355.7 and −311.7, respectively. For comparison, the interaction energies with COMT are −283.7 for tolcapone and −310.9 for entacapone. Overall, these results indicate that both molecules may favorably bind to GNA11, producing stable, low-energy assemblies.

Construction of multiple models of a single complex

The second special case is the modeling of a single complex based on multiple pocket alignments. More than one structure model of a drug repositioned to the Orphanet protein can be constructed if this drug has multiple targets in DrugBank producing significant pocket alignments with the Orphanet protein. This procedure is illustrated in Fig. 5. Figure 5a shows three DrugBank targets binding the same compound, colored in blue, orange and yellow. Assuming that pockets for this drug in all three proteins align to a binding site in an Orphanet target colored in green (Fig. 5b), three independent structure models can be constructed (Fig. 5c). An example is ponatinib, a novel inhibitor of Bcr-Abl tyrosine kinase developed to treat chronic myeloid leukemia and Philadelphia chromosome-positive acute lymphoblastic leukemia.⁴⁵ Ponatinib is a multi-targeted compound, which in addition to its primary target, Abelson tyrosine-protein kinase 1, binds to 14 other macromolecules according to DrugBank.³² Binding sites of three of these proteins, Lck/Yes-related novel protein tyrosine kinase (LYN), lymphocyte cell-specific protein-tyrosine kinase (LSK), and proto-oncogene tyrosine-protein kinase Src (SRC), produce significant local alignments with a drug-binding pocket predicted in Ras-related protein Rab-23 (RAB23). The corresponding eMS-score/Cα-RMSD values reported by eMatchSite for these alignments are 0.97/3.8, 0.98/3.7, and 0.98/3.8 Å, respectively. According to Orphanet, RAB23 is associated with Carpenter syndrome⁴⁶,⁴⁷ (ORPHA:65759, GARD:6003), a very rare disease with approximately 40 cases described in the literature.⁴⁸ The repositioning of ponatinib to RAB23 can, therefore, be carried out through kinases LSK, LYN, and SRC, resulting in three independent models of a ponatinib-RAB23 complex structure. Figure 5d shows that the binding poses of ponatinib in the RAB23 pocket are very similar across these models. The heavy-atom RMSD between ponatinib molecules is 2.1 Å for LSK- and LYN-based models, 0.7 Å for LSK-based and SRC-based models, and 2.3 Å for LYN-based and SRC-based models, with similar drug-protein interactions present in all models (Supplementary Fig. S1). The interaction energies between ponatinib and RAB23 reported by DFIRE for LSK-, LYN-, and SRC-based models are −829.7, −727.7, and −723.6, respectively. These values are even lower than those calculated for the parent complexes of ponatinib and LSK (−587.9), LYN (−571.9), and SRC (−586.9), suggesting that ponatinib may form favorable interactions with the binding residues of RAB23.

Multiple structure models of the same complex of a drug repositioned to the Orphanet target can be used to estimate the confidence of the large-scale modeling reported in this study. Specifically, employing different DrugBank proteins to transfer the same drug to the Orphanet target should, in principle, produce similar complex models. To test this assumption, we selected 4878 drugs repositioned to Orphanet targets by matching binding sites of multiple DrugBank proteins. Supplementary Fig. S2 shows that up to 20 different models can be constructed for some drugs; however, two and three models are generated for the majority of cases (52.4 and 21.9%, respectively). Next, we identified the most typical binding pose of each drug in the pocket of an Orphanet protein by calculating a ligand heavy-atom RMSD against all other models of the same drug-target complex. The distribution of these RMSD values across 4878 DrugBank drugs repositioned to Orphanet targets is shown as an inset in Supplementary Fig. S2. Encouragingly, the RMSD for most compounds is relatively low, with a median value of 3.6 Å. One should keep in mind that these complex structures are constructed from the computer-generated models of target proteins with computationally predicted ligand-binding sites, and drug molecules are transferred according to fully sequence order-independent pocket alignments.
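The selection of a representative pose can be sketched as follows. We assume here that the most typical pose is the one with the lowest mean heavy-atom RMSD to all other models and that the RMSD is computed without refitting, since all models share the frame of the Orphanet protein; both are illustrative assumptions, and the coordinates are made up.

```python
from math import sqrt


def pose_rmsd(pose_a, pose_b):
    """Heavy-atom RMSD between two poses listed in the same atom order."""
    if len(pose_a) != len(pose_b):
        raise ValueError("poses must contain the same atoms")
    squared = sum(
        (xa - xb) ** 2 + (ya - yb) ** 2 + (za - zb) ** 2
        for (xa, ya, za), (xb, yb, zb) in zip(pose_a, pose_b)
    )
    return sqrt(squared / len(pose_a))


def most_typical_pose(poses):
    """Return (index, mean RMSD to the others) of the most central pose."""
    if len(poses) < 2:
        raise ValueError("at least two models are required")
    best_index, best_mean = -1, float("inf")
    for i, pose in enumerate(poses):
        others = [pose_rmsd(pose, other) for j, other in enumerate(poses) if j != i]
        mean_rmsd = sum(others) / len(others)
        if mean_rmsd < best_mean:
            best_index, best_mean = i, mean_rmsd
    return best_index, best_mean


# Three hypothetical two-atom poses displaced along the y-axis (in Angstrom).
models = [
    [(0.0, 0.0, 0.0), (1.0, 0.0, 0.0)],
    [(0.0, 1.0, 0.0), (1.0, 1.0, 0.0)],
    [(0.0, 3.0, 0.0), (1.0, 3.0, 0.0)],
]
index, mean_rmsd = most_typical_pose(models)
print(f"representative model: {index}, mean RMSD = {mean_rmsd:.1f}")
```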

Binding affinity prediction for repositioned drugs

We also evaluate the binding affinity of drugs repositioned to Orphanet proteins in comparison with their complexes with primary targets from DrugBank. Figure 6 shows the relation between interaction energies estimated by DFIRE for DrugBank and Orphanet complexes. The DFIRE statistical potential is described in the Supplementary Text S5. Because a single drug-target complex from DrugBank can be used to reposition the bound drug molecule to multiple Orphanet proteins, mean scores and the corresponding standard errors of the mean are plotted on the y-axis. Encouragingly, DFIRE energies calculated for DrugBank and Orphanet complexes involving the same drug are highly correlated with a Pearson correlation coefficient of 0.86. This analysis indicates that the interaction strength of drug molecules repositioned to Orphanet proteins is generally comparable to that calculated for their complexes with primary targets. Therefore, those pairs of DrugBank and Orphanet proteins producing statistically significant pocket alignments also share similarities with respect to ligand binding as independently evaluated with knowledge-based statistical potentials.
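The aggregation behind such a comparison can be sketched in a few lines: energies of the same drug in several Orphanet pockets are reduced to a mean and a standard error, and the means are then correlated with the energies of the parent DrugBank complexes. All numbers below are hypothetical and the helper names are ours.

```python
from math import sqrt


def mean_and_sem(values):
    """Mean and standard error of the mean (sample standard deviation / sqrt(n))."""
    n = len(values)
    mean = sum(values) / n
    if n < 2:
        return mean, 0.0  # a single value carries no spread estimate
    variance = sum((v - mean) ** 2 for v in values) / (n - 1)
    return mean, sqrt(variance / n)


def pearson(x, y):
    """Pearson correlation coefficient of two equally long sequences."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    return cov / sqrt(sum((a - mx) ** 2 for a in x) * sum((b - my) ** 2 for b in y))


# Hypothetical energies: parent DrugBank complex -> same drug in Orphanet pockets.
records = {
    "drug_A": (-300.0, [-310.0, -330.0]),
    "drug_B": (-450.0, [-440.0, -480.0, -460.0]),
    "drug_C": (-600.0, [-640.0]),
}
parent_energies = [parent for parent, _ in records.values()]
orphanet_means = [mean_and_sem(values)[0] for _, values in records.values()]
r = pearson(parent_energies, orphanet_means)
print(f"Pearson r = {r:.3f}")
```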

Validation against a recently determined X-ray structure

Repositioning prediction by eMatchSite is further validated against a complex structure released in the PDB several months after the modeling was completed. Figure 7 shows ibrutinib (DrugBank-ID: DB09053), an anti-cancer drug primarily targeting B-cell malignancies,⁴⁹ predicted to bind to proto-oncogene, Src family tyrosine kinase Blk (UniProt-ID: P51451). According to Orphanet, Blk is linked to maturity-onset diabetes of the young (MODY, ORPHA:552, GARD:3697)⁵⁰ caused by mutations in at least 13 genes, 5 of which are placed within 100 kb corresponding to the Blk gene.⁵¹ Nonetheless, a reassessment study showed that Blk mutations, A71T in particular, are unlikely to cause highly penetrant MODY and may weakly influence type 2 diabetes risk in the context of obesity.⁵² More recently, it was discovered that malignant T cells in the majority of patients with cutaneous T-cell lymphoma (CTCL) display the ectopic expression of Blk.⁵³ Since Blk functions as an oncogene promoting the proliferation of malignant T cells, it is a potential therapeutic target in CTCL.⁵⁴

Although the full-length experimental structure of Blk is unavailable, a confident model of Blk, whose estimated Global Distance Test (GDT)-score⁵⁵ (Supplementary Text S2) is 0.72, was constructed by eThread based on proto-oncogene tyrosine-protein kinase Src (PDB-ID: 1y57, chain A, 64% sequence identity to Blk).⁵⁶ Further, the binding site annotated in the Blk model by eFindSite with a 99.2% confidence (Supplementary Text S6) was matched to the ibrutinib-binding pocket in tyrosine-protein kinase BTK with a high eMS-score of 0.99. In October 2017, tyrosine-protein kinase HCK co-crystallized with a 7-substituted pyrrolo-pyrimidine inhibitor, OOS (PDB-ID: 5h0e, chain A), sharing 69.7% sequence identity with Blk, was released in the PDB.⁵⁷ Figure 7a shows that ibrutinib and OOS have very similar chemical structures with a Tanimoto coefficient⁵⁸ (TC, Supplementary Text S7) of 0.61 and 27 common atoms.
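As a reminder of how a Tanimoto coefficient is obtained, the sketch below computes it for two fingerprints represented as sets of on-bits. The bit sets are hypothetical, and the fingerprint type actually used in this study is defined in Supplementary Text S7, so the snippet illustrates the formula only.

```python
def tanimoto(bits_a, bits_b):
    """Tanimoto coefficient of two fingerprints given as sets of on-bits."""
    union = len(bits_a | bits_b)
    # Two empty fingerprints share nothing; 0.0 is returned by convention here.
    return len(bits_a & bits_b) / union if union else 0.0


# Hypothetical fingerprints of two compounds.
compound_1 = {1, 2, 3, 5, 8}
compound_2 = {2, 3, 5, 13}
tc = tanimoto(compound_1, compound_2)
print(f"TC = {tc:.2f}")
```

Here three bits are shared out of six set in either fingerprint, giving a TC of 0.5.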

The global superposition of the modeled ibrutinib-Blk and experimental OOS-HCK structures is presented in Fig. 7b. The Blk model (purple ribbons) has a globally correct structure with a TM-score of 0.86 and a Cα-RMSD of 2.25 Å calculated against HCK (gold ribbons) over the kinase domain. Further, binding residues were accurately predicted by eFindSite in the Blk model (purple spheres) with an MCC of 0.60 against OOS-binding residues in the HCK complex structure. Encouragingly, the binding pose of ibrutinib repositioned to Blk based on the local BTK→Blk alignment closely resembles the conformation of OOS in HCK. The RMSD calculated over equivalent non-hydrogen atoms of these compounds is 2.57 Å and 1.43 Å upon the superposition of target proteins and ligands, respectively. Despite the fact that matching binding sites in a sequence-order independent manner is a challenging task, the modeled ibrutinib-Blk complex is noticeably similar to the experimental OOS-HCK structure recently released in the PDB.

Niemann-Pick disease, type C and exemestane

Niemann-Pick disease, type C (NPC, ORPHA:646) is a fatal hereditary disorder characterized by the accumulation of low-density, lipoprotein-derived cholesterol in lysosomes causing hepatosplenomegaly and severe progressive neurological dysfunction. Mutations in either of two lysosomal proteins, Niemann-Pick disease types C1 (NPC1) or C2 (NPC2), interrupt sterol transport from late endosomes and lysosomes to other cellular organelles resulting in cholesterol accumulation in lysosomes and the fatal NPC disease.⁵⁹ As many as 22 mutations in NPC2 are associated with orphan NPC diseases, including adult, juvenile, late infantile, and severe early infantile neurologic onset. In particular, V30M, V39M, C47F, S67P, C93F, C99R, and P120S mutations in NPC2 have an effect on cholesterol binding.⁶⁰⁻⁶⁴ Furthermore, mutations of M79, V81, and V83 block sterol transport, making NPC2 a promising drug target to treat NPC diseases.⁶⁵ Interestingly, eMatchSite detected a significant structural similarity between the cholesterol-binding pocket of NPC2 and the steroid-binding pocket of cytochrome P450 aromatase (CYP19A1), an enzyme involved in the biosynthesis of aromatic C18 estrogen from C19 androgen. CYP19A1 is a target for exemestane, an oral steroidal aromatase inhibitor approved by the FDA for the treatment of breast cancer in postmenopausal patients.⁶⁶

The full-length model of CYP19A1 was generated by eThread from the crystal structure of an N-terminal-truncated recombinant human CYP19A1 (PDB-ID: 4kq8, chain A, 100.0% sequence identity with a coverage of 89.9%).⁶⁷ Subsequently, exemestane was placed in the steroid-binding pocket of CYP19A1 based on its global structure alignment with the X-ray structure of human placental CYP19A1 (PDB-ID: 3s79, chain A, TM-score of 0.89 and Cα-RMSD 0.55 Å) bound to androstenedione,⁶⁸ another steroidal inhibitor with a TC to exemestane of 0.95. Although the experimental structure of CYP19A1 bound to exemestane is available (PDB-ID: 3s7s, chain A),⁶⁸ it is not included in the template library used to model DrugBank complexes. Because redundancy in the library was removed at 80% protein sequence identity³³ and a TC of 0.9 for ligand chemical similarity,³⁴ androstenedione-bound CYP19A1 was identified as a cluster centroid to represent the entire group of similar complexes, including the exemestane-CYP19A1 structure.

We selected this case to demonstrate that a non-redundant library is adequate to build complex models fairly indistinguishable from experimental structures. The exemestane-CYP19A1 model constructed in this study is shown in Fig. 8a as thick sticks colored by atom type representing exemestane and purple ribbons representing CYP19A1. Two other structures are globally aligned onto the exemestane-CYP19A1 model, the androstenedione-CYP19A1 complex used as the template to position exemestane within the steroid-binding pocket and the experimentally determined exemestane-CYP19A1 complex; both structures are presented in Fig. 8a as thin sticks colored by atom type and teal ribbons. Indeed, the Cα-RMSD, as well as the RMSD calculated over binding residues between CYP19A1 model and experimental structure are below 1 Å. Further, RMSD calculated for exemestane upon the global structure superposition is as low as 0.06 Å demonstrating that the exemestane-CYP19A1 assembly is modeled with a very high accuracy. It is also noteworthy that eFindSite identified the binding site for exemestane with 96.2% confidence and the predicted binding residues, shown as purple spheres in Fig. 8a, yield an MCC of 0.71 against exemestane-binding residues in the CYP19A1 model.

The full-length model of lysosomal protein NPC2 was constructed based on the crystal structure of the human NPC2 (PDB-ID: 5kwy, chain C, 100.0% sequence identity with a coverage of 87.4%).⁶⁵ Figure 8b shows the global superposition of the NPC2 model represented by gold ribbons and two experimental NPC2 structures represented by teal ribbons, human (the template, 5kwyC) and bovine (PDB-ID: 2hka, chain A, 79% sequence identity to human NPC2),⁶⁹ both complexed with cholesterol sulfate. These superpositions yield a Cα-RMSD of 0.92 Å against human and 1.06 Å against bovine structures. NPC2 has an Ig-like β-sandwich fold comprising seven β-strands forming a hydrophobic pocket that was suggested to become wider in order to accommodate cholesterol-like molecules.⁷⁰ This region was accurately identified by eFindSite with a high confidence of 95.2% as a highly hydrophobic binding site formed by 20 conserved residues. The prediction was made based on a non-redundant set of 21 holo-templates, including ganglioside GM2 activator (GM2A), lymphocyte antigen 96 (LY96), mite group 2 allergen Der f 2 (DERF2), and NPC2 itself. Selected template-bound ligands are shown in Fig. 8b as a cluster of transparent, teal molecules upon the global alignment of template proteins onto the NPC2 model. In addition, eFindSite estimated that the average ± standard deviation molecular weight (MW), octanol-water partition coefficient (logP), and polar surface area (PSA) for molecules binding to this region on the NPC2 surface are 383 Da ± 225, 4.76 ± 1.97, and 90.2 Å² ± 77.9, respectively. The predicted physicochemical properties of putative binders of NPC2 are a good match for exemestane (and androstenedione), whose MW is 296 Da (286 Da), logP is 4.03 (4.09), and PSA is 34.1 Å² (34.1 Å²).
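The statement that both steroids match the predicted ligand profile can be checked with simple arithmetic using the values quoted above. The sketch below expresses each property as a z-score relative to the eFindSite estimate; treating a deviation within one standard deviation as a good match is our illustrative criterion, not one stated by the method.

```python
# Mean and standard deviation estimated by eFindSite for the NPC2 pocket (from the text).
predicted_profile = {"MW": (383.0, 225.0), "logP": (4.76, 1.97), "PSA": (90.2, 77.9)}

# Properties of the two steroids as quoted in the text.
ligands = {
    "exemestane": {"MW": 296.0, "logP": 4.03, "PSA": 34.1},
    "androstenedione": {"MW": 286.0, "logP": 4.09, "PSA": 34.1},
}


def z_scores(properties, profile):
    """Deviation of each property from the predicted mean, in units of SD."""
    return {key: (properties[key] - mean) / sd for key, (mean, sd) in profile.items()}


def within_one_sd(properties, profile):
    """True when every property lies within one SD of the predicted mean."""
    return all(abs(z) <= 1.0 for z in z_scores(properties, profile).values())


for name, properties in ligands.items():
    scores = z_scores(properties, predicted_profile)
    summary = ", ".join(f"{key}: {value:+.2f}" for key, value in scores.items())
    print(f"{name}: {summary}; within 1 SD: {within_one_sd(properties, predicted_profile)}")
```

All three properties of both compounds fall below the predicted means but within one standard deviation of them.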

Although the global similarity between CYP19A1 and NPC2 is low as assessed by a TM-score of 0.14 and 5.2% sequence identity, eMatchSite predicted that their binding sites are in fact similar with a high eMS-score of 0.86. Figure 8c shows exemestane repositioned from CYP19A1 to the cholesterol-binding pocket of NPC2 based on the sequence order-independent local alignment reported by eMatchSite. Exemestane fits into a deep, non-polar cavity in the NPC2 structure forming a number of hydrophobic interactions with Y55, V57, V73, V74, F85, P88, Y109, N111, L113, V126, W128, and W141. Encouragingly, an interaction energy of −409.5 calculated with DFIRE for the exemestane-NPC2 complex is lower than a value of −381.4 for exemestane-CYP19A1 indicating that this drug may form favorable interactions with NPC2. Notably, exemestane adopts a conformation distinct from that of cholesterol sulfate in the crystal structure of NPC2. The latter is larger (MW of 466 Da) and has two moieties attached to the steroid scaffold, an aliphatic branched-chain interacting with the inner part of the NPC2 pocket and a polar sulfate group protruding from the pocket toward the cholesterol-transfer tunnel between NPC2 and the N-terminal domain of NPC1.⁶⁵ In contrast, smaller exemestane may bind deeper in the NPC2 structure to inhibit conformational changes required for transporting cholesterol to NPC1.

This conjecture is supported by several recent studies. For instance, U18666A, a cationic sterol similar to exemestane with a TC of 0.67, binds to NPC1, inhibiting cholesterol export.⁷¹ Further, FDA-approved ezetimibe was shown to target NPC1 decreasing the cholesterol level.⁷² Another study independently suggests repurposing thiabendazole, a potent inhibitor of cytochrome P450 1A2 (CYP1A2), to NPC1.⁷³ Note that CYP1A2 and CYP19A1 are members of the cytochrome P450 family⁷⁴ (Pfam-ID: PF00067) and have highly similar structures with a TM-score of 0.87. Finally, NPC2 was demonstrated to bind a range of cholesterol-related molecules, leading to an alteration of its function in lysosomal cholesterol transport.⁷⁵ On that account, we hypothesize that exemestane binding to NPC2 disrupts the dynamics of its hydrophobic cavity. This effect could be exploited as a viable strategy to impede sterol movement to NPC1 preventing the accumulation of cholesterol in lysosomes in NPC disease.

Conclusions

Rational repositioning of existing drugs is expected to play a major role in the development of treatments for orphan diseases. Comparing ligand-binding sites in protein structures is among the most promising computational techniques to inform drug repurposing efforts. In this study, we demonstrate that combining eMatchSite with structure-based virtual screening enhances the accuracy of the detection of similar binding pockets. This methodology was employed to match drug-binding pockets from DrugBank with those from Orphanet, exposing a number of opportunities to combat orphan diseases with existing drugs.

Materials and methods

DrugBank and Orphanet datasets

The DrugBank dataset includes proteins binding FDA-approved drugs with a molecular weight of 150–550 Da selected from DrugBank,³² whereas the Orphanet dataset contains proteins associated with rare disorders obtained from Orphanet (http://www.orpha.net). Target structures composed of 50–999 amino acids in both datasets were modeled with eThread, a template-based structure prediction algorithm.³³ In the next step, drug-binding pockets were predicted by eFindSite³⁴ in confidently modeled DrugBank and Orphanet target proteins, i.e., those whose estimated GDT-score is ≥0.4. Drug repositioning utilizes only those binding sites assigned a high or moderate confidence. Further, we devised a two-step alignment protocol to position drug compounds within the predicted binding pockets in the DrugBank proteins. First, holo-templates selected by eFindSite were structurally aligned onto the target protein with Fr-TM-align.³⁵ Second, the drug molecule was superposed onto the most similar template-bound ligand according to the chemical alignment constructed by KCOMBU.³⁶ The Orphanet dataset comprises 922 proteins, whereas the DrugBank dataset contains 2012 drug-protein complexes formed by 715 drugs and 348 proteins.
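
The selection criteria described above can be summarized as a small filter. This is only an illustrative sketch: the thresholds are taken from the text, whereas the record layout, the field names, the confidence labels, and the example entries are hypothetical and do not reflect the actual file formats of DrugBank, Orphanet, eThread, or eFindSite.

```python
from dataclasses import dataclass

MW_RANGE = (150.0, 550.0)                   # Da, drug molecular weight window (from the text)
LENGTH_RANGE = (50, 999)                    # amino acids, target size window (from the text)
MIN_GDT = 0.4                               # estimated GDT-score threshold (from the text)
ACCEPTED_CONFIDENCE = {"high", "moderate"}  # label strings are an assumption


@dataclass
class Target:
    name: str               # hypothetical identifier
    length: int             # number of amino acids
    est_gdt: float          # estimated GDT-score of the model
    pocket_confidence: str  # confidence assigned to the predicted pocket


def drug_passes(molecular_weight):
    """True if the drug falls within the molecular weight window."""
    return MW_RANGE[0] <= molecular_weight <= MW_RANGE[1]


def target_passes(target):
    """True if the target satisfies the size, model quality, and pocket confidence criteria."""
    return (
        LENGTH_RANGE[0] <= target.length <= LENGTH_RANGE[1]
        and target.est_gdt >= MIN_GDT
        and target.pocket_confidence in ACCEPTED_CONFIDENCE
    )


# Hypothetical example records.
targets = [
    Target("protein_A", 320, 0.62, "high"),
    Target("protein_B", 1200, 0.70, "high"),    # too long
    Target("protein_C", 210, 0.35, "moderate"),  # model not confident
    Target("protein_D", 150, 0.45, "low"),       # pocket confidence too low
]
selected = [t.name for t in targets if target_passes(t)]
```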

Matching DrugBank and Orphanet pockets

All-against-all matching of drug-binding pockets in DrugBank and Orphanet proteins was conducted with eMatchSite.¹⁵,¹⁶ This algorithm constructs sequence order-independent alignments of pocket residues by solving the assignment problem with machine learning and the Kuhn-Munkres algorithm.⁷⁶,⁷⁷ Local alignments are then assigned a similarity score, called the eMS-score, which measures the overlap of various physicochemical features and evolutionary profiles. For significant matches identified with eMatchSite, drugs bound to the DrugBank target were transferred to a binding site in the Orphanet protein upon the superposition of the two pockets according to the local alignment. Subsequently, the constructed complexes of drugs repositioned to Orphanet proteins were rebuilt with Modeller³⁷ in order to refine drug-target interactions and eliminate steric clashes. The quality of the final complex models was assessed with DFIRE,³⁸ a knowledge-based statistical energy function for protein-ligand complexes, and by VS with Vina.¹⁷
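
The assignment step can be illustrated with a compact, pure-Python version of the Kuhn-Munkres (Hungarian) method. This is a sketch under stated assumptions rather than the eMatchSite implementation: the function name kuhn_munkres is introduced here for illustration, and the cost matrix holds hypothetical residue-residue dissimilarities, whereas in eMatchSite the pairwise scores are derived with machine learning from residue-level features.

```python
def kuhn_munkres(cost):
    """Minimum-cost assignment of rows to columns (requires rows <= columns).

    Returns a list of (row, column) pairs sorted by row, using 0-based indices.
    """
    n, m = len(cost), len(cost[0])
    if n > m:
        raise ValueError("cost matrix must have at least as many columns as rows")
    inf = float("inf")
    u = [0.0] * (n + 1)   # row potentials
    v = [0.0] * (m + 1)   # column potentials
    p = [0] * (m + 1)     # p[j]: row currently matched to column j (0 = none)
    way = [0] * (m + 1)   # back-pointers along the augmenting path
    for i in range(1, n + 1):
        p[0] = i
        j0 = 0
        minv = [inf] * (m + 1)
        used = [False] * (m + 1)
        while True:
            used[j0] = True
            i0, delta, j1 = p[j0], inf, 0
            for j in range(1, m + 1):
                if not used[j]:
                    cur = cost[i0 - 1][j - 1] - u[i0] - v[j]
                    if cur < minv[j]:
                        minv[j], way[j] = cur, j0
                    if minv[j] < delta:
                        delta, j1 = minv[j], j
            # Update potentials so that at least one new edge becomes tight.
            for j in range(m + 1):
                if used[j]:
                    u[p[j]] += delta
                    v[j] -= delta
                else:
                    minv[j] -= delta
            j0 = j1
            if p[j0] == 0:
                break
        # Flip the matching along the augmenting path.
        while j0:
            j1 = way[j0]
            p[j0] = p[j1]
            j0 = j1
    return sorted((p[j] - 1, j - 1) for j in range(1, m + 1) if p[j])


# Hypothetical dissimilarities: 3 residues of pocket A (rows) vs 4 residues of pocket B (columns).
cost = [
    [0.9, 0.2, 0.7, 0.8],
    [0.3, 0.6, 0.9, 0.5],
    [0.8, 0.7, 0.1, 0.6],
]
pairs = kuhn_munkres(cost)  # [(0, 1), (1, 0), (2, 2)]
```

Because the pairing depends only on the cost matrix and not on the order of residues along the chain, the resulting alignment is sequence order-independent; one residue of the larger pocket is simply left unmatched.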

Huang dataset

The Huang dataset was originally compiled to evaluate the performance of geometry-based methods to predict binding pockets²¹ and was later adopted to assess the accuracy of pocket comparison algorithms.⁷⁸ From this dataset, we selected 107 proteins for which eFindSite correctly annotated binding sites within a distance of 8 Å from the geometric center of the bound ligand in the experimental complex structure. These target proteins bind the following ligands: adenosine, biotin, fructose-6-phosphate, α-L-fucose, β-D-galactose, guanine, α-D-mannose, O1-methyl-mannose, 4-phenyl-1H-imidazole, palmitic acid, retinol, and 2’-deoxyuridine 5’-monophosphate. Comprehensive information on the Huang dataset is given in Supplementary Table S1.
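
The 8 Å selection criterion amounts to a distance check between the predicted pocket center and the geometric center of the ligand. The sketch below assumes that coordinates are given as (x, y, z) tuples in Å and that the geometric center is the unweighted mean of the ligand atom positions; the function names and coordinates are hypothetical.

```python
import math


def geometric_center(coords):
    """Unweighted mean position of a list of (x, y, z) coordinates."""
    n = len(coords)
    return tuple(sum(c[k] for c in coords) / n for k in range(3))


def pocket_is_correct(pocket_center, ligand_coords, cutoff=8.0):
    """True if the predicted pocket center lies within `cutoff` Å of the ligand center."""
    return math.dist(pocket_center, geometric_center(ligand_coords)) <= cutoff


# Hypothetical ligand atoms; their geometric center is (1.0, 1.0, 0.0).
ligand = [(0.0, 0.0, 0.0), (2.0, 0.0, 0.0), (0.0, 2.0, 0.0), (2.0, 2.0, 0.0)]

near = pocket_is_correct((4.0, 5.0, 0.0), ligand)  # distance 5 Å, within the cutoff
far = pocket_is_correct((1.0, 1.0, 9.0), ligand)   # distance 9 Å, outside the cutoff
```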

Virtual screening

Each target binding site was subjected to VS with AutoDock Vina¹⁷ against a non-redundant library of 1515 FDA-approved drugs compiled previously.²⁰ MGLTools⁷⁹ and Open Babel⁸⁰ were used to add polar hydrogens and partial charges, as well as to convert target proteins and library compounds to the PDBQT format. For each docking ligand, the optimal search space centered on the binding site annotated with eFindSite was calculated from its radius of gyration.⁸¹ Molecular docking was carried out with AutoDock Vina 1.1.2 and the default set of parameters.
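
How a search space can be derived from the radius of gyration is sketched below. Several assumptions apply: the radius of gyration is computed without atomic masses, the box is cubic, and the scale factor relating the box edge to the radius of gyration is passed in explicitly because its value is not given in this section (the 2.9 used in the example is a placeholder to be checked against ref. 81). The output uses the center_x/size_x style keys of an AutoDock Vina configuration file; the coordinates are hypothetical.

```python
import math


def radius_of_gyration(coords):
    """Unweighted radius of gyration of a list of (x, y, z) coordinates in Å."""
    n = len(coords)
    cx, cy, cz = (sum(c[k] for c in coords) / n for k in range(3))
    squared = sum((x - cx) ** 2 + (y - cy) ** 2 + (z - cz) ** 2 for x, y, z in coords)
    return math.sqrt(squared / n)


def vina_box(center, ligand_coords, scale):
    """Cubic search box centered on the pocket, with edge = scale * radius of gyration."""
    edge = scale * radius_of_gyration(ligand_coords)
    lines = [f"center_{axis} = {value:.3f}" for axis, value in zip("xyz", center)]
    lines += [f"size_{axis} = {edge:.3f}" for axis in "xyz"]
    return "\n".join(lines)


# Hypothetical ligand with a radius of gyration of exactly 1 Å.
ligand = [(1.0, 0.0, 0.0), (-1.0, 0.0, 0.0), (0.0, 1.0, 0.0), (0.0, -1.0, 0.0)]
pocket_center = (10.0, -2.5, 7.25)  # hypothetical pocket center

config = vina_box(pocket_center, ligand, scale=2.9)  # scale value is an assumption
```

Sizing the box per ligand, rather than using one fixed box for the whole library, keeps the search space small for compact compounds while still accommodating extended ones.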

Data availability

Data generated for the repositioning of DrugBank drugs to Orphanet proteins are available from the Open Science Framework at https://osf.io/qdjup/. The source code of the programs used in this study is available from GitHub: eThread (https://github.com/michal-brylinski/ethread), eFindSite (https://github.com/michal-brylinski/efindsite), and eMatchSite (https://github.com/michal-brylinski/ematchsite).

References

Ashburn, T. T. & Thor, K. B. Drug repositioning: identifying and developing new uses for existing drugs. Nat. Rev. Drug. Discov. 3, 673–683 (2004).
Provost, G. “Homeless” or “orphan” drugs. Am. J. Hosp. Pharm. 25, 609 (1968).
Kwok, A. K. & Koenigbauer, F. M. Incentives to repurpose existing drugs for orphan indications. ACS Med Chem. Lett. 6, 828–830 (2015).
Boolell, M. et al. Sildenafil: an orally active type 5 cyclic GMP-specific phosphodiesterase inhibitor for the treatment of penile erectile dysfunction. Int. J. Impot. Res. 8, 47–52 (1996).
Callmander, E. & Hellgren, L. Amantadine hydrochloride as a prophylactic in respiratory infections. A double-blind investigation of its clinical use and serology. J. Clin. Pharmacol. J. New. Drugs 8, 186–189 (1968).
Schwab, R. S., Poskanzer, D. C., England, A. C. Jr & Young, R. R. Amantadine in Parkinson’s disease. Review of more than two years’ experience. JAMA 222, 792–795 (1972).
Bormann, J. Memantine is a potent blocker of N-methyl-D-aspartate (NMDA) receptor channels. Eur. J. Pharmacol. 166, 591–592 (1989).
Olivares, D. et al. N-methyl D-aspartate (NMDA) receptor antagonists and memantine treatment for Alzheimer’s disease, vascular dementia and Parkinson’s disease. Curr. Alzheimer Res. 9, 746–758 (2012).
Li, J. et al. A survey of current trends in computational drug repositioning. Brief. Bioinform. 17, 2–12 (2016).
Ehrt, C., Brinkjost, T. & Koch, O. Impact of binding site comparisons on medicinal chemistry and rational molecular design. J. Med. Chem. 59, 4121–4151 (2016).
Xie, L. & Bourne, P. E. Detecting evolutionary relationships across existing fold space, using sequence order-independent profile-profile alignments. Proc. Natl Acad. Sci. USA 105, 5441–5446 (2008).
Schalon, C., Surgand, J. S., Kellenberger, E. & Rognan, D. A simple and fuzzy method to align and compare druggable ligand-binding sites. Proteins 71, 1755–1778 (2008).
Kinnings, S. L. et al. Drug discovery using chemical systems biology: repositioning the safe medicine Comtan to treat multi-drug and extensively drug resistant tuberculosis. PLoS Comput. Biol. 5, e1000423 (2009).
Defranchi, E. et al. Binding of protein kinase inhibitors to synapsin I inferred from pair-wise binding site similarity measurements. PLoS One 5, e12214 (2010).
Brylinski, M. eMatchSite: sequence order-independent structure alignments of ligand binding pockets in protein models. PLoS Comput. Biol. 10, e1003829 (2014).
Brylinski, M. Local alignment of ligand binding sites in proteins for polypharmacology and drug repositioning. Methods Mol. Biol. 1611, 109–122 (2017).
Trott, O. & Olson, A. J. AutoDock Vina: improving the speed and accuracy of docking with a new scoring function, efficient optimization, and multithreading. J. Comput. Chem. 31, 455–461 (2010).
Brylinski, M., Naderi, M., Govindaraj, R. G. & Lemoine, J. eRepo-ORP: Exploring the opportunity space to combat orphan diseases with existing drugs. J. Mol. Biol. https://doi.org/10.1016/j.jmb.2017.12.001 (2018).
Berman, H. M. et al. The Protein Data Bank. Acta Crystallogr. D. Biol. Crystallogr. 58, 899–907 (2002).
Govindaraj, R. G. & Brylinski, M. Comparative assessment of strategies to identify similar ligand-binding pockets in proteins. bioRxiv https://doi.org/10.1101/268565 (2018).
Huang, B. & Schroeder, M. LIGSITEcsc: predicting ligand binding sites using the Connolly surface and degree of conservation. BMC Struct. Biol. 6, 19 (2006).
Truchon, J. F. & Bayly, C. I. Evaluating virtual screening methods: good and bad metrics for the “early recognition” problem. J. Chem. Inf. Model. 47, 488–508 (2007).
Karaman, M. W. et al. A quantitative analysis of kinase inhibitor selectivity. Nat. Biotechnol. 26, 127–132 (2008).
Cokorinos, E. C. et al. Activation of skeletal muscle AMPK promotes glucose disposal and glucose lowering in non-human primates and mice. Cell. Metab. 25, 1147–1159 (2017). e1110.
Zhang, Y. & Skolnick, J. Scoring function for automated assessment of protein structure template quality. Proteins 57, 702–710 (2004).
Kabsch, W. A solution for the best rotation to relate two sets of vectors. Acta Crystallogr. A. 32, 922–923 (1976).
Jacobs, M. D. et al. Pim-1 ligand-bound structures reveal the mechanism of serine/threonine kinase inhibition by LY294002. J. Biol. Chem. 280, 13728–13734 (2005).
Matthews, B. W. Comparison of the predicted and observed secondary structure of T4 phage lysozyme. Biochim. Biophys. Acta 405, 442–451 (1975).
Ouchi, T. et al. Lysine and arginine biosyntheses mediated by a common carrier protein in Sulfolobus. Nat. Chem. Biol. 9, 277–283 (2013).
Esser, L. et al. Synapsin I is structurally similar to ATP-utilizing enzymes. EMBO. J. 17, 977–984 (1998).
Walker, E. H. et al. Structural determinants of phosphoinositide 3-kinase inhibition by wortmannin, LY294002, quercetin, myricetin, and staurosporine. Mol. Cell. 6, 909–919 (2000).
Wishart, D. S. et al. DrugBank: a comprehensive resource for in silico drug discovery and exploration. Nucleic Acids Res. 34, D668–672 (2006).
Brylinski, M. & Lingam, D. eThread: a highly optimized machine learning-based approach to meta-threading and the modeling of protein tertiary structures. PLoS ONE 7, e50200 (2012).
Brylinski, M. & Feinstein, W. P. eFindSite: improved prediction of ligand binding sites in protein models using meta-threading, machine learning and auxiliary ligands. J. Comput. Aided Mol. Des. 27, 551–567 (2013).
Pandit, S. B. & Skolnick, J. Fr-TM-align: a new protein structural alignment method based on fragment alignments and the TM-score. BMC Bioinform. 9, 531 (2008).
Kawabata, T. Build-up algorithm for atomic correspondence between chemical structures. J. Chem. Inf. Model. 51, 1775–1787 (2011).
Webb, B. & Sali, A. Protein structure modeling with MODELLER. Methods Mol. Biol. 1137, 1–15 (2014).
Zhang, C., Liu, S., Zhu, Q. & Zhou, Y. A knowledge-based energy function for protein-ligand, protein-protein, and protein-DNA complexes. J. Med. Chem. 48, 2325–2335 (2005).
Pollak, M. R. et al. Autosomal dominant hypocalcaemia caused by a Ca(2+)-sensing receptor gene mutation. Nat. Genet. 8, 303–307 (1994).
Kinoshita, Y., Hori, M., Taguchi, M., Watanabe, S. & Fukumoto, S. Functional activities of mutant calcium-sensing receptors determine clinical presentations in patients with autosomal dominant hypocalcemia. J. Clin. Endocrinol. Metab. 99, E363–368 (2014).
Nesbit, M. A. et al. Mutations affecting G-protein subunit alpha11 in hypercalcemia and hypocalcemia. N. Engl. J. Med. 368, 2476–2486 (2013).
Roszko, K. L., Bi, R. D. & Mannstadt, M. Autosomal dominant hypocalcemia (hypoparathyroidism) types 1 and 2. Front Physiol. 7, 458 (2016).
Guay, D. R. Tolcapone, a selective catechol-O-methyltransferase inhibitor for treatment of Parkinson’s disease. Pharmacotherapy 19, 6–20 (1999).
Najib, J. Entacapone: a catechol-O-methyltransferase inhibitor for the adjunctive treatment of Parkinson’s disease. Clin. Ther. 23, 802–832 (2001). discussion 771.
Huang, W. S. et al. Discovery of 3-[2-(imidazo[1,2-b]pyridazin-3-yl)ethynyl]-4-methyl-N-{4-[(4-methylpiperazin-1-yl)methyl]-3-(trifluoromethyl)phenyl}benzamide (AP24534), a potent, orally active pan-inhibitor of breakpoint cluster region-abelson (BCR-ABL) kinase including the T315I gatekeeper mutant. J. Med. Chem. 53, 4701–4719 (2010).
Ben-Salem, S., Begum, M. A., Ali, B. R. & Al-Gazali, L. A novel aberrant splice site mutation in RAB23 leads to an eight nucleotide deletion in the mRNA and is responsible for Carpenter syndrome in a consanguineous emirati family. Mol. Syndromol. 3, 255–261 (2013).
Haye, D. et al. Prenatal findings in carpenter syndrome and a novel mutation in RAB23. Am. J. Med. Genet. A. 164A, 2926–2930 (2014).
Robinson, L. K., James, H. E., Mubarak, S. J., Allen, E. J. & Jones, K. L. Carpenter syndrome: natural history and clinical spectrum. Am. J. Med. Genet. 20, 461–469 (1985).
Gayko, U. et al. Development of the Bruton’s tyrosine kinase inhibitor ibrutinib for B cell malignancies. Ann. N. Y. Acad. Sci. 1358, 82–94 (2015).
Reynolds, C. & Garg, A. K. Who is a diabetic? Can. Fam. Physician 24, 687–690 (1978).
Borowiec, M. et al. Mutations at the BLK locus linked to maturity onset diabetes of the young and beta-cell dysfunction. Proc. Natl Acad. Sci. USA 106, 14460–14465 (2009).
Bonnefond, A. et al. Reassessment of the putative role of BLK-p.A71T loss-of-function mutation in MODY and type 2 diabetes. Diabetologia 56, 492–496 (2013).
Imam, M. H., Shenoy, P. J., Flowers, C. R., Phillips, A. & Lechowicz, M. J. Incidence and survival patterns of cutaneous T-cell lymphomas in the United States. Leuk. Lymphoma 54, 752–759 (2013).
Petersen, D. L. et al. B-lymphoid tyrosine kinase (Blk) is an oncogene and a potential target for therapy with dasatinib in cutaneous T-cell lymphoma (CTCL). Leukemia 28, 2109–2112 (2014).
Zemla, A., Venclovas, C., Moult, J. & Fidelis, K. Processing and analysis of CASP3 protein structure predictions. Proteins Suppl 3, 22–29 (1999).
Cowan-Jacob, S. W. et al. The crystal structure of a c-Src complex in an active conformation suggests possible steps in c-Src activation. Structure 13, 861–871 (2005).
Yuki, H. et al. Activity cliff for 7-substituted pyrrolo-pyrimidine inhibitors of HCK explained in terms of predicted basicity of the amine nitrogen. Bioorg. Med. Chem. 25, 4259–4264 (2017).
Tanimoto, T. T. An elementary mathematical theory of classification and prediction. (IBM Internal Report, 1958).
Pentchev, P. G. Niemann-Pick C research from mouse to gene. Biochim. Biophys. Acta 1685, 3–7 (2004).
Chikh, K., Rodriguez, C., Vey, S., Vanier, M. T. & Millat, G. Niemann-Pick type C disease: subcellular location and functional characterization of NPC2 proteins with naturally occurring missense mutations. Hum. Mutat. 26, 20–28 (2005).
Klunemann, H. H. et al. Frontal lobe atrophy due to a mutation in the cholesterol binding protein HE1/NPC2. Ann. Neurol. 52, 743–749 (2002).
Millat, G. et al. Niemann-Pick C disease: use of denaturing high performance liquid chromatography for the detection of NPC1 and NPC2 genetic variations and impact on management of patients and families. Mol. Genet. Metab. 86, 220–232 (2005).
Millat, G. et al. Niemann-Pick disease type C: spectrum of HE1 mutations and genotype/phenotype correlations in the NPC2 group. Am. J. Hum. Genet. 69, 1013–1021 (2001).
Park, W. D. et al. Identification of 58 novel mutations in Niemann-Pick disease type C: correlation with biochemical phenotype and importance of PTC1-like domains in NPC1. Hum. Mutat. 22, 313–325 (2003).
Li, X., Saha, P., Li, J., Blobel, G. & Pfeffer, S. R. Clues to the mechanism of cholesterol transfer from the structure of NPC1 middle lumenal domain bound to NPC2. Proc. Natl Acad. Sci. USA 113, 10079–10084 (2016).
Buzdar, A. U., Robertson, J. F., Eiermann, W. & Nabholtz, J. M. An overview of the pharmacology and pharmacokinetics of the newer generation aromatase inhibitors anastrozole, letrozole, and exemestane. Cancer 95, 2006–2016 (2002).
Lo, J. et al. Structural basis for the functional roles of critical residues in human cytochrome p450 aromatase. Biochemistry 52, 5821–5829 (2013).
Ghosh, D. et al. Novel aromatase inhibitors by structure-guided design. J. Med. Chem. 55, 8464–8476 (2012).
Xu, S., Benoff, B., Liou, H. L., Lobel, P. & Stock, A. M. Structural basis of sterol binding by NPC2, a lysosomal protein deficient in Niemann-Pick type C2 disease. J. Biol. Chem. 282, 23525–23531 (2007).
Friedland, N., Liou, H. L., Lobel, P. & Stock, A. M. Structure of a cholesterol-binding protein deficient in Niemann-Pick type C2 disease. Proc. Natl Acad. Sci. USA 100, 2512–2517 (2003).
Lu, F. et al. Identification of NPC1 as the target of U18666A, an inhibitor of lysosomal cholesterol export and Ebola infection. Elife 4, e12177 (2015).
Phan, B. A., Dayspring, T. D. & Toth, P. P. Ezetimibe therapy: mechanism of action and clinical update. Vasc. Health Risk. Manag. 8, 415–427 (2012).
Soufan, O. et al. DRABAL: novel method to mine large high-throughput screening assays using Bayesian active learning. J. Cheminform. 8, 64 (2016).
Bateman, A. et al. The Pfam protein families database. Nucleic Acids Res. 28, 263–266 (2000).
Liou, H. L. et al. NPC2, the protein deficient in Niemann-Pick C2 disease, consists of multiple glycoforms that bind a variety of sterols. J. Biol. Chem. 281, 36710–36723 (2006).
Kuhn, H. W. The Hungarian method for the assignment problem. Nav. Res. Logist. Q. 2, 83–97 (1955).
Munkres, J. Algorithms for the assignment and transportation problems. J. Soc. Ind. Appl. Math. 5, 32–38 (1957).
Chikhi, R., Sael, L. & Kihara, D. Real-time ligand binding pocket database search using local surface descriptors. Proteins 78, 2007–2028 (2010).
Morris, G. M. et al. AutoDock4 and AutoDockTools4: automated docking with selective receptor flexibility. J. Comput. Chem. 30, 2785–2791 (2009).
O’Boyle, N. M. et al. Open Babel: an open chemical toolbox. J. Cheminform. 3, 33 (2011).
Feinstein, W. P. & Brylinski, M. Calculating an optimal box size for ligand docking and virtual screening against experimental and predicted binding pockets. J. Cheminform. 7, 18 (2015).


Acknowledgements

This work was supported by the National Institute of General Medical Sciences of the National Institutes of Health [R35GM119524].

Author information

Authors and Affiliations

Department of Biological Sciences, Louisiana State University, Baton Rouge, LA, 70803, USA
Rajiv Gandhi Govindaraj, Misagh Naderi, Manali Singha, Jeffrey Lemoine & Michal Brylinski
Division of Computer Science and Engineering, Louisiana State University, Baton Rouge, LA, 70803, USA
Jeffrey Lemoine
Center for Computation and Technology, Louisiana State University, Baton Rouge, LA, 70803, USA
Michal Brylinski

Corresponding author

Correspondence to Michal Brylinski.

Ethics declarations

Competing interests

The authors declare no competing financial interests.

Additional information

Publisher's note: Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Electronic supplementary material

Supplementary Information

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

About this article

Cite this article

Govindaraj, R.G., Naderi, M., Singha, M. et al. Large-scale computational drug repositioning to find treatments for rare diseases. npj Syst Biol Appl 4, 13 (2018). https://doi.org/10.1038/s41540-018-0050-7

Received: 02 November 2017
Revised: 22 January 2018
Accepted: 03 February 2018
Published: 13 March 2018
DOI: https://doi.org/10.1038/s41540-018-0050-7
