We have developed the Computational Analysis of Novel Drug Opportunities (CANDO) platform to infer homology of drug behaviour at a proteomic level by constructing and analysing structural compound-proteome interaction signatures of 3,733 compounds with 48,278 proteins in a shotgun manner. We applied the CANDO platform to predict putative therapeutic properties of 428 psychoactive compounds that belong to the phenylethylamine, tryptamine, and cannabinoid chemical classes for treating mental health indications. Our findings indicate that these 428 psychoactives are among the top-ranked predictions for a significant fraction of mental health indications, demonstrating a significant preference for treating such indications over non-mental health indications, relative to randomized controls. Also, we analysed the use of specific tryptamines for the treatment of sleeping disorders, bupropion for substance abuse disorders, and cannabinoids for epilepsy. Our innovative use of the CANDO platform may guide the identification and development of novel therapies for mental health indications and provide an understanding of their causal basis on a detailed mechanistic level. These predictions can be used to provide new leads for preclinical drug development for mental health and other neurological disorders.
Drug discovery traditionally revolves around single biological targets and focuses on a limited set of relationships between a protein target and small molecules of interest. The goal of this approach is to change the biological function of a protein responsible for pathogenesis and subsequently determine the toxicity and side effect profile of a compound to make it a suitable clinical candidate. The expected result of this approach is a compound that modulates the single protein that it targets. Although this traditional approach has been successfully applied to develop the majority of approved drugs, it has been questioned in recent years as the number of new approved drugs continues to decrease (currently down to 30 according to fda.gov). Additionally, many new drugs are analogues to already known drugs or reformulated to improve efficacy and filed as new patents. According to the Tufts Center for the study of Drug Development (csdd.tufts.edu), the average cost to bring a new drug to market can be as large as $2.6 billion. Therefore, there exists a shortage of novel drug development because the current approach is both time and cost prohibitive1,2,3,4.
One methodology to combat the rising cost and time commitment of novel drug development is to repurpose already approved drugs that are known to have few deleterious side effects3,5,6,7,8,9,10,11. Competiveness in the pharaceutical industry hinders the systematic exploration of potential repurposing opportunites, but computational approaches enable a workaround. Using computational multi-target docking with dynamics, we developed a drug repurposing approach for malaria7 and have since validated our models numerous times experimentally3,8,9,12,13,14,15,16,17. To expand the applicability of our work, we have developed a shotgun approach to evaluate all potential drug repurposing opportunities simultaneously by evaluating the relationships of compounds with entire proteomes (chemoproteome) in an indication-specific manner9,15. Here, we describe the application of our platform to identify possible therapeutic uses of phenethylamines, tryptamines, and cannabinoids in treating mental health indications.
Leveraging computational chemoproteomics for drug discovery
Natural products have a profound impact on drug discovery. Many of these products come from plant sources18,19,20, where 60% of drugs approved by the FDA circa the 1990s came from these sources21. While this percentage has decreased to about 40% in recent years, it is clear that natural products have an important impact on drug discovery22. Since plants, animals, and other organisms have evolved together, we hypothesize that multiple modes of action are responsible for a small molecule to become a drug. We have thus developed a platform which relies on a “signature of interactions” (a row of binary or real numbers) to represent the interactions of compounds with a set of protein structures that are selected to represent the known structural universe. Our hypothosis requires that similar chemoproteome signatures indicate similar functional behvarious while non-similar signatures (or regioins thereof) indicate off and anti-target (side) effects as these signatures infer proteomic homology of compound or drug behavior. We can use these chemoproteomic signatures to rank how well a compound can be repurposed for given indication and provide a set of protein interactions responsible for this ranking to obtain an understanding of drug mechanisms at the level of atomic interactions.
CANDO: A shotgun computational chemoproteomics platform for drug repurposing and discovery
Biologically active molecules, such as proteins and drugs, do not function in isolation. The absorption, dispersion, metabolism, and excretion (ADME) and effectiveness of a drug are dependent on the interactions of the drug with a system of proteins expressed at different sites in an organism. The Computational Analysis of Novel Drug Opportunities (CANDO) platform works at the proteomic level by leveraging the interaction signature of a compound to all proteins in a generic structural library. It compares the signatures of candidate compounds/drugs to those approved for particular indications to make drug repurposing predictions in a shotgun manner (here meaning an all vs. all compound-proteome signature comparison).
The first version of the CANDO platform (CANDO v1) shown in Fig. 1 predicts interactions between 3,733 FDA approved drugs and a variety of other human ingestible compounds (including supplements and illegal substances) and 48,278 protein structures from multiple species (46,784 of which are used in this study and this protein list is provided in the GitHub data repository) either taken from the Protein Data Bank (PDB)23 or representing high confidence homology models24 constructed using protein structure prediction methods described previously15,25 Specifically, the proteins structures include solved and modeled proteins obtained from eukaryotic, prokaryotic, archaea and viral organismal proteomes, including 14,595 human proteins (8,841 of these are high-confidence models), a set of 24,958 nonredundant solved protein structures in the PDB, in addition to the remaining solved and modeled structures from M. tuberculosis, P. aeruginosa, and viral proteomes, etc. We consider different conformations of protein structures by separately including multiple domains (chains) and isoforms of proteins for calculating all compound proteins interactions. As an example, for the experimental structures considered for the human proteome, we use a mapping between PDB chains and UniProtKB/SwissProt codes26 in the human proteome. We also treat all such protein-compound interactions equally as proteins from different biological classes affect benchmarking accuracy results to predict putative repurposeable drugs for diseases15.
We employ our bioinformatic docking approach to construct a 3,733 × 46,784 compound-protein interaction matrix (see Compound-Proteome Interaction Signature section15) that is analysed to determine similarity in drug behavior15,25. No special methods were used for different protein classes (e.g., kinases and GPCRs) so that scores of two proteins from different classes could be compared directly. To generate a pose we used a hierarchical fragment-based docking with dynamics algorithm27 using knowledge-based potentials28 as done previously for the Ebola proteome29. We have previously shown that all-atom dynamics is necessary for accurate prediction of binding energies30 and demonstrated all-atom knowledge-based force fields are more accurate than physics-based approaches for both protein structure prediction and docking16,17,31,32,33,34. Furthermore, we have shown that multi-targeted docking with dynamics leads to improved hit rates for finding inhibitors of pathogens relative to conventional approaches7,8. It should be noted that the interaction score stored in this matrix does not represent whether a given target will be inhibited or activated, only that the compound and target interact. As a result, the CANDO platform can be used for both inhibitors and agonists with the caveat that the predicted effect of a compound may be unknown until verified experimentally. For example, CANDO could predict cocaine for the treatment of cocaine-related disorders. Therefore, special care needs to be used when examining these predictions since dose selection is not part of the current model.
Once the interaction matrix is constructed, our methods compare the all compound-proteome interaction signatures where the similarity of two signatures can be calculated using various metrics as simple as root mean squared deviations (RMSD) to sophisticated graph theory based comparisons that can take underlying protein-protein interactions (compiled from public sources24,35,36,37) into account. Similarities between (regions of) interaction signatures indicate a relationship in functional behaviour. However, the differences between two signatures are difficult to understand without further knowledge as it may indicate a more potent drug, a possible side effect, or no effect whatsoever. In addition to predicting a ranked list of putative drugs that are most likely to function similarly to other drugs approved for a particular indication, the signature comparison and ranking helps to analyse compound behaviour in biologically relevant pathways35,36,38. Our CANDO platform is successful for prospectively validating putative leads for several indications15,25,29.
Mental health indications and interventions
A large number of diseases and disorders have mental health implications as catalogued by the American Psychiatric Association (APA)39. These indications affect people in all age groups, social classes, and races40,41,42,43,44. The treatments for these indications mostly consist of small molecule therapeutics, varying individually for specific diseases, disorders, or conditions. According to a report published by the World Health Organization in 201145, the number of United States (US) citizens taking medication to treat mental health has increased to over two million US citizens since 2001. Anxiety disorders make up the largest category of mental illness in the US affecting a total of 42 million people. The second largest category is major depression disorder affecting 14.8 million US citizens on any given day. Approximately 2.4 million US citizens have schizophrenia where no effective treatment or cure is currently available as schizophrenia medication typically results in metabolic issues leading to weight gain and type 2 diabetes46. Collectively mental health indications/disorders cost the US economy $192.3 billion each year and result in high morbidity, with suicide being the tenth largest cause of death47,48. Unfortunately, adolescents are susceptible to depression and suicide, and the effectiveness of antidepressants for these individuals remains uncertain49.
Human use of psychoactive substances
We define psychoactives as compounds that cross the blood-brain barrier, target proteins expressed in the brain as their primary modes of action, and thereby perturb human mental states. Although proteins expressed in the brain are paramount for the prediction of compounds as potential therapies for mental health disorders, synergistic effects may occur due to interactions in the periphery. For example, it has been shown that the gut microbiome plays an important role in the central nervous system50 and multiple links between the peripheral mechanisms and depression have been found previously51,52. We have also benchmarked CANDO to show that best drug repurposing accuracies are obtained when all protein structures are used for interaction signature comparisons to determine compound similarity, suggesting the role of multiple networks working together in biology to achieve a certain phenotype/function, instead of specific proteins as used traditionally for drug discovery15,25. This approach makes CANDO different than other methods that are focused towards single target inhibitor discovery vs drug discovery. Therefore, we believe that the study of proteome-wide interaction signature for repurposing psychoactive compounds is suitable in the context of mental health indications.
Since the time of the earliest records, humans have been ingesting psychoactive substances for religious and spiritual purposes (for example, dimethyltryptamine in Ayahuasca, mescaline in Peyote), for medicinal purposes (opium), and for recreation (caffeine, nicotine, alcohol)53. The vast majority of pyschoactives are considered taboo for a variety of reasons and, with few exceptions, are not investigated for potential medicinal properties. In this study, we focus on the phenylethylamine and tryptamine classes of psychoactives described by Alexander Shulgin54,55 as well as additional cannabinoids.
Due to recent changes in legislation, a few of these compounds are available as approved drugs in some jurisdictions (for example amphetamine for diet control and attention deficit hyperactive disorder, and tetrahydrocannabinol for anxiety). The action of these compounds is thought to affect human physiology by their structural similarity/mimicry to neurotransmitters (for example, psilocybin and lysergic acid diethylamide both mimic the compound serotonin). There is an increasing amount of evidence for cannabinoids having the ability to treat epilepsy and epilepsy-related indications56,57, but its legal status is still diffuse as cannabinoids remain classified as Schedule I by the United States Federal Government (a classification possessing no medicinal use). Similarly, psilocybin and ketamine have been shown to treat depression via a mechanism not targeted by current antidepressants58,59.
These examples are the tip of a proverbial iceberg, and recent reinvestigations into the clinical relevance of illicit psychoactive compounds suggest further investigation into the potential of these compounds in treating mental health indications60. This clear disconnect between current research and current legislation warrants a more comprehensive investigation for the use of these psychoactive compounds for medicinal purposes but in vitro and in vivo verification is currently difficult given their scheduling status. The CANDO shotgun drug discovery and repurposing platform is therefore uniquely suited to conduct such an investigation to make a case for experimental verification.
While other classifications of psychoactives could be utilized (for example, all compounds known to cross the blood-brain barrier), our goal in this study was to see if any of the selected psychoactive compounds, primarily known without any therapeutic utility, are predicted to treat mental health indications. Our work also demonstrates the more general utility of the CANDO platform in assessing the effect of drug classes on this specific class of indications.
Analysing the role of psychoactives in mental health indications using CANDO
Most of our selected psychoactive compounds are illegal to synthesize and thus difficult to study in vitro (much less in vivo). Cannabinoids are in the process of being legalized for medicinal uses in some jurisdictions, and this serves as a justification for studying these drugs further. The cause of many mental health indications is not characterized by one protein, but by several proteins in several different categories61,62,63,64,65. Thus, the traditional high throughput screening methodology of testing one compound against one protein is not a suitable approach for mental health drug discovery. The CANDO platform allows evaluation of all selected psychoactives across a large library of protein structures, providing a logical and reasonable method to develop leads for medications that may be suitable for treating mental health indications. Our goal here is to study, analyse, and characterize these psychoactive compounds using the CANDO platform so that the potential medicinal properties of these compounds can be assessed and evaluated in further bench and clinical studies. The outcomes for this study are not necessarily to predict mental health therapies but rather to generate hypotheses if the predicted psychoactives serve as the most promising leads for different mental health indications based on similar chemoproteomics perspective.
We describe our results based on two approaches of examining the relationships between the selected psychoactives and mental health indications using the top-ranked predictions by the CANDO platform. An example of these predictions is given in Table 1, where we are careful to list potential issues with the predicted psychoactive. At the outset, we examined the distributions of percentages of psychoactive compounds (relative to total compounds) in the top-ranked predictions for mental health indications. Conversely, we can compare the distributions of percentages of mental health indications selected by the psychoactive compounds in the top-ranked predictions. We further analyse the latter distributions broken down by psychoactive classes and the distributions of mental health indications. We conclude with three case studies illustrating the application and utility of the CANDO platform in discovering psychoactive therapeutics to treat mental health indications. We again caution that applying these predictions for the development of new therapeutics must be done judiciously.
Putative psychoactives for mental health indications
The results showing the distributions of percentages of psychoactives for mental health indications are given in Fig. 2, and the values used to create the figure are provided in Table S7. This figure shows that the difference in the random and non-random distributions. Since these distributions are statistically different, we conclude the selected psychoactive compounds are better at treating mental health indications on average than non-psychoactive compounds selected at random. As the number of compounds considered increases, the normal and randomized distributions become more alike. This result is expected as there are a larger number of non-psychoactive compounds than the selected psychoactive ones and, therefore, the addition of a new compound is more likely to be non- psychoactive than psychoactive. Therefore, as the number of compounds in the result list increases the percentage of psychoactives predicted for any indication will decrease (Fig. 2a–d).
Selection of mental health indications by selected psychoactives
The distributions for the selection of mental health indications by selected psychoactives relative to all indications are shown in Fig. 3 (raw values provided in Table S8). The greater the percentage of mental health indications, the more selective the psychoactive. Furthermore, the indications selected by psychoactives using the CANDO platform yield a high percentage of mental health indications relative to random controls, illustrating that these psychoactives are more likely than non-psychoactives to be effective at treating mental health indications.
Comparison of randomized compound and indication distributions
The two randomized distributions in Fig. 3 (shown in green and blue) are distinct. The distribution representing randomized compounds is less uniform and has a larger average percentage (p-value less than 2 × 10−16 from a one-tailed student t-test for all four plots) than the randomized indication distribution. These data show that a single drug is more likely to treat multiple indications than a single indication is to be treated by multiple drugs. This has been shown previously by the ability to repurpose drugs66,67,68 and is an important feature of the CANDO platform. The ability to repurpose previously approved compounds is increasingly important69. This result highlights the utility of the CANDO platform for drug repurposing.
Comparison of different psychoactive classes
Figures 4 and 5, Tables S8 and S9 differentiate the psychoactives by compound class: amphetamine, cannabinoids, cathinones, phenethylamines, and tryptamines. These figures and tables illustrate that the classification of a compound has an impact on which indications it is predicted to treat. Therefore, we will continue the discussion based on psychoactive compound-classes. The details to classify psychoactive compounds used in this work is given in the Supporting Information section entitled: “Classification of psychoactive compounds via substructure searching”.
Relationships between mental health indications
Our predictions for indication-indication associations are shown in Fig. 6. Interestingly, some indication relationships have been verified clinically. These include: Epilepsy with Seizure, Cocaine-related disorders with depression70, Seizures with Substance Withdrawal Syndrome71, Depression with Anxiety,72 and possibly relating binge-eating and personality disorder73. The ability of our repurposing platform to reproduce known indication relationships suggests that our chemoproteomic signatures can capture key biological interactions. In addition, the number of overlapping psychoactive compound predictions strongly relate multiple mental health indications (width of the chords in Fig. 6). These psychoactives interact with multiple proteins (similar chemo-proteome signatures) suggesting common biochemical pathways. We are confident that our method may be useful to discover new disease pathways relating these indications. Identifying and validating these new pathways are beyond the scope of this work.
The Top10, Top25, and Top40 predictions in Fig. 2 for three mental health indications, Seasonal Affective Disorder, Circadian Rhythm Sleep Disorders, and Jet Lag Syndrome, consist only of psychoactives belonging to the tryptamine class (indication rank of 100%). The only compound known to treat all these indications is melatonin (also a tryptamine)74,75,76, indicating that its proteomic interaction signature is most similar to the interaction signatures for these predicted psychoactives. This result demonstrates that the proteomic shotgun drug repurposing approach adopted by the CANDO platform makes sensible predictions of related compounds based on their similarly of interaction signature with all proteins, compared to traditional single target approaches. As a result, we present class specific breakdowns of Fig. 2 in the Supporting information (Fig. S1). Studies by an Israeli pharmaceutical company give experimental evidence demonstrating that some of these tryptamine psychoactives are indeed likely to treat the aforementioned three indications77. These studies provide corroborative evidence for the efficacy of the CANDO platform and highlight its potential of finding new drugs for treating any indication that has at least one approved drug.
The remainder of this discussion will be used to highlight case studies which are verified in the literature. For a complete list of psychoactive predictions, please see the tables in the supporting information.
The indication with the largest number of high ranking psychoactives in the top-ranked predictions is cocaine-related disorders belonging to the cathinone class of stimulants, a summary of which is given in Table 2. The similarity between the effects of cathinone and cocaine on behaviour has been previously established as part of a similar pathway78. We are aware that some of these predictions are unlikely to have any potential for the development of new therapeutics for cocaine-related disorders due to their associated toxicity79,80. A cathinone of interest is the anti-depressant bupropion, which is well known for promoting smoking cessation and has also been proposed for the treatment of methamphetamine and cocaine substance abuse disorders81,82. These findings and related uses further showcase the ability of CANDO platform to accurately associate compounds/drugs and indications. While this example is successful in showcasing CANDO’s ability to find the relationship between compounds and mental health disorders, one needs to be cautious as these predictions may mimic cocaine and lead to adverse reactions depending on the dose. For example, Bupropion is perceived as a stimulant to those with a history of cocaine use83,84. Further, the effects of dextromethorphan may be due to its stimulant properties85. However, in some cases we can obtain therapeutic benefit from potentially problematic compounds, e.g. methadone is an approved treatment for opioid abuse, but is known to have several opioid-related effects when given in high enough dosages86.
The highest-ranking phenethylamine predicted to treat cocaine-related disorders is the antitussive drug, dextromethorphan. This compound, generally available over the counter, is known for its hallucinogenic side effects at high doses, which is reflected both in the predictions by CANDO and is reinforced in the literature87,88,89,90,91. The use of the CANDO platform for making predictions to treat specific mental health indications is strengthened by the accurate identification of bupropion and dextromethorphan (both selected psychoactives) in treating cocaine-related disorders.
The two psychoactive cannabinoids, tetrahydrocannabinol and cannabinol are predicted to treat Epilepsy and Absence Epilepsy by the CANDO platform, and cannabinol is also predicted to treat Status Epilepticus. While the cannabinoids are not the highest ranked compounds relative to other psychoactives for these indications, our findings are validated by recently published studies for the use of cannabinoids to treat epilepsy-related indications56,57. The non-psychoactive cannabinoid (cannabidiol) is not predicted to treat any epilepsy-related indications, leading to an intriguing hypothesis concerning the likelihood of a cannabinoid treating epilepsy corresponding to its psychoactivity. However, given the limited data available, further study is warranted to verify this hypothesis. Our work illustrates the recovery of known corroborative associations between cannabinoids and epilepsy but also demonstrates how predictions made by the CANDO platform can be used to develop hypotheses on the biology of diseases for experimental investigation.
An overview of the CANDO platform is described in Supporting Information. Here, we describe the approach used to analyse the data generated by this platform to characterize the role of the selected psychoactives in mental health indications.
Selection of specific phenethylamines, tryptamines, and cannabinoids
We collected a total of 428 compounds (structures in Tables S1–S6) to be investigated using CANDO and categorized them into 291 phenethylamines and 109 tryptamines described by Alexander Shulgin54,55, and 6 cannabinoids (cannabinol, cannabidiol, and tetrahydrocannabinol) using a subgraph based search methodology based on the structure of the parent molecule (see Supporting Information for full description of this method). An additional 22 compounds are not strictly classified as phenethylamines but have structural similarity to the phenethylamine class are included as unclassified. We further subdivided the 291 phenethylamine compounds into 149 amphetamines and 20 cathinones, the remaining 122 phenethylamines are simply referred to as phenethylamines. The CANDO v1 compound library includes these 428 psychoactives and their proteomic interactions signatures to repurpose psychoactives for indications/diseases9. Most of these psychoactives are classified as Schedule I substances by the United States Drug Enforcement Agency, indicating they have no known medicinal use, no accepted standards for safety, or have a high potential for abuse. Thus, when such a substance is discussed, the potential pitfalls are presented along with that substance. We selected this set of compounds as almost all of them are known to affect mental physiology upon ingestion54,55,92. A notable exception in the compounds evaluated is cannabidiol which is not strictly psychoactive92 but is structurally similar to other cannabinoids and therefore warrants an investigation into its potential therapeutic value.
The CANDO v1 compound-proteome interaction signature (see Supporting Methods) includes all associations of treatment and side effects caused for each compound via the proteomic signature as this is composed of all target, anti-target, and off-targets proteins for each indication/disease. The compound proteomic interaction signature similarity yields therapeutic predictions by considering similarity to known drug signatures for each disease. It should be noted that this methodology can also match a psychoactive to a compound known to worsen a given indication in addition to predicting a compound known to ameliorate the same indication. Therefore, the set of compounds that were used as therapy for a given indication did not include any of the aforementioned psychoactive compounds given that the nature of these compounds as treatments is still controversial. As a result, the ability of the platform to predict a psychoactive from another psychoactive compound-proteome signature is not investigated in this work. Most importantly, all predictions are made based on similarity to an approved non-psychoactive drug for a mental health indication, without any knowledge of therapeutic target associations for making predictions for psychoactive compounds. Therefore, no association between an indication and a protein target is used to weight the similarity between two compounds. For example, the interaction score of a psychoactive and the dopamine receptor is not given a special weight for Schizophrenia.
Selection of mental health indications
The Medical Subject Headings (MeSH) vocabulary is used to specify the diseases, disorders, and conditions that are classified as mental health indications. The U.S. National Laboratory of Medicine division of the National Institutes of Health (www.nlm.nih.gov) includes the latest version of the MeSH database. It should be noted that this database is compiled at the clinical level and does not consider the underlying biology leading to a specific indication. Therefore, some spurious and non-traditional indications may be included as mental health indications. Since a biological mechanism study is beyond the scope of this paper, we used all the indications suggested by MeSH.
The MeSH database is divided into tree structures with a specific tree (F03) denoted for Mental Disorders. The specific branches of the Mental Disorder Tree used in this study are Anxiety Disorders (F03.080), Dissociative Disorders (F03.300), Feeding and Eating Disorders (F03.400), Neurocognitive Disorders (F03.615), Somatoform Disorders (F03.875), Conduct Disorders (F03.250), Neurodevelopmental Disorders (F03.625), Mood Disorders (F03.600), Neurotic Disorders (F03.650), Personality Disorders (F03.675), Schizophrenia Spectrum Disorders (F03.700), Sleep-Wake Disorders (F03.870), and Substance-Related Disorders (F03.900). All the indications listed in these branches were used along with Dyspareunia, Erectile Dysfunction, Paraphilias, Fetishism, and Paedophilia from the Sexual Dysfunctions (F03.835) branch yielding a total of 108 mental health indications that are analyzed in this work. A separate MeSH identification paradigm was done for epilepsy-related indications as these indications are placed in a separate MeSH tree because they are neurological disorders, not psychiatric disorders. The MeSH tree evaluated for epilepsy is C10.228.140.490 which includes Drug-Resistant Epilepsy, Myoclonic Epilepsies, Partial Epilepsies, Benign Neonatal Epilepsy, Generalized Epilepsy, Post-Traumatic Epilepsy, Reflex Epilepsy, Landau-Kleffner Syndrome, Lennox-Gastaut Syndrome, Seizures, and Febrile Seizures. A total of 29 additional epilepsy-related indications are presented in this work.
Ranking the importance of predicted psychoactives for mental health indications
We generated Top10, Top25, Top40, and Top100 ranked compound lists for all indications (mental health related and otherwise) using the CANDO v1 platform for each indication and counted the number of times a compound prediction is present in each of the ranked lists. A compound may be predicted to treat an indication several times if there are numerous known drugs for that indication.
For example, 65 known drugs are used clinically for schizophrenia and are included in the CANDO platform. Therefore, a set of 65 chemoproteomic signature similarities are used to predict an uncharacterized compound for schizophrenia. It is possible that the same uncharacterized compound may be predicted at most 65 times for schizophrenia. The number of times such a compound is predicted for a given indication is termed as the ‘consensus count,’ which is normalized as the percent occurrence. We compute percent occurrence as the ratio of consensus count to the maximum number of times a compound could be predicted for an indication using signature similarity (i.e. the number of known treatments for the indication).
We hypothesized that the higher the number of times a compound is predicted to treat a given indication, the greater the confidence in the prediction made because different drugs treat indications due to different biological pathways on the proteomic level. The combination of proteomic similarity implicitly includes a combination of all pathways to yield efficacy and is denoted by the frequency of compounds predicted for each indication. To investigate the general role of psychoactives for mental health, we took the frequency of psychoactive compounds predicted as putative drugs for each indication as a percentage of all compounds predicted for the given indication. The ratio of psychoactive to all compounds for a given indication is referred to as the normalized indication rank and quantifies the overall performance of psychoactive compounds versus non-psychoactive for a given indication.
A similar procedure is used to measure the propensity of a compound to be predicted for mental health indications and is used to ask the question: how many times an indication is (or percentage of mental health and non-mental health indications) listed either as a prediction for treatment by each psychoactive compound and is referred to as the normalized compound rank. This measure allows us to express the preference of a given compound towards mental health indications. To illustrate metrics of normalized compound rank, and normalized indication rank, we consider a simple example where a psychoactive compound ergoline and a non-psychoactive compound aspirin are only predicted for the mental health indication Pica (an eating disorder) and Stomach pain (non-mental health indication). Now consider ergoline that is predicted seven times for Pica, and three times for Stomach pain based on the similarity of chemo-proteome analysis while the non-psychoactive drug aspirin is predicted to treat Pica four times and Stomach pain six times. The normalized compound rank for this simple example will be 70% [100 * 7/(7 + 3)] for ergoline and 40% [100 * 4/(4 + 6)] for aspirin. Using the above example, we can also determine the normalized indication rank for Pica and Stomach pain. Since seven psychoactives and four non-psychoactive compounds were predicted for Pica, the normalized indication rank is 64% [100 * 7/(4 + 7)] for Pica and similar calculation yields an indication rank of 33% for Stomach pain. Together these metrics suggest the importance of psychoactive compounds and their preference for mental health compared to non-mental health indications.
Computational randomized controls
To further ensure that our results were not arrived at by chance, the order of the predicted compounds is randomized, and the above procedures repeated. All compounds predicted to treat an indication are randomized regardless of whether the indication is categorized as a mental health indication. Thus, a compound not predicted initially to treat a mental health indication may, due to random chance, be predicted to treat a mental health indication in the randomized data set. If a random compound replaces a predicted compound multiple times for a single indication, the random compound replaces the original (non-random) compound for every prediction. This randomization process is repeated 1000 times, and the compound and indication ranks for all the randomized searches are averaged.
A second random control is performed in addition to the one described above where the indications (mental health or otherwise) are randomly rearranged. Thus, a non-mental health indication may be classified as a mental health indication by chance (and vice versa). This procedure provides a second control that allows us to assess whether selected psychoactives are more likely to be predicted for mental health indications than non-mental health indications.
Determination of relationships between mental health indications
We relate two mental health indications when at least two different psychoactive compounds are predicted for both indications. The frequency of prediction for common psychoactive compound predictions is termed as ‘indication-indication association counts.’ Next, to strongly relate the two indications, we also calculated the ‘consensus count’ for all psychoactives predicted for each mental health indication. Note that a predicted psychoactive could have a different consensus count for each indication. For example, 1-naphthyl(1-pentyl-1H-indole-3-yl) methanone is predicted 10 times for seizures and 8 times for sleep initiation and maintenance disorders (SIMD). Therefore, the consensus count of 1-naphthyl(1-pentyl-1H-indole-3-yl) methanone for seizures is 10 and 8 for SIMD. Another compound, 2-(5-methoxy-2-methyl-1H-indole-3yl)-n,n-dimethyl ethanamine has consensus count of 3 for seizures and 2 for SIMD. Since two different compounds are common predictions for the two indications, the indication-indication association count for these two indications is 2. To strongly relate the indications in Top lists and limit a large number of associations, we selected indication pairs with predicted psychoactive compound consensus count as follows: >=2 for the Top10 set, >=3 for the Top25 set, >=4 for the Top40 set, and >=6 for the Top100 set.
Tests for statistical significance
A one-tailed Kolmogorov-Smirnov test93 was used to compare the distributions of psychoactives in the randomized and non-random distributions as this statistical test is typically used to show two distributions are dissimilar. For all statistical tests performed in this work, we formulated the null hypothesis to be that the distribution of psychoactives predicted to treat mental health disorders can be obtained by chance. Our alternative hypothesis is that the true distribution of psychoactives is greater than the randomized control (hence a one-tailed test). We also performed a one-tailed paired T-test to ensure that the mean of the differences between the test distribution and the randomized distribution is greater than zero. The raw p-values obtained for all statistical tests are given in Tables S10–S11.
Traditional drug discovery is limited by its narrow focus on one or a few targets. Drugs approved for one indication interact with multiple proteins and thereby work across multiple indications. The CANDO platform improves upon the traditional approach by examining all interactions between a compound and a universal proteome. This novel approach enables the study of drugs in a holistic chemoproteomic manner that is especially relevant for the development of compounds intended for treating mental health indications as these complex disorders are mediated by multiple proteins and pathways. In this study, we investigated the compounds previously described by Alexander Shulgin along with additional cannabinoids to identify potential therapies for mental health indications. The results of this study indicate the selected psychoactive compounds perform better than compounds selected at random for mental health indications.
Conversely, the percentage of mental health indications selected by psychoactives is better than randomly selected compounds. This shows that psychoactives may represent promising leads for the development of therapeutics for the treatment of mental health indications. Specifically, the set shows promising results for sleep-related disorders, binge eating disorders, seasonal affective disorder, and cocaine substance abuse disorder. In addition, the other non-psychoactive compounds predicted by the CANDO platform present in the top-ranked predictions may also represent putative repurposable therapies for mental health indications, which will be explored in future studies. In a broader context, our work illustrates the advantages of using a computational chemoproteomics approach for drug discovery and repurposing by providing mechanistic information on which proteins are involved in the mediation of the therapeutic effect.
The 3,733 × 46,784 compound-protein interaction matrix, canpredict executables to calculate similarity with additional dependency files, output from canpredict for top10, top25, top40, top100, files for analysis, SMILES for 3,733 compounds, entire 46,784 list of proteins, R workspace and analysis scripts used to generate all figures presented in this work are available at http://github.com/chopralab/candiy_fun.git.
Bonander, N. & Bill, R. M. Relieving the first bottleneck in the drug discovery pipeline: using array technologies to rationalize membrane protein production. Expert Rev. Proteomics 6, 501–505 (2009).
Gillespie, S. H. & Singh, K. XDR-TB, what is it; how is it treated; and why is therapeutic failure so high. Recent Pat. Antiinfect. Drug Discov. 6, 77–83 (2011).
Horst, J. A., Laurenzi, A., Bernard, B. & Samudrala, R. Computational Multitarget Drug Discovery. In Polypharmacology in Drug Discovery 263–301, https://doi.org/10.1002/9781118098141.ch13 (John Wiley & Sons, Inc., 2012).
Sacks, L. V. & Behrman, R. E. Challenges, successes and hopes in the development of novel TB therapeutics. Future Med. Chem. 1, 749–756 (2009).
Ekins, S., Williams, A. J., Krasowski, M. D. & Freundlich, J. S. In silico repositioning of approved drugs for rare and neglected diseases. Drug Discov. Today 16, 298–310 (2011).
Xu, K. & Cote, T. R. Database identifies FDA-approved drugs with potential to be repurposed for treatment of orphan diseases. Brief. Bioinform. 12, 341–345 (2011).
Jenwitheesuk, E. & Samudrala, R. Identification of Potential Multitarget Antimalarial Drugs. JAMA 294, 1487 (2005).
Jenwitheesuk, E., Horst, J. A., Rivas, K. L., Van Voorhis, W. C. & Samudrala, R. Novel paradigms for drug discovery: computational multitarget screening. Trends Pharmacol. Sci. 29, 62–71 (2008).
Minie, M. et al. CANDO and the infinite drug discovery frontier. Drug Discovery Today 19, 1353–1363 (2014).
Swamidass, S. J. Mining small-molecule screens to repurpose drugs. Brief. Bioinform. 12, 327–335 (2011).
Ren, J., Xie, L., Li, W. W. & Bourne, P. E. SMAP-WS: a parallel web service for structural proteome-wide ligand-binding site comparison. Nucleic Acids Res. 38, W441–W444 (2010).
Costin, J. M. et al. Structural Optimization and De Novo Design of Dengue Virus Entry Inhibitory Peptides. PLoS Negl. Trop. Dis. 4, e721 (2010).
Nicholson, C. O. et al. Viral entry inhibitors block dengue antibody-dependent enhancement in vitro. Antiviral Res. 89, 71–74 (2011).
Horst, J. A. et al. Strategic Protein Target Analysis for Developing Drugs to Stop Dental Caries. Adv. Dent. Res. 24, 86–93 (2012).
Sethi, G., Chopra, G. & Samudrala, R. Multiscale Modelling of Relationships between Protein Classes and Drug Behavior Across all Diseases Using the CANDO Platform. Mini Rev. Med. Chem. 15, 705–717 (2015).
Ma, X. et al. Identification of New FLT3 Inhibitors That Potently Inhibit AML Cell Lines via an Azo Click-It/Staple-It Approach. ACS Med. Chem. Lett. 8, 492–497 (2017).
Hernandez-Perez, M. et al. Inhibition of 12/15-Lipoxygenase Protects Against β Cell Oxidative Stress and Glycemic Deterioration in Mouse Models of Type 1 Diabetes. Diabetes (2017).
Pink, R., Hudson, A., Mouriès, M.-A. & Bendig, M. Opportunities and Challenges in Antiparasitic Drug Discovery. Nat. Rev. Drug Discov. 4, 727–740 (2005).
Balunas, M. J. & Kinghorn, A. D. Drug discovery from medicinal plants. Life Sci. 78, 431–441 (2005).
Hur, M. et al. A global approach to analysis and interpretation of metabolic data for plant natural product discovery. Nat. Prod. Rep. 30, 565 (2013).
Cragg, G. M., Newman, D. J. & Snader, K. M. Natural products in drug discovery and development. Journal of Natural Products 60, 52–60 (1997).
Patridge, E., Gareiss, P., Kinch, M. S. & Hoyer, D. An analysis of FDA-approved drugs: Natural products and their derivatives. Drug Discovery Today 21, 204–207 (2016).
Berman, H. M. et al. The Protein Data Bank. Nucleic Acids Res. 28, 235–242 (2000).
Sander, C. & Schneider, R. Database of homology-derived protein structures and the structural meaning of sequence alignment. Proteins Struct. Funct. Bioinforma. 9, 56–68 (1991).
Chopra, G. & Samudrala, R. Exploring Polypharmacology in Drug Discovery and Repurposing Using the CANDO Platform. Curr. Pharm. Des. 22, 3109–23 (2016).
Martin, A. C. R. Mapping PDB chains to UniProtKB entries. Bioinformatics 21, 4297–4301 (2005).
Fine, J., Konc, J., Samudrala, R. & Chopra, G. CANDOCK: Chemical atomic network based hierarchical flexible docking algorithm using generalized statistical potentials. bioRxiv, https://doi.org/10.26434/chemrxiv.7187540.v1 (2018).
Bernard, B. & Samudrala, R. A generalized knowledge-based discriminatory function for biomolecular interactions. Proteins Struct. Funct. Bioinforma. 76, 115–128 (2009).
Chopra, G., Kaushik, S., Elkin, P. L. & Samudrala, R. Combating Ebola with repurposed therapeutics using the CANDO platform. Molecules 21, 1537 (2016).
Jenwitheesuk, E. et al. Improved prediction of HIV-1 protease-inhibitor binding energies by molecular dynamics simulations. BMC Struct. Biol. 2003 31 40, 299–309 (2003).
Samudrala, R. & Moult, J. An all-atom distance-dependent conditional probability discriminatory function for protein structure prediction 1 1Edited by F. Cohen. J. Mol. Biol. 275, 895–916 (1998).
Samudrala, R. & Moult, J. A graph-theoretic algorithm for comparative modeling of protein structure. J. Mol. Biol. 279, 287–302 (1998).
Xia, Y., Huang, E. S., Levitt, M. & Samudrala, R. Ab initio construction of protein tertiary structures using a hierarchical approach. J. Mol. Biol. 300, 171–185 (2000).
Samudrala, R., Xia, Y., Huang, E. & Levitt, M. Ab initio protein structure prediction using a combined hierarchical approach. Proteins Suppl 3, 194–8 (1999).
Cheng, F. et al. Prediction of drug-target interactions and drug repositioning via network-based inference. PLoS Comput. Biol. 8, e1002503 (2012).
Cheng, F., Zhou, Y., Li, W., Liu, G. & Tang, Y. Prediction of chemical-protein interactions network with weighted network-based inference method. PLoS One 7, e41064 (2012).
Bader, G. D., Betel, D. & Hogue, C. W. V. BIND: The Biomolecular Interaction Network Database. Nucleic Acids Res. 31, 248–250 (2003).
Csermely, P., Ágoston, V. & Pongor, S. The efficiency of multi-target drugs: The network approach might help drug design. Trends in Pharmacological Sciences 26, 178–182 (2005).
American Psychiatric Association. Diagnostic and Statistical Manual of Mental Disorders, https://doi.org/10.1176/appi.books.9780890425596 (2013).
Mohamed-Kaloo, Z. & Laher, S. Perceptions of mental illness among Muslim general practitioners in South Africa. South African Med. J. 104, 350 (2014).
Rüsch, N., Evans-Lacko, S. & Thornicroft, G. What is a mental illness? Public views and their effects on attitudes and disclosure. Aust. New Zeal. J. Psychiatry 46, 641–650 (2012).
Skaer, L., Robison, M., Sclar, A. & Galin, S. Treatment of depressive illness among children and adolescents in the United States. Curr. Ther. Res. 61, 692 (2000).
Bauer, A., Knapp, M. & Parsonage, M. Lifetime costs of perinatal anxiety and depression. J. Affect. Disord. 192, 83–90 (2016).
Henderson, C., Evans-Lacko, S. & Thornicroft, G. Mental illness stigma, help seeking, and public health programs. Am. J. Public Health 103, 777–80 (2013).
Medco Health Solutions. America’s state of mind (2011).
Sernyak, M. J., Leslie, D. L., Alarcon, R. D., Losonczy, M. F. & Rosenheck, R. Association of Diabetes Mellitus With Use of Atypical Neuroleptics in the Treatment of Schizophrenia. Am. J. Psychiatry 159, 561–566 (2002).
National Alliance on Mental Illness. Mental Health Facts. National Institute of Mental Health Available at, https://www.nami.org/NAMI/media/NAMI-Media/Infographics/Children-MH-Facts-NAMI.pdf (2018).
National Alliance on Mental Illness. What is mental illness: Mental illness facts, Www.Nami.Org 1–2 (2013).
Cox, G. R. et al. Psychological therapies versus antidepressant medication, alone and in combination for depression in children and adolescents. Cochrane Database of Systematic Reviews 2014 (2014).
Sharon, G., Sampson, T. R., Geschwind, D. H. & Mazmanian, S. K. The Central Nervous System and the Gut Microbiome. Cell 167, 915–932 (2016).
Akil, H. et al. Treatment resistant depression: A multi-scale, systems biology approach. Neurosci. Biobehav. Rev. 84, 272–288 (2018).
Zheng, X., Zhang, X., Wang, G. & Hao, H. Treat the brain and treat the periphery: toward a holistic approach to major depressive disorder. Drug Discov. Today 20, 562–568 (2015).
Crocq, M. A. Historical and cultural aspects of man’s relationship with addictive drugs. Dialogues in Clinical Neuroscience 9, 355–361 (2007).
Shulgin, A. & Shulgin, A. Phenethylamines I Have Known And Loved: A Chemical Love Story. Transform 1–1188 (1991).
Shulgin, A. & Shulgin, A. Tryptamines I Have Known And Loved. (Transform, 1997).
Paolino, M. C., Ferretti, A., Papetti, L., Villa, M. P. & Parisi, P. Cannabidiol as potential treatment in refractory pediatric epilepsy. Expert Review of Neurotherapeutics 16, 17–21 (2016).
Killestein, J. Cannabinoids in the Treatment of Epilepsy. N. Engl. J. Med. 374, 94–95 (2016).
Roseman, L., Demetriou, L., Wall, M. B., Nutt, D. J. & Carhart-Harris, R. L. Increased amygdala responses to emotional faces after psilocybin for treatment-resistant depression. Neuropharmacology 142, 263–269 (2018).
Pham, T. H. et al. Ketamine treatment involves medial prefrontal cortex serotonin to induce a rapid antidepressant-like activity in BALB/cJ mice. Neuropharmacology 112, 198–209 (2017).
Rucker, J. J. H., Iliff, J. & Nutt, D. J. Psychiatry & the psychedelic drugs. Past, present & future. Neuropharmacology 142, 200–218 (2018).
Nascimento, J. M. & Martins-de-Souza, D. The proteome of schizophrenia. npj Schizophr. 1, 14003 (2015).
Lee, J. et al. Proteomic Analysis of Serum from Patients with Major Depressive Disorder to Compare Their Depressive and Remission Statuses. Psychiatry Investig. 12, 249 (2015).
Carboni, L. The contribution of proteomic studies in humans, animal models, and after antidepressant treatments to investigate the molecular neurobiology of major depression. Proteomics - Clinical Applications 9, 889–898 (2015).
Taurines, R. et al. Review: Proteomic research in psychiatry. Journal of Psychopharmacology 25, 151–196 (2011).
Kobeissy, F. H., Sadasivan, S., Liu, J., Gold, M. S. & Wang, K. K. W. Psychiatric research: Psychoproteomics, degradomics and systems biology. Expert Review of Proteomics 5, 293–314 (2008).
Bloom, B. E. Recent successes and future predictions on drug repurposing for rare diseases. Expert Opin. Orphan Drugs 4, 1–4 (2016).
Oprea, T. I. & Mestres, J. Drug Repurposing: Far Beyond New Targets for Old Drugs. AAPS J. 14, 759–763 (2012).
Xu, R. & Wang, Q. Large-scale extraction of accurate drug-disease treatment pairs from biomedical literature for drug repurposing. BMC Bioinformatics 14, 181 (2013).
Lavecchia, A. & Cerchia, C. In silico methods to address polypharmacology: Current status, applications and future perspectives. Drug Discovery Today 21, 288–298 (2016).
Brown, R. A. et al. Depression Among Cocaine Abusers in Treatment: Relation to Cocaine and Alcohol Use and Treatment Outcome. Depression Among Cocaine Abusers In 155 (1998).
Brust, J. C. M. Seizures and substance abuse: treatment considerations. Neurology 67, S45–S48 (2006).
Ressler, K. J. & Nemeroff, C. B. Role of serotonergic and noradrenergic systems in the pathophysiology of depression and anxiety disorders. Depression and Anxiety 12, 2–19 (2000).
Chen, E. Y., Matthews, L., Allen, C., Kuo, J. R. & Linehan, M. M. Dialectical behavior therapy for clients with binge-eating disorder or bulimia nervosa and borderline personality disorder. Int. J. Eat. Disord. 41, 505–512 (2008).
Cerezo, A. B. et al. Quality control and determination of melatonin in food supplements. J. Food Compos. Anal. 45, 80–86 (2016).
Kumar Jha, P., Challet, E. & Kalsbeek, A. Circadian rhythms in glucose and lipid metabolism in nocturnal and diurnal mammals. Molecular and Cellular Endocrinology 418, 74–88 (2015).
Reid, K. J. & Abbott, S. M. Jet lag and shift work disorder. Sleep Med. Clin. 10, 523–535 (2015).
Zisapel, N. & Laudon, M. Derivating of tryptamine and analgous compounds and pharamaceutical formulations containing them (2004).
Schechter, M. D. & Glennon, R. A. Cathinone, cocaine and methamphetamine: similarity of behavioral effects. Pharmacol. Biochem. Behav. 22, 913–916 (1985).
Blum, K. et al. Hypothesizing that designer drugs containing cathinones (‘bath salts’) have profound neuro-inflammatory effects and dangerous neurotoxic response following human consumption. Med. Hypotheses 81, 450–455 (2013).
Ellison, G. Neural degeneration following chronic stimulant abuse reveals a weak link in brain, fasciculus retroflexus, implying the loss of forebrain control circuitry. European Neuropsychopharmacology 12, 287–297 (2002).
Carroll, F. I. et al. Bupropion and bupropion analogs as treatments for CNS disorders. Adv. Pharmacol. 69, 177–216 (2014).
Holm, K. J. & Spencer, C. M. Bupropion: A review of its use in the management of smoking cessation. Drugs 59, 1007–1024 (2000).
Vento, A. E. et al. Bupropion perceived as a stimulant by two patients with a previous history of cocaine misuse. Ann. Ist. Super. Sanita 49, 402–405 (2013).
Sullivan, M. & Evans, E. Abuse and misuse of antidepressants. Subst. Abuse Rehabil. 107, https://doi.org/10.2147/sar.s37917 (2014).
Chomchai, C. & Manaboriboon, B. Stimulant Methamphetamine and Dextromethorphan Use Among Thai Adolescents: Implications for Health of Women and Children. J. Med. Toxicol. 8, 291–294 (2012).
Strain, E. C., Stitzer, M. L., Liebson, I. A. & Bigelow, G. E. Dose-Response Effects of Methadone in the Treatment of Opioid Dependence. Ann. Intern. Med. 119, 23 (1993).
Kim, H. C. et al. Dextromethorphan affects cocaine-mediated behavioral pattern in parallel with a long-lasting Fos-related antigen-immunoreactivity. Life Sci. 69, 615–624 (2001).
Pulvirenti, L., Balducci, C. & Koob, G. F. Dextromethorphan reduces intravenous cocaine self-administration in the rat. Eur. J. Pharmacol. 321, 279–283 (1997).
Jhoo, W. K. et al. Dual effects of dextromethorphan on cocaine-induced conditioned place preference in mice. Neurosci. Lett. 288, 76–80 (2000).
Kim, H. C., Park, B. K., Hong, S. Y. & Jhoo, W. K. Dextromethorphan alters the reinforcing effect of cocaine in the rat. Methods Find Exp Clin Pharmacol 19, 627–631 (1997).
Shin, E.-J. et al. Neuropsychotoxic and Neuroprotective Potentials of Dextromethorphan and Its Analogs. J. Pharmacol. Sci. 116, 137–148 (2011).
Perrotin-Brunel, H. et al. Solubility of non-psychoactive cannabinoids in supercritical carbon dioxide and comparison with psychoactive cannabinoids. J. Supercrit. Fluids 55, 603–608 (2010).
Marsaglia, G., Tsang, W. W. & Wang, J. Evaluating Kolmogorov’s Distribution. J. Stat. Softw. 8 (2015).
This work was supported in part by a Purdue University start-up package from the Chemistry Department, Ralph W. and Grace M. Showalter Research Trust award, the Integrative Data Science Initiative award, and the Jim and Diann Robbers Cancer Research Grant for New Investigators award to Gaurav Chopra and a Lynn Fellowship to Jonathan Fine. Additional support, in part by a NIH Director’s Pioneer Award (1DP1OD006779), a NCATS Clinical and Translational Sciences Award (UL1TR001412), a NCATS Clinical and Translational Sciences Award from the Indiana Clinical and Translational Sciences Institute (UL1TR002529), and the Purdue University Center for Cancer Research, NIH grant P30 CA023168 are also acknowledged. The content is solely the responsibility of the authors and does not necessarily represent the official views of the National Institutes of Health.
The authors declare no competing interests.
Publisher’s note: Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.