Pharmacophore-based virtual screening and density functional theory approach to identifying novel butyrylcholinesterase inhibitors

Sakkiah, Sugunadevi; Lee, Keun Woo

doi:10.1038/aps.2012.21


Original Article
Published: 11 June 2012


Sugunadevi Sakkiah¹ &
Keun Woo Lee¹

Acta Pharmacologica Sinica volume 33, pages 964–978 (2012)


Abstract

Aim:

To identify the critical chemical features, with reliable geometric constraints, that contribute to the inhibition of butyrylcholinesterase (BChE) function.

Methods:

Ligand-based pharmacophore modeling was used to identify the critical chemical features of BChE inhibitors. The generated pharmacophore model was validated using various techniques, such as Fischer's randomization method, test set, and decoy set. The best pharmacophore model was used as a query in virtual screening to identify novel scaffolds that inhibit BChE. Compounds selected by the best hypothesis in the virtual screening were tested for drug-like properties, and molecular docking study was applied to determine the optimal orientation of the hit compounds in the BChE active site. To find the reactivity of the hit compounds, frontier orbital analysis was carried out using density functional theory.

Results:

Based on its correlation coefficient (0.96), root mean square (RMS) deviation (1.01), and total cost (105.72), the quantitative hypothesis Hypo1 consisting of 2 HBA, 1 Hy-Ali, and 1 Hy-Ar was selected as the best hypothesis. Thus, Hypo1 was used as a 3D query in virtual screening of the Maybridge and Chembridge databases. The hit compounds were filtered using ADMET, Lipinski's Rule of Five, and molecular docking to reduce the number of false positive results. Finally, 33 compounds were selected based on their critical interactions with the significant amino acids in BChE's active site. To confirm the inhibitors' potencies, the orbital energies, such as HOMO and LUMO, of the hit compounds and 7 training set compounds were calculated. Among the 33 hit compounds, 10 compounds with the highest HOMO values were selected, and this set was further culled to 5 compounds based on their energy gaps important for stability and energy transfer. From the overall results, 5 hit compounds were confirmed to be potential BChE inhibitors that satisfied all the pharmacophoric features in Hypo1.

Conclusion:

This study pinpoints important chemical features with geometric constraints that contribute to the inhibition of BChE activity. Five compounds are selected as the best hit BChE-inhibitory compounds.


Introduction

Cholinesterases (ChEs) are involved in the degradation of choline esters and show similarity in protein sequence but differences in their kinetic properties. On the basis of their substrate and inhibitor specificities, cholinesterases are divided into two subfamilies: acetylcholinesterases (AChEs; EC 3.1.1.7) and butyrylcholinesterases (BChEs; EC 3.1.1.8). AChE is predominantly present in the central and peripheral nervous system, as well as in muscles. In muscles, AChE terminates impulse transmission by the rapid hydrolysis of acetylcholine to acetic acid and choline¹. BChE is primarily synthesized in the liver and secreted into plasma, and it is responsible for the hydrolysis of a variety of choline (hydrophilic and hydrophobic) and non-choline esters². BChE plays a key role in cholinergic synapses by terminating acetylcholine action, although the complete physiological function of BChE remains unclear³. Both cholinesterase enzymes belong to the superfamily of α/β-hydrolase fold proteins⁴. Both AChE and BChE exist as multimers of catalytic subunits in globular forms such as G1, G2, and G4 that contain one, two, or four subunits, respectively. The hydrolysis of substrates by both enzymes proceeds through a transacylation step involving nucleophilic and general acid-base elements⁵. BChE acts as a scavenger protein that protects the cholinergic system against anticholinesterase poisons. BChE is the sole carboxylesterase^{6,7} with recognized toxicological and pharmacological importance in scavenging and detoxification of numerous ester-containing drugs, pro-drugs^{8,9}, and poisonous carbamyl- and phosphoryl-esters, including nerve agents^{10,11}.

Currently, BChE is emerging as an important pharmacological target in Alzheimer's disease (AD) therapy¹². A 40%–90% increase in BChE expression and activity has been found in AD brain neuronal plaques¹³. BChE is capable of compensating for reduced AChE catalytic functions in the synaptic cleft^{14,15} and shows significantly increased activity (30%–60%) during the time course of AD^{16,17}. Hence, in recent years, many researchers have shown keen interest in designing small molecules that can inhibit BChE activity^{18,19,20,21,22,23}. However, there is also increasing evidence of BChE's involvement in non-cholinergic functions such as cell differentiation²⁴, neurogenesis, and the formation of amyloid plaques in AD^{25,26,27}.

In this work, we used computer-aided drug design approaches to identify novel and potent inhibitors of BChE. Pharmacophore studies are more cost-effective than experimental chemical screening of large databases. A 3D pharmacophore model was generated for BChE based on a series of well-known inhibitors. The best quantitative model was used as a 3D query for virtual screening of chemical databases to discover novel hit compounds. The virtual screening results revealed a small subset of database compounds that were promising potential hit compounds for BChE inhibition. The hits were subsequently filtered by Lipinski's Rule of Five, ADME (absorption, distribution, metabolism, and excretion) properties, and molecular docking. Finally, density functional theory (DFT) was used to calculate the orbital energy value and energy gap for the molecules screened by docking.

Computational methods

Pharmacophore modeling is one of the most frequently used and valuable methods to discover novel scaffolds for various targets.

Selection of compounds

To construct the BChE data set, 71 compounds were collected with their corresponding reported inhibitory activity values (IC₅₀) which were tested using the same bioassay technique from various publications^{28,29,30,31,32}. The BChE data set was divided into two sets: training and test sets that contained 26 and 45 compounds, respectively. The training set was prepared based on the following criteria: (i) a minimum of 16 diverse compounds were selected to avoid any chance correlation; (ii) the activity data should have a range of 4–5 orders of magnitude; (iii) the compounds should be selected to provide clear, concise information to avoid redundancy or bias in terms of both structural features and activity range; (iv) the most active compounds should be included so that they provide information on the most critical features required for a reliable/rational pharmacophore model; and (v) the inclusion of any compound known to be inactive due to steric hindrance must be avoided. The training set was used to build the quantitative hypothesis based on principles of structural diversity and IC₅₀ values that spanned a wide activity range, from 3.6 nmol/L to 11000 nmol/L (Figure 1). The test set was used to evaluate the predictive ability of the generated pharmacophore model. Both the training and test set compounds were classified into three categories based on their activity values. The compounds with IC₅₀ values less than or equal to 100 nmol/L were considered to be highly active (+++), compounds with an activity range between 100 nmol/L and 10000 nmol/L were considered to be moderately active (++), and compounds with IC₅₀ values greater than or equal to 10000 nmol/L were set as low activity compounds (+). The 2D structures of the training and test set molecules were drawn using ChemSketch²⁴, and the structures were converted into their corresponding 3D form using DS.
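The three activity categories described above can be sketched as a small Python function. This is only an illustration of the thresholds stated in the text; the function name `activity_class` and the sample IC₅₀ values other than 3.6 and 11000 nmol/L are hypothetical.

```python
def activity_class(ic50_nmol_per_l: float) -> str:
    """Return the activity label for an IC50 value given in nmol/L."""
    if ic50_nmol_per_l <= 100:
        return "+++"  # highly active: IC50 <= 100 nmol/L
    if ic50_nmol_per_l < 10000:
        return "++"   # moderately active: between 100 and 10000 nmol/L
    return "+"        # low activity: IC50 >= 10000 nmol/L


# Illustrative values only; 3.6 and 11000 are the activity range quoted above.
for ic50 in (3.6, 100, 2500, 10000, 11000):
    print(ic50, activity_class(ic50))
```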

Pharmacophore modeling

Quantitative hypotheses were generated, and the best hypothesis was selected based on the models' ability to predict the biological activity of novel compounds from various chemical databases using Discovery Studio v2.5.5 (DS, www.accelrys.com, San Diego, CA, USA). There are generally two methods to generate molecular conformations: FAST and BEST. The FAST algorithm only considers existing conformers and interrupts a search as soon as a pharmacophore-matching conformation is found, whereas the BEST algorithm additionally “tweaks” bond distances, angles, and dihedral angles of pregenerated conformers on the fly to achieve the best matches. Herein, we used the BEST conformation method to generate multiple acceptable conformations for each compound present in the training and test sets with 20 kcal/mol as the energy cutoff³³. All default parameters were used to generate the pharmacophore, except that the default uncertainty value (3.0) was changed to 2.0³⁴. The uncertainty is the ratio of the reported activity value to the minimum and maximum values of its assumed range, and it must be greater than 1.0. The uncertainty value affects the categorization of ligands in the data set as either active or inactive compounds and is used during the constructive and subtractive phases. Here, an uncertainty value of 2.0 was more suitable for our data set because the compound activities spanned the requisite 4 orders of magnitude; this choice has been confirmed by evidence in the literature^{35,36}. The feature mapping/DS protocol was used to identify common features present in the active inhibitors of BChE. This protocol computes a maximum of 1000 possible pharmacophore feature mappings for the selected ligands. The selected features from the feature mapping were used as one of the key inputs for the 3D-QSAR pharmacophore generation module using the HypoGen algorithm.
The HypoGen algorithm further estimates the activity of each training set compound by computing regression analysis using parameters such as the relationship of geometric fit value versus the negative logarithm of the activity. While generating the quantitative model, a minimum of 0 to a maximum of 5 features were selected to build a series of hypotheses. Ten quantitative pharmacophore models were generated with corresponding statistical parameters such as cost values, root mean square (RMS), and fit values. The best quality hypothesis was selected based on cost values as defined by Debnath's methods³⁴.
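As a rough illustration of what the uncertainty parameter implies, the sketch below computes the activity window around a reported IC₅₀. It assumes the commonly described interpretation that an uncertainty of u places the true activity between IC₅₀/u and IC₅₀×u; that interpretation and the helper name `uncertainty_window` are assumptions, not taken from the text above.

```python
from typing import Tuple


def uncertainty_window(ic50: float, uncertainty: float = 2.0) -> Tuple[float, float]:
    """Return the (lower, upper) activity bounds implied by an uncertainty ratio.

    Assumption: the true activity lies between ic50 / uncertainty and
    ic50 * uncertainty. The ratio must be greater than 1.0.
    """
    if uncertainty <= 1.0:
        raise ValueError("uncertainty must be greater than 1.0")
    return ic50 / uncertainty, ic50 * uncertainty


# Most active training set compound (3.6 nmol/L), uncertainty 2.0 versus the default 3.0.
print(uncertainty_window(3.6, 2.0))
print(uncertainty_window(3.6, 3.0))
```

A smaller ratio narrows the window, so fewer compounds overlap with the most active one when ligands are sorted into active and inactive groups.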

Hypothesis validation

In general, pharmacophore models should be statistically significant, accurately predict the activity of molecules, and retrieve active compounds from databases. The best pharmacophore model was validated using three approaches: Fischer's randomization, a test set, and a decoy set³³. The main purpose of validating a quantitative pharmacophore model is to determine its capacity to identify active compounds, as well as its predictive ability for corresponding molecules. Fischer's randomization test was performed simultaneously with the original hypothesis generation and produced a number of random spreadsheets, depending on the selected significance level (90%, 95%, 98%, or 99%), by shuffling the activity values present in the training set. Here, a 95% significance level was selected. Nineteen random spreadsheets were produced by randomly shuffling the activity values of the training set compounds, and hypotheses were generated from them using the same chemical features and parameters used to develop the original hypothesis. The test set was used to check whether the best hypothesis was able to select molecules with orders of magnitude of activity similar to those of the active training set compounds, and the decoy set was used to determine how well the hypothesis could differentiate potential BChE inhibitors from other compounds. The test set consisted of chemical compounds structurally distinct from those in the training set, to ascertain the breadth of pharmacophore predictability. The decoy set was prepared by calculating the 1D property of 25 active inhibitors of BChE and 2075 inactive or unknown molecules. The enrichment factor (EF) and goodness of fit score (GF) were calculated using the following equations:

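$$\mathrm{EF} = \frac{H_a / H_t}{A / D}$$

$$\mathrm{GF} = \left(\frac{H_a\,(3A + H_t)}{4\,H_t\,A}\right)\left(1 - \frac{H_t - H_a}{D - A}\right)$$

where H_a is the total number of active compounds in the hit list, H_t is the number of hits retrieved from the database, A is the total number of active compounds in the database, and D is the total number of molecules in the database.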
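The enrichment factor and goodness of fit score are commonly computed from these four counts as shown in the sketch below. The function names are hypothetical. A = 25 and D = 2100 follow from the decoy set described above (25 actives plus 2075 other molecules), H_a = 19 follows from the 76% retrieval of actives reported in the decoy set validation, and H_t = 21 is an assumed hit list size chosen only for illustration.

```python
def enrichment_factor(ha: int, ht: int, a: int, d: int) -> float:
    """EF = (Ha / Ht) / (A / D): enrichment of actives in the hit list."""
    return (ha / ht) / (a / d)


def goodness_of_fit(ha: int, ht: int, a: int, d: int) -> float:
    """GF = [Ha (3A + Ht) / (4 Ht A)] * [1 - (Ht - Ha) / (D - A)]."""
    yield_term = ha * (3 * a + ht) / (4 * ht * a)
    false_positive_penalty = 1 - (ht - ha) / (d - a)
    return yield_term * false_positive_penalty


# A and D come from the decoy set description; Ht = 21 is an assumed value.
A, D = 25, 2100
HA, HT = 19, 21
print(enrichment_factor(HA, HT, A, D))
print(goodness_of_fit(HA, HT, A, D))
```

A hit list that contains exactly the actives and nothing else gives GF = 1, which is the upper bound of the score.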

Virtual screening

Pharmacophore-based database searching was used to find potential hit compounds that could repress or trigger BChE activity. The generated, well-validated hypothesis was used as a 3D structural query in the virtual screening of databases such as Maybridge and Chembridge to retrieve novel scaffolds for BChE inhibition. The Fast Flexible search method from Ligand Pharmacophore Mapping/DS was applied to retrieve hits that satisfy the chemical moiety requirements and spatially map with corresponding features in the pharmacophoric query³⁷.

Drug likeness filtration

Poor pharmacokinetic properties are one of the main causes for the termination of a compound's entry into or progression along the drug development pipeline. Medicinal chemists need compounds with good pharmacokinetic properties; thus, all of the hit compounds obtained from database searching were filtered by applying ADMET descriptors and the Rule of Five developed by Lipinski³⁸. The ADMET descriptors were calculated to check whether the compounds are able to cross the blood-brain barrier (BBB) and have good solubility, good human intestinal absorption (HIA), and low toxicity. Here, we mainly focused on oral bioavailability, low or no hepatotoxicity, and the capacity to penetrate the BBB, which is a key decision filter for central nervous system drug discovery. The compounds that satisfied the abovementioned properties were selected for molecular docking studies. Lipinski's Rule of Five states that clogP≤5, molecular weight≤500, number of hydrogen bond acceptors≤10, and number of hydrogen bond donors≤5. Compounds violating more than one of these rules may have problems with bioavailability; therefore, these parameters were calculated by DS to eliminate compounds that did not pass the above criteria.
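The Rule of Five filter described above can be sketched in a few lines of Python. This is not the DS implementation; the `Compound` record, the function names, and the two example hits are hypothetical, and the tolerance of one violation is an assumption based on the statement that compounds violating more than one rule may have bioavailability problems.

```python
from dataclasses import dataclass


@dataclass
class Compound:
    name: str            # hypothetical identifier
    clogp: float         # calculated logP
    mol_weight: float    # molecular weight
    hb_acceptors: int    # number of hydrogen bond acceptors
    hb_donors: int       # number of hydrogen bond donors


def lipinski_violations(c: Compound) -> int:
    """Count how many of the four Rule of Five limits a compound exceeds."""
    checks = [
        c.clogp > 5,
        c.mol_weight > 500,
        c.hb_acceptors > 10,
        c.hb_donors > 5,
    ]
    return sum(checks)


def passes_rule_of_five(c: Compound, max_violations: int = 1) -> bool:
    """Keep compounds with at most max_violations violations (assumed tolerance)."""
    return lipinski_violations(c) <= max_violations


# Made-up descriptor values for illustration only.
hits = [
    Compound("hit_A", 3.2, 412.5, 6, 2),
    Compound("hit_B", 5.8, 545.0, 7, 1),
]
kept = [c.name for c in hits if passes_rule_of_five(c)]
print(kept)
```

Setting `max_violations=0` gives the stricter reading in which every limit must be satisfied.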

Molecular docking

Molecular docking generates a score for each compound based on the binding affinities of protein-ligand complexes. Molecular docking was used to identify the small molecules that were able to fit well into the binding site of BChE. LigandFit³⁹ was used to execute the molecular docking studies and to determine the accurate orientation of ligands in the protein active site. LigandFit operates in three stages: (i) docking, in which an attempt is made to dock a ligand into a user-defined binding site; (ii) in situ ligand minimization; and (iii) scoring, in which various scoring functions are calculated for each pose of the ligands. The 3D crystal structure of BChE (PDB code: 1P0I; resolution: 2 Å) was downloaded from the Protein Data Bank (PDB, www.rcsb.org). The protein was prepared by adding hydrogen atoms and applying the CHARMm force field⁴⁰ using the Molecular simulation module. After protein preparation, the active site of the protein must be identified before docking the small molecules. The active site of the protein can be represented as a binding site, essentially as a set of points on a grid that lie in a cavity. Two methods are available to define the binding site for the protein: (i) based on the receptor shape using the “eraser” algorithm and (ii) based on the volume occupied by a known ligand already positioned in the active site. For this study, we preferred the second method to find the active cavity of BChE. Initially, the docking parameters were validated by docking the co-crystal molecule into the active site of BChE. The hit molecules from the virtual screening process and 5 active inhibitors were docked in the active site of BChE to find the most suitable orientation and compound binding ability.
During the docking process, the top 10 conformations were generated for each ligand based on the docking score after energy minimization using the smart minimizer method, which begins with the steepest descent method followed by the conjugate gradient method. The docked poses were validated by the hydrogen bond interactions between the candidate molecules and active site residues.

Density functional theory

The main aim of the orbital energy calculations was to provide valuable information about the electrostatic properties of the BChE inhibitors. DFT is a successful and promising approach adopted by quantum chemists in the quantum mechanical simulation of periodic systems⁴¹. There is substantial evidence that DFT provides an accurate description of the electronic and structural properties of small molecules by computing the electronic structure of matter. The selected docked poses of the hit compounds from the molecular docking studies were used as input for the DFT calculations instead of the compounds' bioactive conformations. Because the docking results showed suitable binding orientations of the hit compounds, these poses were suitable for calculating orbital energies such as HOMO and LUMO using DS. Calculating the orbital energies using B3LYP provided information regarding the capacity of the molecules to transfer their energies from a HOMO, which can act as an electron donor, to a LUMO, which can act as an electron acceptor. These electrostatic property calculations could provide useful information for designing novel BChE inhibitors.
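Once HOMO and LUMO energies are available, the energy gap and a ranking by HOMO value (the first selection step mentioned in the abstract) are simple to compute. The sketch below assumes energies in eV and uses made-up values and hypothetical compound names; it does not perform any DFT calculation itself.

```python
from typing import Dict, List, Tuple


def energy_gap(homo: float, lumo: float) -> float:
    """HOMO-LUMO gap, defined as E(LUMO) - E(HOMO)."""
    return lumo - homo


def rank_by_homo(orbitals: Dict[str, Tuple[float, float]], n: int) -> List[str]:
    """Return the names of the n compounds with the highest HOMO energies."""
    ordered = sorted(orbitals, key=lambda name: orbitals[name][0], reverse=True)
    return ordered[:n]


# Hypothetical (HOMO, LUMO) energies in eV, for illustration only.
orbitals_ev = {
    "hit_1": (-5.5, -1.5),
    "hit_2": (-6.25, -2.0),
    "hit_3": (-5.0, -1.75),
}
for name in rank_by_homo(orbitals_ev, 2):
    homo, lumo = orbitals_ev[name]
    print(name, energy_gap(homo, lumo))
```

The second selection step, culling by energy gap, is left out because the text does not state which gap values were preferred.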

Results and Discussion

A ligand-based pharmacophore method was used to elucidate the spatial arrangement of chemical features that were crucial for the interaction of structurally diverse and potent BChE inhibitors with their target protein. Ligand-based approaches reveal the important and common chemical features of diverse ligands, and these features can then be used as a 3D query in virtual screening of large chemical databases to identify novel hit compounds.

Pharmacophore model

The HypoGen algorithm was used to construct quantitative hypotheses that correlated the experimental and the predicted activity values of the inhibitors. At the end of each run, the top ten hypotheses were generated based on a set of 26 chemically diverse inhibitors of BChE (Figure 1), and the statistical parameter values such as cost, correlation (r), and RMS for each hypothesis are shown in Table 1.

Table 1 Information of statistical significance and predictive power presented in cost values measured in bits for the top 10 hypotheses as a result of automated 3D QSAR pharmacophore generation.


Among the ten hypotheses, nine contained 1 hydrogen bond acceptor (HBA) and 1 hydrophobic aliphatic (Hy-Ali) group, which indicates that these chemical features are necessary for BChE inhibition. Out of the 10 hypotheses, only 3 were selected for further processing based on the maximum fit value (greater than 9). Debnath's analysis⁴², used to select the best hypothesis, states that the best pharmacophore model should have the highest cost difference, a good correlation coefficient, the lowest RMS, and the lowest total cost. The cost difference is the difference between the null cost and the total cost of a hypothesis. A 40–60 bit difference leads to a predictive correlation probability of 75%–90%, and if the difference is greater than 60 bits, the hypothesis is assumed to have a correlation probability of greater than 90%³¹. Hypo1 showed the highest cost difference of 120.12 bits, compared with Hypo4 and Hypo5, indicating its significance. The correlation coefficient is based on linear regression derived from the geometric fit index; Hypo1 showed the highest correlation coefficient (0.96), demonstrating its high predictive ability. The RMS factor represents the deviation of the predicted activity value from the experimental value, and the RMS values were 1.02, 1.23, and 1.24 for Hypo1, Hypo4, and Hypo5, respectively. This result also supports the conclusion that Hypo1 was significant when compared with the two other hypotheses. The reliability of a pharmacophore model also depends on whether the total cost value is distant from the null cost and close to the fixed cost. The fixed cost represents a simple model that fits all data perfectly, while the null cost presumes that there is no relationship in the data and that the experimental activities are normally distributed around their average value. The fixed and total cost values of Hypo1 were 94.82 and 108.57, respectively.
Thus, Hypo1, which consisted of 2 HBA, 1 Hy-Ali, and 1 hydrophobic aromatic (Hy-Ar), was selected as the best hypothesis and was employed for further analyses. The chemical features and 3D spatial arrangement of Hypo1 are depicted in Figure 2.
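The cost-based reasoning above can be summarized in a short sketch. The total cost (108.57), fixed cost (94.82), and cost difference (120.12 bits) are taken from the text; the null cost of 228.69 is not stated and is derived here as total cost plus cost difference. The function names and the label returned for differences below 40 bits are assumptions.

```python
def cost_difference(null_cost: float, total_cost: float) -> float:
    """Cost difference in bits: null cost minus total cost."""
    return null_cost - total_cost


def correlation_probability(diff_bits: float) -> str:
    """Map a cost difference to the probability bands quoted in the text."""
    if diff_bits > 60:
        return "> 90%"
    if diff_bits >= 40:
        return "75%-90%"
    return "not stated in the text"  # below 40 bits no band is given


TOTAL_COST = 108.57
FIXED_COST = 94.82
NULL_COST = 228.69  # derived: 108.57 + 120.12, not quoted directly

diff = cost_difference(NULL_COST, TOTAL_COST)
print(round(diff, 2), correlation_probability(diff))
# Distance of the total cost from the fixed cost (smaller is better).
print(round(TOTAL_COST - FIXED_COST, 2))
```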

Hypo1 was used to estimate the inhibitory activities of the 26 training set compounds to assess its predictive accuracy. Hypo1 was able to predict the inhibitory activity values of the 26 training set compounds in the same order of magnitude (Table 2). One moderately active and two inactive compounds were underestimated and overestimated as inactive and moderately active, respectively. All of the active compounds were predicted in their own activity ranges, indicating the predictive ability of Hypo1. Hypo1 aligned with the most active compound 1 (IC₅₀: 3.6 nmol/L) and least active compound 26 (IC₅₀: 11 400 nmol/L) in the training set (Figure 3). From this analysis, we suggest that Hypo1 was able to estimate the activity of compounds to a high degree of accuracy relative to their experimental IC₅₀ values (Table 2). The error value was defined as the ratio between the experimental and predicted activity values, and Hypo1 demonstrated remarkable consistency. The best pharmacophore model, Hypo1, was validated by various methods such as Fischer's randomization, a test set, and a decoy set to demonstrate its robustness and statistical significance.

Table 2 Actual and estimated activity of the training set molecules based on the pharmacophore model Hypo1.


Validation of the pharmacophore model

Fischer's randomization test

Fischer's test was applied to evaluate the statistical significance of Hypo1. A confidence level of 95% was chosen, and a total of 19 random spreadsheets were generated to produce random hypotheses. The significance of the hypothesis was calculated using the formula S=[1−(1+X)/Y]×100, where X is the total number of hypotheses having a total cost lower than the original hypothesis, and Y is the total number of HypoGen runs (initial+random runs). Here, X=0 and Y=(1+19), hence 95%={1−[(1+0)/(19+1)]}×100. The total costs of the 19 random pharmacophore models compared with that of Hypo1 showed that the original hypothesis was far superior to the 19 other hypotheses, which indicated that Hypo1 was not generated by chance (Figure 4). This result provided confidence that Hypo1 could be the best hypothesis, containing all the chemical features necessary to inhibit BChE activity.
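The significance formula above is easy to check numerically. In the sketch below the function name and argument names are hypothetical; the expression is rearranged to 100·(Y−(1+X))/Y, which is algebraically identical to S=[1−(1+X)/Y]×100 and avoids floating-point rounding for whole-number inputs.

```python
def fischer_significance(x: int, random_runs: int) -> float:
    """Significance S = [1 - (1 + X) / Y] * 100, with Y = 1 + random runs.

    x is the number of random hypotheses whose total cost is lower than
    that of the original hypothesis.
    """
    y = random_runs + 1
    return 100.0 * (y - (1 + x)) / y


# Values from the text: X = 0 and 19 random spreadsheets.
print(fischer_significance(0, 19))
```

If even one of the 19 random hypotheses had scored a lower total cost than Hypo1 (X=1), the same formula would give 90% instead of 95%.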

Test set validation

The test set contained 45 compounds that were structurally distinct from the training set molecules. The test set was used to examine the ability of Hypo1 to predict the activity of external compounds in the same activity range. Except for one active compound that was underestimated as moderately active, all of the remaining compounds were predicted within their own activity ranges by Hypo1 (Table 3). Hypo1 showed a strong correlation coefficient of 0.94 between the experimental and predicted BChE inhibitory activity values for the test set (Figure 5). This result showed that Hypo1 fit not only the training set compounds but also the external compounds, and it demonstrated the ability of Hypo1 to differentiate active from inactive BChE inhibitors.

Table 3 Experimental and predicted IC₅₀ values of 45 test set molecules against Hypo1.


Decoy set validation

As a final validation, decoy set screening was performed using the Best Flexible searching module/DS. To determine the robustness of Hypo1, four parameters were calculated: false positives, false negatives, enrichment factor (EF), and goodness of fit score (GF). EF and GF were calculated using the following set of parameters: hit list size (H_t), percent yield of actives (%Y), percent ratio of actives in the hit list (%A), false negatives, and false positives (Table 4). Hypo1 succeeded in retrieving 76% of the active compounds from the decoy set. It predicted 6 active compounds to be inactive (false negatives). Hypo1 showed a GF score of 0.86, indicating that Hypo1 had a strong tendency to retrieve true positives. On the basis of the overall validations, we were confident that Hypo1 demonstrated excellent prediction of BChE inhibitor activities.
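The counts behind these statistics can be cross-checked with a few lines of Python. The sketch assumes the usual definitions %Y = H_a/H_t×100 and %A = H_a/A×100. A = 25 comes from the decoy set description and H_a = 19 follows from the reported 76% retrieval; the hit list size H_t = 21 is a placeholder because Table 4 is not reproduced here, so the false positive count and %Y printed below are illustrative only.

```python
from typing import Dict


def decoy_summary(ha: int, ht: int, a: int) -> Dict[str, float]:
    """Summarize a decoy screen from active hits (ha), hits (ht), and actives (a)."""
    return {
        "false_negatives": a - ha,                    # actives that were missed
        "false_positives": ht - ha,                   # hits that are not actives
        "percent_yield_of_actives": 100.0 * ha / ht,  # %Y
        "percent_ratio_of_actives": 100.0 * ha / a,   # %A
    }


# ha and a follow from the text; ht = 21 is an assumed placeholder.
summary = decoy_summary(ha=19, ht=21, a=25)
for key, value in summary.items():
    print(key, value)
```

With 19 of 25 actives retrieved, the sketch reproduces the 6 false negatives and the 76% ratio of actives stated above.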

Table 4 Statistical parameters from screening test set molecules.
