Introduction

Cholinesterases (ChEs) are involved in the degradation of choline and show similarity in protein sequence but differences in their kinetic properties. On the basis of their substrate and inhibitor specificities, cholinesterases are divided into two subfamilies: acetylcholinesterases (AChEs; EC 3.1.1.7) and butyrylcholinesterases (BChEs; EC 3.1.1.8). AChE is predominantly present in the central and peripheral nervous system, as well as in muscles. In muscles, AChE terminates impulse transmission by the rapid hydrolysis of acetylcholine to acetic acid and choline1. BChE is primarily synthesized in the liver and secreted into plasma, and it is responsible for the hydrolysis of a variety of choline (hydrophilic and hydrophobic) and non-choline esters2. BChE plays a key role in cholinergic synapses by terminating acetylcholine action, although the complete physiological function of BChE remains unclear3. Both cholinesterase enzymes belong to the super family of α/β-hydrolase fold proteins4. Both AChE and BChE exist as multimers of catalytic subunits in globular forms such as G1, G2, and G4 that contain one, two, or four subunits, respectively. The hydrolysis of substrates by both enzymes proceeds through a transacylation step involving nucleophilic and general acid-base elements5. BChE acts as a scavenger protein that protects the cholinergic system against anticholinesterase poisons. BChE is the sole carboxylesterase6,7 with recognized toxicological and pharmacological importance in scavenging and detoxification of numerous ester-containing drugs, pro-drugs8,9, and poisonous carbamyl- and phosphoryl-esters, including nerve agents10,11.

Currently, BChE is emerging as an important pharmacological target in Alzheimer's disease (AD) therapy12. A 40%–90% increase in BChE expression and activity have been found in AD brain neuronal plaques13. BChE is capable of compensating for reduced AChE catalytic functions in the synaptic cleft14,15 and shows significantly increased activity (30%–60%) during the time course of AD16,17. Hence, in recent years, many scientists and researchers have shown keen interest in designing small molecules that can inhibit BChE activity 18,19,20,21,22,23. However, there is also increasing evidence of BChE's involvement in non-cholinergic functions such as cell differentiation24, neurogenesis, and the formation of amyloid plaques in AD25,26,27.

In this work, we used computer-aided drug design approaches to identify novel and potent inhibitors of BChE. Pharmacophore studies are more cost-effective than experimental chemical screening of large databases. A 3D pharmacophore model was generated for BChE based on a series of well-known inhibitors. The best quantitative model was used as a 3D query for virtual screening of chemical databases to discover novel hit compounds. The virtual screening results revealed a small subset of database compounds that were promising potential hit compounds for BChE inhibition. The hits were subsequently filtered by Lipinski's Rule of Five, ADME (absorption, distribution, metabolism, and excretion) properties, and molecular docking. Finally, density functional theory (DFT) was used to calculate the orbital energy value and energy gap for the molecules screened by docking.

Computational methods

Pharmacophore modeling is one of the most frequently used and valuable methods to discover novel scaffolds for various targets.

Selection of compounds

To construct the BChE data set, 71 compounds were collected with their corresponding reported inhibitory activity values (IC50) which were tested using the same bioassay technique from various publications28,29,30,31,32. The BChE data set was divided into two sets: training and test sets that contained 26 and 45 compounds, respectively. The training set was prepared based on the following criteria: (i) a minimum of 16 diverse compounds were selected to avoid any chance correlation; (ii) the activity data should have a range of 4–5 orders of magnitude; (iii) the compounds should be selected to provide clear, concise information to avoid redundancy or bias in terms of both structural features and activity range; (iv) the most active compounds should be included so that they provide information on the most critical features required for a reliable/rational pharmacophore model; and (v) the inclusion of any compound known to be inactive due to steric hindrance must be avoided. The training set was used to build the quantitative hypothesis based on principles of structural diversity and IC50 values that spanned a wide activity range, from 3.6 nmol/L to 11000 nmol/L (Figure 1). The test set was used to evaluate the predictive ability of the generated pharmacophore model. Both the training and test set compounds were classified into three categories based on their activity values. The compounds with IC50 values less than or equal to 100 nmol/L were considered to be highly active (+++), compounds with an activity range between 100 nmol/L and 10000 nmol/L were considered to be moderately active (++), and compounds with IC50 values greater than or equal to 10000 nmol/L were set as low activity compounds (+). The 2D structures of the training and test set molecules were drawn using ChemSketch24, and the structures were converted into their corresponding 3D form using DS.

Figure 1
figure 1

Set of 26 chemically diverse compounds used as the training set in 3D-QSAR discovery studio/pharmacophore generation. IC50 values, in nmol/L, are indicated in parentheses for each compound.

Pharmacophore modeling

Quantitative hypotheses were generated, and the best hypothesis was selected based on the models' ability to predict the biological activity of novel compounds from various chemical databases using Discovery Studio v2.5.5 (DS, www.accelrys.com, San Diego, CA, USA). There are generally two methods to generate molecular conformation: FAST and BEST. The FAST algorithm only considers existing conformers and interrupts a search as soon as a pharmacophore matching conformation is found, whereas the BEST algorithm additionally “tweaks” bond distances, angles, and dihedral angles of pregenerated conformers on the fly to achieve the best matches. Herein, we used the BEST conformation method to generate multiple acceptable conformations for each compound present in the training and test sets with 20 kcal/mol as the energy cutoff33. All default parameters were used to generate the pharmacophore, except the uncertainty default value (3.0) was changed to 2.034. The uncertainty is the ratio of the reported activity value relative to the minimum, and the maximum values must be greater than 1.0. The uncertainty value affects the categorization of ligands in the data set as either active or inactive compounds and is used during the constructive and subtractive phases. Here, an uncertainty value of 2.0 was more suitable for our data set because the compound activities spanned the requisite 4 orders of magnitude; this choice has been confirmed by evidence in the literature35,36. The feature mapping/DS protocol was used to identify common features present in the active inhibitors of BChE. This protocol computes a maximum of 1000 possible pharmacophore features mappings for the selected ligands. The selected features from the feature mapping were used as one of the key inputs for the 3D-QSAR pharmacophore generation module using a HypoGen algorithm. The HypoGen algorithm further estimates the activity of each training set compound by computing regression analysis using parameters such as the relationship of geometric fit value versus the negative logarithm of the activity. While generating the quantitative model, a minimum of 0 to a maximum of 5 features were selected to build a series of hypotheses. Ten quantitative pharmacophore models were generated with corresponding statistical parameters such as cost values, root mean square (RMS), and fit values. The best quality hypothesis was selected based on cost values as defined by Debnath's methods34.

Hypothesis validation

In general, pharmacophore models should be statistically significant, accurately predict the activity of molecules, and retrieve active compounds from databases. The best pharmacophore model was validated using various potent approaches such as Fischer's randomization, test set, and decoy set33. The main purpose of validating a quantitative pharmacophore model is to determine its capacity to identify active compounds, as well as its predictive ability for corresponding molecules. Fischer's randomization test was performed simultaneously during the original hypotheses generation and produced a number of random spreadsheets depending on the selected significance level (90%, 95%, 98%, and 99%) by shuffling the activity values present in the training set. Here, a 95% significance level was selected. Nineteen random spreadsheets were produced by randomly shuffling the activity value of the training set compounds, and the test generated hypotheses using the same chemical features and parameters used to develop the original hypothesis. Test and decoy sets were used to check whether the best hypothesis was able to select molecules with orders of magnitude of activity similar to that of the active training set and to determine how well the model hypothesis could differentiate potential BChE inhibitors from other compounds, respectively. The test set consisted of structurally diverse chemical compounds from the training set to ascertain the broadness of pharmacophore predictability. The decoy set was prepared by calculating the 1D property of 25 active inhibitors of BChE and 2075 inactive or unknown molecules. EF and GF were calculated using the following equations:

where Ha is the total number of active compounds in the hit list, Ht is the number of hits retrieved from the database, A is the total number of active compounds in the database, and D is the total number of molecules in the database.

Virtual screening

Pharmacophore-based database searching was used to find potential hit compounds that could repress or trigger BChE activity. The generated, well-validated hypothesis was used as a 3D structural query in the virtual screening of databases such as Maybridge and Chembridge to retrieve novel scaffolds for BChE inhibition. The Fast Flexible search method from Ligand Pharmacophore Mapping/DS was applied to retrieve hits that satisfy the chemical moiety requirements and spatially map with corresponding features in the pharmacophoric query37.

Drug likeness filtration

Poor pharmacokinetic properties are one of the main causes for the termination of a compound's entry or progression along the drug development pipeline. The medicinal chemist needs compounds with good pharmacokinetic properties; thus, all of the hit compounds obtained from database searching were filtered by applying ADME and the Rule of Five developed by Lipinski38. To obtain compounds with good pharmacokinetic properties, ADMET descriptors were calculated. ADMET was applied to check whether the compounds are able to cross the blood-brain barrier (BBB) and have good solubility, human intestinal absorption (HIA), and low toxicity. Here, we mainly focused on oral bioavailability, low or no hepatotoxicity, and the capacity to penetrate the BBB, which is a key decision filter for central nervous system drug discovery. The compounds that satisfied the abovementioned properties were selected for molecular docking studies. Lipinski's rule of 5 states that clogP≤5, molecular weight≤500, and number of hydrogen bond acceptors≤10 and donors≤5. Compounds violating more than one of these rules may have problems with bioavailability, therefore these parameters were calculated by DS to eliminate compounds that did not pass the above criterias.

Molecular docking

Molecular docking generates a score for each compound based on the binding affinities of protein-ligand complexes. Molecular docking was used to identify the small molecules that were able to fit well into the binding site of BChE proteins. LigandFit39 was used to execute the molecular docking studies and to determine the accurate orientation of ligands in protein active sites. The LigandFit module was classified into three stages: (i) docking, when an attempt was made to dock a ligand into a user defined binding site; (ii) in situ ligand minimization; and (iii) scoring, when various scoring functions were calculated for each pose of the ligands. The 3D crystal structure of BChE (PDB code: 1P0I) was downloaded from Protein Data Bank (PDB, www.rcsb.org) with good resolution (2 Å). The protein was prepared by adding the hydrogen atoms by applying CHARMm force field40 using the Molecular simulation module. After protein preparation, the active site of the protein must be identified before docking the small molecules. The active site of the protein can be represented as a binding site, essentially as a set of points on a grid that lie in a cavity. Two methods were applied to define the binding site for the protein: (i) based on the receptor shape using “eraser” algorithm and (ii) volume occupied by the known ligand position already in an active site. For this study, we preferred the second method to find the active cavity of BChE. Initially, the docking parameters were validated by docking the co-crystal molecule into the active site of BChE. The hit molecules from the virtual screening process and 5 active inhibitors were docked in the active site of BChE to find the most suitable orientation and compound binding ability. During the docking process, the top 10 conformations were generated for each ligand based on the docking score after energy minimization using the smart minimizer method, which begins with the steepest descent method followed by the conjugate gradient method. The docked poses were validated by the hydrogen bond interactions between the candidate molecules and active site residues.

Density functional theory

The main aim of the orbital energies calculation was to provide valuable information about the electrostatic properties of the BChE inhibitors. DFT is a successful and promising approach adopted by quantum chemists in the quantum mechanical simulation of periodic systems41. There is substantial evidence that DFT provides an accurate description of the electronic and structural properties of small molecules by computing the electronic structure of matter. The selected docked poses of the hit compounds from the molecular docking studies were used as input for the DFT instead of the compounds' bioactive conformations. Because the docking results showed the suitable binding orientation of hit compounds, it was suitable for calculating the orbital energies such as HOMO and LUMO using DS. Calculating the orbital energy using B3LYP provided information regarding the capacity of the molecules to transfer their energies from a HOMO, which can act as an electron donor, to a LUMO, which can act as an electron acceptor. These electrostatic property calculations could provide useful information for designing novel BChE inhibitors.

Results and Discussion

A ligand-based pharmacophore method was used to elucidate the spatial arrangement of chemical features that were crucial for the interaction of structurally diverse and potent BChE inhibitors with their target protein. Ligand-based approaches reveal the important and common chemical features of diverse ligands, and these features can then be used as 3D query in virtual screenings of large chemical databases to identify novel hit compounds.

Pharmacophore model

The HypoGen algorithm was used to construct quantitative hypotheses that correlated the experimental and the predicted activity values of the inhibitors. At the end of each run, the top ten hypotheses were generated based on a set of 26 chemically diverse inhibitors of BChE (Figure 1), and the statistical parameters values such as cost, correlation (r), and RMS for each hypothesis are shown in Table 1.

Table 1 Information of statistical significance and predictive power presented in cost values measured in bits for the top 10 hypotheses as a result of automated 3D QSAR pharmacophore generation.

Among the ten hypotheses, nine hypotheses contained 1 hydrogen bond acceptor (HBA) and 1 hydrophobic aliphatic (Hy-Ali) group, which indicates that these chemical features are necessary for BChE inhibition. Out of the 10 hypotheses, only 3 hypotheses were selected for further processing based on the maximum fit value (greater than 9). Debnath's analysis42, used to select the best hypothesis, states that the best pharmacophore model should have the highest cost difference, good correlation coefficient, least RMS, and lowest total cost values. Cost differences represent the difference between the null and total cost of hypothesis. A 40–60 bit difference leads to a predictive correlation probability of 75%–90%, and if the difference is greater than 60 bits, the hypothesis is assumed to have a correlation probability of greater than 90%31. Hypo1 showed the highest cost difference of 120.12 bits, compared with Hypo4 and Hypo5, indicating its significance. The correlation coefficient is based on linear regression derived from the geometric fit index; Hypo1 showed the highest correlation coefficient (0.96), demonstrating its high predictive ability. The RMS factor represents the deviation of the predicted activity value from the experimental value, and the RMS values were 1.02, 1.23, and 1.24 for Hypo1, Hypo4, and Hypo5, respectively. This result also supports the conclusion that Hypo1 was significant when compared with the two other hypotheses. The reliability of a pharmacophore model also depends on whether the total cost value is distant from the null cost and close to the fixed cost. The fixed cost represents a simple model that fits all data perfectly, while the null cost presumes that there is no relationship in the data and that the experimental activities are normally distributed around their average value. The fixed and total cost values of Hypo1 were 94.82 and 108.57, respectively. Thus, Hypo1, which consisted of 2 HBA, 1 Hy-Ali, and 1 hydrophobic aromatic (Hy-Ar), was selected as the best hypothesis and was employed for further analyses. The chemical features and 3D spatial arrangement of Hypo1 are depicted in Figure 2.

Figure 2
figure 2

The Hypo1 pharmacophore model and its geometric parameters, where HBA: hydrogen bond acceptor; Hy-Ali: hydrophobic aliphatic; and Hy-Ar: hydrophobic aromatic.

Hypo1 was used to estimate the inhibitory activities of 26 training set compounds to elucidate its predictive accuracy. Hypo1 was able to predict the inhibitory activity value of the 26 training set compounds in the same order of magnitude (Table 2). One moderately active and two inactive compounds were underestimated and overestimated as inactive and moderately active, respectively. All of the active compounds were predicted in their own activity ranges, indicating the predictive ability of Hypo1. Hypo1 aligned with the most active compound 1 (IC50: 3.6 nmol/L) and least active compound 26 (IC50: 11 400 nmol/L) in the training set (Figure 3). From this analysis, we suggest that Hypo1 was able to estimate the activity of compounds to a high degree of accuracy relative to their experimental IC50 values (Table 2). The error value was defined as the ratio between experimental and predicted activity value, and Hypo1 demonstrated remarkable consistency. The best pharmacophore model, Hypo1, was validated by various methods such as Fisher's randomization, a test set, and a decoy set to demonstrate its robustness and statistical significance.

Table 2 Actual and estimated activity of the training set molecules based on the pharmacophore model Hypo1.
Figure 3
figure 3

The best pharmacophore model Hypo1 aligned to training set compounds: A) active compound 1 (IC50 3.6 nmol/L) and B) low activity compound 26 (IC50 11 400 nmol/L). The pharmacophore features are color coded (HBA, hydrogen bond acceptor: green; Hy-Ali, hydrophobic aliphatic: light cyan; Hy-Ar, hydrophobic aromatic: cyan).

Validation of the pharmacophore model

Fischer's randomization test

Fischer's test was applied to evaluate the significance of Hypo1 based on statistical validation. A confidence level of 95% was chosen, and a total of 19 random spreadsheets were generated to produce the hypothesis. The significance of the hypothesis was calculated using the formula S=[1−(1+X)/Y]×100, where X is the total number of hypotheses having a total cost lower than the original hypothesis, and Y is the total number of HypoGen runs (initial+random runs). Here, X=0 and Y=(1+19), hence 95%={1−[(1+0)/(19+1)]}×100. The total cost of 19 random pharmacophore models compared with Hypo1 showed that the original hypothesis was far superior to the 19 other hypotheses, which indicated that the Hypo1 was not generated by chance (Figure 4). This result provided confidence that the Hypo1 could be a best hypothesis that contains all the necessary chemical features to inhibit BChE activity.

Figure 4
figure 4

The difference in cost between 19 scrambled runs. The 95% confidence level was selected.

Test set validation

The test set contains 45 structurally distinct compounds from training set molecules. The test set was used to examine the ability of Hypo1 to predict the activity of external compounds in the same activity range. Except for one active compound that was underestimated as moderately active, all of the remaining compounds are predicted on their own activity range by Hypo1 (Table 3). Hypo1 shows the strong correlation coefficient of 0.94 between experimental and predicted BChE inhibitory activity values for the test set (Figure 5). This result also showed that Hypo1 fit not only for the training set compounds but also for the external compounds; this result also demonstrated the predictive ability of Hypo1 to differentiate the active and inactive BChE inhibitors.

Table 3 Experimental and predicted IC50 values of 45 test set molecules against Hypo1.
Figure 5
figure 5

Plot of the correlation between the experimental activity and the activity predicted by Hypo1 for the test set molecules (in brown) and training set molecules (in blue).

Decoy set validation

As a final validation, decoy set screening was performed using the Best Flexible searching module/DS. To determine the robustness of Hypo1, four parameters were calculated: false positives, false negatives, enrichment factor (EF), and goodness of fit score (GF). EF and GF were calculated using the following set of parameters: hit lists (Ht), number of active percent yields (%Y), percent ratio of actives in the hit lists (%A), false negatives, and false positives (Table 4). Hypo1 succeeded in the retrieval of 76% of the active compounds from the decoy set. It predicted 6 active compounds to be inactive compounds (false negatives). Hypo1 showed a GH score of 0.86, indicating that Hypo1 had a greater tendency to show true positives. On the basis of the overall validations, we were strongly assured that the Hypo1 demonstrated excellent prediction of BChE inhibitor activities.

Table 4 Statistical parameter from screening test set molecules.

Virtual screening

The validated hypothesis, Hypo1, was used as a 3D structural query for retrieving novel candidate molecules from the Maybridge (60 000) and Chembridge (50 000) chemical databases. A total of 45 496 hit compounds were obtained from the first screening; among these, 1619 compounds were selected for further analysis by applying maximum fit value of greater than 8. Subsequently, these molecules were tested for ADMET and Lipinski's Rule of Five. ADMET properties calculated the values of blood-brain barrier (BBB) penetration, solubility, cytochrome P450 (CYP450) 2D6 inhibition, hepatotoxicity, HIA, plasma protein binding (PPB), and assessed a broad range of ligand toxicity. The drug should not cross the BBB, hence the level '3' was selected to represent low penetration of BBB. The value of 0 in CYP26 and hepatotoxicity indicates that the molecules are of low toxicity. The cutoff values of solubility and HIA were 3 and 0, respectively. Out of the 1619 molecules, 202 molecules passed the BBB level, absorption, solubility, and toxicity criteria. These hit compounds were subjected to further filtering by applying Lipinski's Rule of Five, which states that LogP should be less than 5, the molecular weight less than 500, the number of hydrogen bond donors less than 5, the number of hydrogen bond acceptors less than 10, and the number of rotatable bonds less than 10. The flexibility of the molecules and the total number of hydrogen bond acceptors and hydrogen bond donors are important predictors that a compound will have good oral bioavailability. Ultimately, 84 molecules were selected as hits based on drug-like properties. The hit compounds from the virtual screening process were subjected to molecular docking to reduce the false positive rate.

Molecular docking

A BChE complex with butanoic acid from PDB was chosen as the target protein for molecular docking. The establishment and reorganization of specific covalent or non-covalent interactions between substrates or inhibitors play a crucial role in biological function. Three distinct domains in the active site confer selectivity of BChE inhibitors. The first domain is an acyl binding pocket that contains two residues (L286 and V288) responsible for the binding of larger substrates with acyl groups43. A second domain is found near the lip of the active site cavity, and a third domain is defined as the choline binding site (or cation-pi site). The catalytic domain of BChE is composed of nucleophilic serine, histidine, and glutamate residues. The substrate was stabilized between the oxyanion hole and acyl binding pocket of the “catalytic triad” composed of S198, E197, and H438 of the active esteratic site. The mechanism of catalysis depends on the charge relay system, in which the imidazole ring of H438 relays electrons from E197 to S198 and causes the hydroxyl oxygen of S198 to become a nucleophile44. A nucleophilic attack of this hydroxyl oxygen on the ester bond of the substrate leads to an acyl-enzyme intermediate and a free choline moiety. Then, the acyl group is hydrolyzed from S198 by the nucleophilic attack of a water molecule that is activated by taking a proton from H438 to form a catalytic triad.

Initially the co-crystal was docked in the active site of BChE to check whether the selected parameters are able to produce the most suitable binding orientation. The RMSD value of 0.79 Å was obtained when the best docked pose overlapped with the co-crystal, which revealed that the default parameters are valid to find the best orientation of BChE in the active site. The same parameters were therefore employed to dock the candidate compounds. The selected candidate molecules from the virtual screening were docked in the BChE active site. The top-ranked 84 compounds based on the docking score were selected as the best potential inhibitors and were manually validated for critical interaction with vital amino acids in the active site of BChE. Intermolecular hydrogen bonding was observed between active residues S198, E197, and H438 in the active site of BChE (Figure 6). From the 84 compounds, 33 candidate molecules showed hydrogen bond interactions with S198 and H438, as well as reliable hydrophobic interactions with Y323 and F329.

Figure 6
figure 6

(A) Chembridge (compound 30080), (B) Maybridge (KM_02281), and (C) Maybridge (SPB_07954). The candidate compounds are represented as green sticks. Hydrogen bonds are shown in black. For the clarity of the docked view the Y323 was not shown.

Density functional theory

The orbital energies such as HOMO (highest occupied molecular orbital) and LUMO (lowest unoccupied molecular orbital) were calculated for 7 (5 active, one moderate and one inactive inhibitor) training set compounds and for the 33 hits from molecular docking. HOMO and LUMO are responsible for the charge transfer in a chemical reaction45. The calculated orbital energies of the 33 hits and known inhibitors were compared to analyze the energy transfer and stability of small molecules in protein active site. Comparing the HOMO energy with the activity value of known BChE inhibitors shows an inverse correlation, indicating that the HOMO energy of the inhibitor may transfer its electrons to some critical residues in the active site of BChE. The highest energy value of the HOMO in the hit compounds implied the greatest likelihood of strong inhibition of BChE. Hence, correlation of the HOMO values of hits and the training set molecules showed that 10 hit compounds possessed greater values than the reported inhibitors (training set) of BChE (Table 5). A smaller energy gap (between the LUMO and HOMO) of the hit molecules illustrates that the molecule are more reactive46. The wide energy gap in the hit molecules is unfavorable for the electron to be excited from the HOMO to the LUMO, which consequently leads to a weak affinity of the inhibitor for BChE. Among the 10 compounds, 5 hit compounds (Figure 7) were selected based on their lowest energy gaps that suggested the molecules would be reactive. Table 5 clearly showed that the moderate and inactive compounds had high energy gaps that were not suitable for the reactivity of the molecules. The atomic orbital composition of the frontier molecular orbital for compound 30080 is shown in Figure 8. On the basis of the results above, we suggest that the hit compounds may possess equivalent or greater electronic properties compared with most active compounds and could be used to design novel classes of BChE inhibitors.

Table 5 Orbital energy value of hit leads and training set compounds.
Figure 7
figure 7

A 2D representation of the final 5 hit compounds.

Figure 8
figure 8

The orbital energy values and energy gap for compound 30080.

Conclusions

In this study, pharmacophore models were generated based on a series of known BChE inhibitors. The main purpose of quantitative pharmacophore generation is to predict or differentiate the active inhibitors from inactive compounds. The best pharmacophore model, Hypo1, consisting of 2 HBA, 1 Hy-Ali, and 1 Hy-Ar, was validated by Fischer's randomization method, a test set, and decoy set. Fischer's method confirmed the 95% statistical confidence of Hypo1; the test set showed a fairly high correlation between experimental and predicted IC50 values (correlation coefficient of 0.96), indicating satisfactory predictive ability; additionally, good EF (0.76) and GF (0.87) values for Hypo1 were calculated from the decoy set. The three validation methods confirmed that Hypo1 was the best hypothesis to differentiate the active inhibitors from inactive inhibitors of BChE. Thus, Hypo1 was used as a 3D query to screen molecular structural libraries, including the Maybridge and Chembridge databases. The hit compounds were filtered using ADMET, Lipinski's Rule of Five, and molecular docking to reduce the number of false positive results. Finally, 33 compounds were selected based on their critical interactions with the significant amino acids in BChE's active site. To confirm the inhibitors' potencies, we calculated the orbital energies, such as HOMO and LUMO, for hit compounds and 7 training set compounds. From among the 33 hit compounds, 10 compounds with the highest HOMO values were selected, and this set was further culled to 5 compounds based on their energy gaps, which is important for stability and energy transfer. From the overall results, we confirmed that 5 hit compounds satisfied all the pharmacophoric features in Hypo1 and are potential BChE inhibitors.

Author contribution

Sugunadevi SAKKIAH contributed to all parts of the work, including experimental design, conduct, and analysis, and preparation of the manuscript. Prof Keun Woo LEE analyzed the results.