Introduction

Neuropathic pain is a multifactor neurogenic disorder caused by physical damage of neurons, cancer and other diseases1. Patients of neuropathic pain suffer from chronic pain as well as mental illnesses such as depression, anxiety and sleeping disorders2. To date, the mechanism of neuropathic pain remains unclear, making diagnosis and treatment difficult1,3,4.

Anandamide is an endogenous cannabinoid formed by the N-acyl-phosphatidylethanolamine-selective phosphodiesterase (NAPE-PLD) catalyzed hydrolysis of N-arachidonoyl-phosphatidyl-ethanolamine (NAPE)5 and has important physiological roles in pain regulation6. However, activity period of anandamide is short due to the rapid inactivation of anandamide by fatty acid amide hydrolyase (FAAH-1)7,8. Catoblism of anandamide is associated with many different diseases, including cancer, cardiovascular disease, obesity and particularly neuropathic pain9,10,11,12. One emerging approach in controlling pain is the modulation of anandamide degradation by targeting FAAH-113,14,15. Several antagonists of FAAH have been successfully developed16,17,18. Recent findings suggest FAAH-1 cytosolic variant FAAH-like anandamide transporter (FLAT)19 as a possible target for regulating pain. Decreased transportation of anandamide to FAAH-1 by inhibiting FLAT may be an alternative to direct antagonism of FAAH.

In this study, we screen for drug-like compound against FLAT from TCM Database@Taiwan20. Ligand based drug design methods were employed to predict bioactivity of the selected ligands. Molecular dynamics were employed to investigate underlying molecular mechanisms that may contribute to FLAT inhibition.

Results

Homology modeling and molecular docking

Suitability of rat proteins as templates for modeling human proteins was assessed by sequence alignment. Alignment of native rat FAAH-1 and native human FAAH-1 sequences showed 79.7% identity and 89.8% similarity. Re-alignment following removal of α2-helices (T9-T76) (termed FLAT sequences for clarification purposes) increased sequence identity and similarity to 86.1% and 95.6%, respectively (Figure 3). We proceeded to model human FLAT structure using rat FLAT structure based on the high sequence identity and similarity of the FLAT sequences. Structural correctness of the modeled human FLAT structure was evaluated with the Ramachandran plot. A total of 491 residues (98.2%) were distributed in the favored region (Figure 4). Table 1 lists the nine residues (1.8%) distributed in the allowed region. Results of the Ramachandran plot suggest that the modeled human FLAT structure is correct.

Table 1 Residues of the modeled human FLAT protein structure with ϕ and ψ angles located within the allowed region
Figure 1
figure 1

Experimental procedure and structural basis of FLAT simulation.

(a) Simplified scheme of experimental procedures. (b) Structural basis for FLAT structure simulation using FAAH-1. The α2-interacting loop (K255-L278; red) is the binding site opening loop and the helices (P411-N435) colored in cyan are regions in FAAH-1 that interact with the membrane. Presence of the α2-helix (T9-T76; orange) in FAAH-1 was the primary structural difference from FLAT. Human FLAT was modeled from rat FLAT structure, which was computationally prepared by deleting the α2-helix region (amino acids T9-T76) in rat FAAH-1.

Figure 2
figure 2

Cartoon representation of the anandamide binding site and docking poses of TCM candidates within the binding site.

(a) Enlarged view of the docking site (green) within the modeled human FLAT protein. The front and back sides of the binding site are depicted in red and purple, respectively. (b) Front view of docking site with docked ligands. (9Z)-1-(5-pyridin-2-yl-1,3,4-oxadiazol-2-yl)octadec-9-en-1-one (control), Guineensine and Retrofractamide A are shown in orange, blue and green, respectively. Ser217 and Ile238 are amino acids found adjacent to the binding site. (c) Side view of docking site with docked ligands (45 degrees relative to b).

Figure 3
figure 3

Sequence alignment of target human FLAT sequence with template rat FLAT sequence.

FLAT sequences refer to that of rat FAAH-1 (PDB: 3K84) and human FAAH-1 (SwissProt: O00519) in which amino acids T9-T76 have been removed. Sequence identity and similarity were 86.1% and 95.6%, respectively.

Figure 4
figure 4

Ramachandran validation of the modeled human FLAT structure.

A total of 98.2% of the residues were distributed in the favored region. The remaining 1.8% were located in the allowed region. No residues have psi or phi angles in the disfavored regions.

Structure scaffolds of the control and TCM candidates are shown in Figure 5. Candidate selection was primarily based on structural similarities to the control and Dock Score which considers the internal energies of ligands and their corresponding protein-ligand interaction energy. Guineensine37 and Retrofractamide A38, compounds originating from Piper longum39,40 have Dock Scores of 61.45 and 54.01, respectively and were selected as candidates (Table 2). Front and side views of the docking poses of the TCM candidates and the control at the modeled anandamide binding site are illustrated in Figure 2b and 2c. Ligand-protein interactions can be extrapolated from Figure 6. For the control, 16 hydrophobic contacts were formed with 14 residues in FLAT (Figure 6a). Hydrogen bonds (H-bonds) were formed with Ser218, Ile239, Gly240 and Ser242. Guineensine was stabilized by solely hydrophobic interactions (Figure 6b). Similar to the control, Retrofractamide A formed both H-bonds and hydrophobic interactions with FLAT. Since Guineensine and Retrofractamide A primarily differ in the number of carbons in the hydrophobic tail (Figure 5b), the difference in hydrophobic tail length may contribute to differences in scoring functions (Table 2). Based on Ligplot analysis, the O atom located on the hydrocarbon chain was situated to form H-bonds with Met192, Ser194, Ser242 in Retrofractamide A. By contrast, the longer hydrocarbon chain in Guineensine pushes the same O atom away from the H-bond forming amino acids, resulting in Guineensine being stabilized by only hydrophobic interactions (Figure 6b). Cross comparing interaction residues among different test ligands reveals that Met192, Phe193, Ser194, Ile239 and Val492 are residues with which all test ligands formed interactions with (Table 3). The residues functioned to anchor the hydrocarbon tails of the ligands (Figure 7).

Table 2 Docking scores and bioactivity predictions of the TCM candidates and control compound
Table 3 Analysis of residues in FLAT that interact with test ligands
Figure 5
figure 5

Scaffold structures of control compound and TCM candidates.

(a) the alpha-ketoheterocycle inhibitor (9Z)-1-(5-pyridin-2-yl-1,3,4-oxadiazol-2-yl)octadec-9-en-1-one (control) and TCM candidates (b) Guineensine and Retrofractamide A. Guineensine and Retrofractamide A differ in the number of carbons found in the hydrophobic tail region. The number of carbons for Guineensine is depicted in blue and that for Retrofractamide A is depicted in gray.

Figure 6
figure 6

Protein-ligand interactions analyzed by Ligplot.

(a) (9Z)-1-(5-pyridin-2-yl-1,3,4-oxadiazol-2-yl)octadec-9-en-1-one, (b) Guineensine and (c) Retrofractamide A.

Figure 7
figure 7

Spatial orientation of key interaction residues with regard to test ligands.

(a) (9Z)-1-(5-pyridin-2-yl-1,3,4-oxadiazol-2-yl)octadec-9-en-1-one (mint), (b) Guineensine (blue) and (c) Retrofractamide A (violet). Common residues are illustrated by yellow surfaces.

Activity prediction using MLR and SVM

Representative descriptors determined by GFA and used for constructing MLR and SVM models were: C_Count, ES_Sum_dsCH, ES_Sum_aasC, ES_Sum_ssNH, ES_Sum_dsN, ES_Sum_aaN, ES_Sum_dO, ES_Sum_sF, AverageBondLength, Num_Chains. Descriptions of each property are listed in Supplementary Table S1. In general, antagonistic activity against FLAT was associated with carbon atom numbers, electrotopolocial state, atom bond length and unbranched chain numbers. The optimum GFA model with an R2 of 0.9403 was:

The coefficient of determination (R2) for the MLR and SVM models constructed using the aforementioned descriptors are 0.8973 (Figure 8a) and 0.7988 (Figure 8b), respectively. Correlation between observed and predicted bioactivity values indicate good prediction accuracy of both models. Predicted pIC50 of the control and TCM candidates using the MLR and SVM models are listed in Table 2. Both models predict bioactivity for the TCM candidates. Similar bioactivities ranging between 7.72–7.89 were predicted for the three compounds with MLR. SVM model predictions show Guineensine and (9Z)-1-(5-pyridin-2-yl-1,3,4-oxadiazol-2-yl)octadec-9-en-1-one to have bioactivities approximately 1.5 logs higher than Retrofractamide A. To reduce possible bias caused by one single model, we used a simplified consensus to summarize our bioactivity prediction results. The criteria for each score given is described in Table 2. The combined results of our MLR and SVM models indicate that Guineensine and the control have similar predicted bioactivities while Retrofractamide A may have lower bioactivities (Table 2). The variation in bioactivity could be related to the number of carbons within the hydrophobic tail since carbon numbers were the primary difference between Guineensine and Retrofractamide A. GFA results in which the number of C atoms is a representative descriptor of bioactivity support this view.

Figure 8
figure 8

Correlation plot of observed bioactivity verses predicted bioactivities generated by different models.

(a) MLR (b) SVM (c) CoMFA (d) CoMSIA.

Activity prediction using CoMFA and CoMSIA

PLS results of CoMFA and CoMSIA models are tabulated in Table 4. Steric field was the primary determinant of bioactivity in the CoMFA model. At an optimal number of components (ONC) of 6, the cross validation correlation coefficient (q2) of 0.690 and non-cross-validation correlation coefficient (r2) of 0.954 indicated a confident model. CoMSIA models considered multiple factors and the possible models at an ONC of 6 are shown in Table 4. The CoMSIA model integrating steric, hydrophobic and H-bond acceptor properties exhibited the highest cross validation (q2 = 0.802) and non-cross validation (r2 = 0.948) values, thus was selected as the CoMSIA model of choice.

Table 4 Partial least square (PLS) analysis of CoMFA and CoMSIA models

Correlation between the observed and predicted bioactivity values generated by the CoMFA and CoMSIA are shown in Figure 8c and 8d, respectively. Correlation coefficients (R2) of 0.9542 for the CoMFA model and 0.9478 for the CoMSIA model indicated good prediction strength of both models. The range of residuals between predicted and observed value is < ±1.0 (Table 5). Superimposition of aligned ligands with CoMFA feature contours indicate that ligand aromatic heads and bends in the hydrophobic tail region contoured to the steric favoring bulk (yellow) in the CoMFA map (Figure 9a). Additionally, the hydrophobic tail matches the contour disfavoring steric bulk (green). Alignment of the ligands with the bioactivity feature maps suggest activity of the ligands against the target protein FLAT. As expected, the CoMSIA feature map (Figure 9b) differed from CoMFA. Steric favoring contours (yellow), as well as steric disfavoring contours (green), were located near the hydrophobic tail. Both hydrophilic (cyan) and hydrophobic (gray) contours were located near the aromatic head. From the contour of the aromatic head region to the hydrophobic favoring bulk, we speculate that the ligands should exhibit some bioactivity despite the lack of bulk at the steric favoring region. Results from CoMFA and CoMSIA are consistent with results from MLR and SVM predictions and consistently suggest bioactivity of the TCM candidates toward FLAT.

Table 5 Bioactivity residuals1 between observed and predicted bioactivity values generated with the CoMFA and CoMSIA models
Figure 9
figure 9

Contour of superimposed ligands to CoMFA and CoMSIA feature maps.

(a) The aromatic head and bend in the hydrophobic tail contoured to the steric favoring region (yellow) and the hydrophobic tail regions contoured to the steric disfavoring region in the CoMFA map. (b) In the CoMSIA map, ligand tail regions contoured to both the steric favoring (yellow) and disfavoring (green) regions. Aromatic heads of the ligands contoured to both the hydrophobic (gray) and hydrophilic (cyan) features. (9Z)-1-(5-pyridin-2-yl-1,3,4-oxadiazol-2-yl)octadec-9-en-1-one, Guineensine and Retrofractamide A (green) are represented in blue, red and green, respectively.

Molecular dynamics (MD) simulation

Molecular dynamics simulations were employed to investigate dynamic stability of the complexes. Characteristics of the protein-ligand complexes are summarized in Table 6. Complexes of the TCM candidates Guineensine and Retrofractamide A had lower total energy levels than the control complex. Globally speaking, binding of TCM candidates did not significantly alter protein characteristics compared to the control FAAH-1 antagonist. On a local scale, calculated RMSD values and total drifts (Table 6) indicate that Retrofractamide A complex was most stable, followed by (9Z)-1-(5-pyridin-2-yl-1,3,4-oxadiazol-2-yl)octadec-9-en-1-one. Guineensine was the least stable of the three complexes.

Table 6 Characteristics of protein-ligand complexes formed by FLAT and select ligands

DSSP analysis allowed visualization of structural changes that contributed to stability differences. Three distinctive patterns, (1) without shake or vibration, (2) shake, or (3) vibration, were observed in the three complexes (Figure 10). The non-vibrating pattern implies a stable secondary protein structure of small changes. The shake pattern refers to isolated peaks of changes whereas the vibration pattern exhibits constant vibrations over a period of time. The shake pattern can be viewed as a pre-vibration signal. As shown in Figure 10a, all three patterns are present in the control complex. Isolated shaking patterns were present from 15–26 ns and accumulated to continuous vibrations from 26–40 ns. Guineensine complex was stable up to 6 ns, but exhibited vibration patterns after 6 ns to the end of MD (Figure 10b). Retrofractamide A complex remained stable during MD, with minor shakes observed in residues 497–577 after 30 ns (Figure 10c). The fluctuations observed here may provide an explanation for the larger RMSDs and drifts recorded in Table 6. Root mean square fluctuation (RMSF) analysis on docking interaction residues provide information on residues that may be important for the protein DSSP changes and ultimately bioactivity against FLAT. Large fluctuations at Ile239, Tyr272 and Gln274 could induce ligand instability and in turn affect the stability of FLAT. (Figure 11, Table 7).

Table 7 Root mean square fluctuations (RMSF) (nm) recorded for residues involved in docking
Figure 10
figure 10

Secondary structure changes observed during the 40 ns MD simulation.

(a) (9Z)-1-(5-pyridin-2-yl-1,3,4-oxadiazol-2-yl)octadec-9-en-1-one, (b) Guineensine and (c) Retrofractamide A. The three complexes demonstrate distinctive change patterns. The “shake” pattern refers to a sudden peak of vibration; the “vibration” pattern refers to observation of continuous vibrations.

Figure 11
figure 11

Root mean square analysis of FLAT residues implicated in docking.

Yellow, red and blue bars represent data for (9Z)-1-(5-pyridin-2-yl-1,3,4-oxadiazol-2-yl)octadec-9-en-1-one, (b) Guineensine and (c) Retrofractamide A, respectively.

Several analyses were conducted to determine the effects of TCM ligands on FLAT structure with (9Z)-1-(5-pyridin-2-yl-1,3,4-oxadiazol-2-yl)octadec-9-en-1-one as the point of reference (Figure 12). Solvent accessible surface area (SASA) remained relatively constant for protein complexes (Figure 12a) and individual ligands (Figure 12b), indicating no significant changes in protein folding. Average ligand SASAs of control, Guineensine and Retrofractamide A are 6.895 nm/NS2, 6.458 nm/NS2 and 5.488 nm/NS2, respectively, which is most likely related to the hydrophobic tail lengths of each ligand. Changes in complex compactness were evaluated based on radius of gyration (Rg) which measures the mass of atoms relative to the center of mass of the complex. Average and maximum levels of Rg for each complex are tabulate in Table 8. Over the course of time, an increase in Rgcomplex starting at approximately 28 ns was recorded for the control and maximum Rgcomplex of 4.184 nm was reached at 38.46 ns. In Guineensine, changes from the baseline trajectory were observed around 10 ns and peaked to Rgcomplex of 4.7 nm at 18.32 ns. The Rgcomplex for Retrofractamide A was relatively static. In agreement with Rgcomplex results, higher Rgligand, was recorded for Guineensine compared to the other two test ligands (Figure 12d). Intriguingly, Rgcomplex trajectories seemed to be related to pattern changes in Figure 10. Changes in Rgcomplex coincided with shakes or vibrations observed for the control and Guineensine.

Table 8 Radius of gyration (Rg) calculated for the control and TCM candidates
Figure 12
figure 12

Analysis of MD trajectories generated by Gromacs.

Trajectories for (a) protein solvent accessible surface area (SASA), (b) ligand SASA, (c) protein radius of gyration (Rg), (d) ligand Rg, (e) protein mean square deviation (MSD), (f) ligand MSD, (g) protein root mean square deviation (RMSD) and (h) ligand RMSD are shown. Data for (9Z)-1-(5-pyridin-2-yl-1,3,4-oxadiazol-2-yl)octadec-9-en-1-one, guineensine and Retrofractamide A are shown in mint, blue and violet, respectively.

The time dependent movement of atoms from their initial positions (MSD) were in agreement with gyration results. An increase in MSDcomplex (indicated by the increase in slope) was observed for the control (Figure 12e) as the complex becomes more relaxed after 25 ns (Figure 12c). A two-stage change in MSDcomplex was observed for Guineensine (Figure 12e). The steep slope in stage 1 (0–17 ns) matched the expansion of Guineensine complex (Figure 12c) and structure vibration (Figure 10b). During stage 2, the decrease in MSDcomplex slope coincided with increased Guineensine compactness and lower intensity vibrations. Displacement of ligands may also contribute to protein compactness and vibrations. Trajectory of control MSDligand was similar to that for Rgcomplex. For Guineensine, the MSDligand was similar to the trend of its MSDprotein. It is possible that in both cases, ligand shift was a primary force in initiating protein changes recorded in Figure 10 and Figure 12c. The high RMSDprotein of the control as opposed to the TCM candidates (Figure 12g) suggest that complexes formed by the TCM candidates cause less structural perturbation to the FLAT protein. No significant differences in RMSDligand was observed among the three compound (Figure 12h). Distance matrices of the three candidates were also similar (Figure 13).

Figure 13
figure 13

Distance matrices depicting the smallest distance between residue pairs.

(a) (9Z)-1-(5-pyridin-2-yl-1,3,4-oxadiazol-2-yl)octadec-9-en-1-one, (b) Guineensine and (c) Retrofractamide A.

Discussion

The selected TCM compounds Guineensine and Retrofractamide A are near identical chemical structures with the only difference being the length of the hydrocarbon chain length (Figure 5). The longer tail of Guineensine enables its stabilization by hydrophobic interactions within the FLAT protein (Figure 6). By contrast, the shorter tail in Retrofractamide A formed H-bonds with Met192, Ser194, Ser242 within the hydrophobic interior of the protein. Residues Met192, Phe183, Ser194, Ile239, Val492 were important contact residues that stabilized the tail regions within the docking site. Bioactivity results obtained through QSAR model predictions also suggested an association between bioactivity strength and chain length. Guineensine and the control were both predicted to have higher bioactivities than the shorter Retrofractamide A. MD results further revealed protein secondary structure changes in Guineensine and the control that were not observed in Retrofractamide A. Since ability to initiate unstable vibrations in FLAT showed similar trends to bioactivity (Figure 10), we propose that a possible link might exist between these characteristics. Large fluctuations at Ile239, Tyr272 and Gln274 could contribute to bioactivity by inducing ligand instability, changing protein compactness and affecting the stability of FLAT. Based on the combined results from docking, QSAR predictions and MD simulation, Guineensine exhibits similar patterns to the control and could be a likely candidate for further studies.

The TCM compound Guineensine was predicted to have similar bioactivities with the demonstrated FAAH-1 inhibitor (9Z)-1-(5-pyridin-2-yl-1,3,4-oxadiazol-2-yl)octadec-9-en-1-one. Molecular insights suggest that the inhibitory mechanism may be related to the ability to initiate unstable vibrations in FLAT. Guineensine induced violent vibrations in the FLAT complex. (9Z)-1-(5-pyridin-2-yl-1,3,4-oxadiazol-2-yl)octadec-9-en-1-one induced similar, albeit less extreme, vibration. Gromacs analysis results suggest that ligand movement within the binding site might be associated with protein compactness changes which further translates into global protein structure vibrations. Based on these observations, we suggest that Guineensine may be a possible ligand for inhibiting FLAT transportation of anandamine. These findings may have important implications in designing better alternative for managing neuropathic pain.

Methods

Figure 1a summarizes the overall procedure used in this study.

Homology modeling

The crystal structure of rat fatty-acid amide hydrolase 1 (PDB: 3K84)21 was downloaded from Protein Data Bank. For clarification purposes, all sequence numberings referred to follow those of rat FAAH-1 (PDB: 3K84). The α2 helix (from T9 to T76) of FAAH-1 was computationally removed to form the FLAT structure of rats (Figure 1b). The homology model of human FLAT was constructed by using rat FLAT as the template structure and human FAAH-1 (SwissProt Index: O00519) as the template sequence. RAMPAGE22 was used to verify validity of the predicted structure.

Molecular docking

The modeled human FLAT structure was applied for molecular docking. Over 30,000 compounds were downloaded from TCM Database@Taiwan20 and docked into the anandamide binding site of the modeled human FLAT structure (Figure 2a). All compounds were prepared with CHARMm (Chemistry at HARvard Molecular Mechanics)23 prior to docking to add missing hydrogen atoms. The alpha-ketoheterocycle inhibitor (9Z)-1-(5-pyridin-2-yl-1,3,4-oxadiazol-2-yl)octadec-9-en-1-one in rat FAAH-1 (PDB: 3K84) was used as the control21. Control and TCM compounds were docked using the LigandFit module in Discovery Studio 2.5 (DS 2.5)24. For each protein-ligand complex, five different conformations were generated based on default DS 2.5 settings. Each generated conformation was compared to the conformation of (9Z)-1-(5-pyridin-2-yl-1,3,4-oxadiazol-2-yl)octadec-9-en-1-one within the rat fatty-acid amide hydrolase 1 (PDB:3K84) crystal structure. The conformation most similar to the control in 3K84 and with the highest Dock Score was selected.

Activity prediction using quantitative structure activity relationship (QSAR) Models

A total of 52 ligands25 were selected to calculate molecular properties using the Calculate Molecular Properties protocol packaged in DS 2.5. Genetic function approximation (GFA) was used to identify ten representative descriptors associated with bioactivity. The calculated models were ranked by square correlation coefficient (R2) and the highest R2 model was used to predict bioactivities for the training set and test set. The linear Multiple Linear Regression (MLR) model was constructed with MATLAB26 and the non-liner model utilizing Support Vector Machine (SVM)27,28 was constructed with LibSVM29.

Comparative force field analysis (CoMFA) and comparative similarity indices analysis (CoMSIA) were applied to construct 3D-QSAR models. Ligand alignment was completed using the atom-fit module of SYBYL-X 1.130. In CoMFA, steric and electrostatic fields were calculated using Lennard-Jones potential (LJP) and Coulombic potential, respectively. In CoMSIA, descriptors including steric, hydrogen bond acceptor and donor, hydrophobic and electrostatic fields were calculated by Gaussian functions. Partial least squares (PLS)31 was used for property analysis. Cross validation model was ranked by cross-validated coefficient (q2), whereas non-cross validation was ranked by conventional correlation coefficient (r2), standard error of estimate (SEE) and standard error. The model with the highest q2, r2 values and the lowest SEE and standard error was selected as the optimum model and used to predict bioactivities of the TCM candidates.

Molecular dynamics (MD) simulation

All ligands are prepared by using web server SwissParam (http://swissparam.ch/)32 prior to MD simulation. MD was performed using GROMACS 4.0.733. The force field applied for simulation is detailed elsewhere34. Ligands and human FLAT were combined and immersed into a buffer (or solution) containing cubic box at a buffering distance of 1.2 nm between the complex and the edge of the box. Sodium and chloride ion were added to neutralize complex charge. Complex energies were subsequently minimized with the Steepest Descent method for 5000 steps. The last frame of each energy-minimized structure was used as the initial frame for MD simulation. Electrostatic interactions were calculated with the Particle-Mesh Ewald (PME) method35. The cutoff for PME was 1.0 nm. The time step was set at 2 fs and the number of steps set to 20000000, accumulating to a total MD simulation time of 40 ns.

MD trajectories were analyzed with built-in Gromacs tools. The secondary structure database (DSSP) was installed into Gromacs to analyze protein secondary structure changes36. Program g_energy was used to analyze potential energy, kinetic energy, total energy, temperature, pressure, volume, density, pV and enthalpy changes. Program g_gyrate was used the measure the radius of gyration. Program g_sas was used to compute interaction surface areas between solvent molecules and complexes. Program g_mdmat was used to generate distance matrices which calculate the smallest distance between each residue pairs.