Using macromolecular electron densities to improve the enrichment of active compounds in virtual screening

Ma, Wenzhi; Zhang, Wei; Le, Yuan; Shi, Xiaoxuan; Xu, Qingbo; Xiao, Yang; Dou, Yueying; Wang, Xiaoman; Zhou, Wenbiao; Peng, Wei; Zhang, Hongbo; Huang, Bo

doi:10.1038/s42004-023-00984-5

Article
Open access
Published: 22 August 2023

Wenzhi Ma¹^na1,
Wei Zhang^2,3^na1,
Yuan Le¹^na1,
Xiaoxuan Shi¹,
Qingbo Xu¹,
Yang Xiao¹,
Yueying Dou¹,
Xiaoman Wang¹,
Wenbiao Zhou¹,
Wei Peng ORCID: orcid.org/0000-0002-0949-4487^2,3,
Hongbo Zhang ORCID: orcid.org/0009-0005-1780-1968¹ &
Bo Huang ORCID: orcid.org/0000-0003-3822-9110¹

Communications Chemistry volume 6, Article number: 173 (2023)

Abstract

The quest for effective virtual screening algorithms is hindered by the scarcity of training data, calling for innovative approaches. This study presents the use of experimental electron density (ED) data for improving active compound enrichment in virtual screening, supported by ED’s ability to reflect the time-averaged behavior of ligands and solvents in the binding pocket. Experimental ED-based grid matching score (ExptGMS) was developed to score compounds by measuring the degree of matching between their binding conformations and a series of multi-resolution experimental ED grids. The efficiency of ExptGMS was validated using both in silico tests with the Directory of Useful Decoys-Enhanced dataset and wet-lab tests on Covid-19 3CLpro-inhibitors. ExptGMS improved the active compound enrichment in top-ranked molecules by approximately 20%. Furthermore, ExptGMS identified four active inhibitors of 3CLpro, with the most effective showing an IC₅₀ value of 1.9 µM. We also developed an online database containing experimental ED grids for over 17,000 proteins to facilitate the use of ExptGMS for academic users.

Introduction

Over the past decade, high-throughput virtual screening has become a popular method for discovering hit compounds in the field of drug design^1,2,3. When a receptor’s three-dimensional (3D) structure is available, molecular docking is used to identify potential binders for the target pocket⁴. However, due to the simplifications made to achieve high computational speed, such as treating the protein as mostly rigid and handling the solvent crudely⁵, docking and scoring accuracy is still suboptimal and has room for improvement. Numerous attempts have been made to address these challenges by focusing on algorithm and calculation protocol optimizations. For example, ensemble docking and induced-fit docking attempt to consider the flexibility of the pocket⁶, while molecular mechanics/generalized-Born surface area method (MM/GBSA) considers the effect of solvation⁷. Moreover, various designs for scoring functions have been created by considering more ligand–protein interactions, or by training machine-learning models with structural features, and biochemical and biophysical assay results as labels^8,9. Despite the success of these approaches in improving active compound enrichment for docking results, virtual screening still has a relatively low success rate, and more effective approaches are imperative. Since most of these approaches are designed from the perspective of algorithm and calculation protocol optimizations—and are approaching a bottleneck due to lack of training data—it is important to consider alternative perspectives by leveraging additional information that can experimentally reflect the dynamics of ligands and solvents.

Electron density (ED) maps from X-ray crystallography and Coulomb potential maps from cryogenic electron microscopy (Cryo-EM) are experimental data that provide valuable information about the dynamics of macromolecular systems, including the ligands and solvents present in the pocket^10,11. Some studies have explored the use of these maps for intermolecular noncovalent interaction (NCI) identification¹², artificial intelligence (AI)-based molecule generation¹³, and quantum mechanics parameter refinement¹⁴. Despite these advancements, the current virtual screening approaches rely predominantly on static structures and implicit solvent models. Thus, there is an urgent need to establish an efficient method for using these maps in docking-based virtual screening to enhance active compound enrichment.

In this study, we present a novel method, ExptGMS (Experimental ED-based Grid Matching Score), which utilizes experimental ED maps to screen docking poses for better enrichment of active compounds. A machine-learning model was built for the effective use of ExptGMS generated from multi-resolution ED maps. When tested using the Directory of Useful Decoys–Enhanced (DUD-E) dataset¹⁵, ExptGMS displayed the ability to complement molecular docking technology by achieving an over 20% increase in active compound enrichment in the top 10, 50, and 100 ranked compounds without affecting the diversity of the screening results. Approaches like 2D and 3D molecular similarity comparisons and MM/GBSA were used as benchmarks in our study. To further confirm the real-world effectiveness of ExptGMS, we performed virtual screening for Covid-19 3CLpro inhibitors. Using a biochemical assay, we tested the protease inhibition activity of the top-ranked compounds and discovered that the combination of ExptGMS and docking score provided three times more active compounds than the use of docking score alone. Furthermore, to facilitate the use of ExptGMS by academic users, we prepared ExptGMS grids for over 17,000 proteins and developed a database that provides web-based services (https://exptgms.stonewise.cn/#/create).

Results

Construction of ExptGMS

X-ray diffraction of macromolecular crystals generates an average ED over numerous crystal cells, which represents a time-average distribution of the molecules in the crystal. As shown in Fig. 1a, two ligand conformations were observed in the binding pocket. In addition, some solvent molecules with relatively intense dynamics may exhibit low intensity due to time-averaged effects, and may get overlooked during model building (Fig. 1b), resulting in incomplete modeling of the pocket contents. Given that most computational methods for virtual screening rely on static or incomplete models, the full profile of the pocket contents and their dynamic information embedded in the experimental ED maps are considered important complements to the methods currently in use.

**Fig. 1: Time-averaged information embedded in experimental electron density (ED) maps.**

To fully utilize the time-averaged signals in ED maps, we developed ExptGMS, which has two key components: an experimental ED-based grid and a scoring function. We used 2F_o–F_c ED maps with above-zero contour levels (>0 σ) for grid generation. To avoid excessive experimental noise, ED values below 0 σ were excluded. Grid points were placed in and around the pocket and were assigned values reflecting the ED intensity at that position (Fig. 1c). A given ligand conformer is scored based on its degree of matching with the grid. We developed the scoring function based on three principles: (1) rewarding ligand atoms occupying grid points with strong ED intensity; (2) penalizing ligand atoms occupying space with no grid points; (3) penalizing grid points with strong ED intensity but not occupied by any ligand atom. Details regarding the grid construction and scoring function development can be found in “Methods”.

To address the bias introduced by the grid being constructed on the ED of binders and solvents observed in a limited number of experiments with a limited range of binder types, we used the concept of ED map resolution. An ED map with lower resolution contains fewer details and is more abstract than the one with higher resolution; thus, it can provide more generalized conformation matches and consequently enhance the diversity of matched molecules. For pilot testing, we chose a median resolution of 3.0 Å to create ExptGMS.

Performance of 3.0 Å resolution ExptGMS on DUD-E dataset

The evaluation of ExptGMS generated using experimental ED at 3.0 Å resolution is described from three perspectives: dataset, benchmark technologies, and evaluation framework.

From a total of 102 targets in the DUD-E dataset, 85 targets were selected, since the remaining had no qualified experimental ED available. For each target in the dataset, about 13,000 compounds were available, with an active-to-decoy ratio of 1:30. The binding positions of the active compounds and decoys in the corresponding pockets were obtained from a previous study which used GlideSP for docking¹⁶.

GlideSP was selected as the docking program in our study because of its widespread use in the industry. We included two types of benchmarks for comparison with ExptGMS: pocket-based and ligand-based approaches. For the pocket-based approach, MM/GBSA was used for binding energy-focused evaluation; while pocket shape-focused evaluation was done using alpha spheres and ExptGMS shape-only (ExptGMS grid with all grid point intensities set as one). For the ligand-based approach, extended-connectivity fingerprint (ECFP) with Tanimoto index was included as the 2D similarity descriptor. In addition, Ultrafast Shape Recognition with CREDO Atom Types (USRCAT)¹⁷—a 3D similarity method that incorporates information on shape and pharmacophores—was used as the 3D similarity descriptor. Furthermore, three-dimensional force field fingerprint (TF3P)¹⁸—a newly developed 3D fingerprint for small molecules—was also included to represent the force field and deep learning-based approaches. Finally, because our goal was to test whether the addition of ExptGMS could assist docking procedures in eliminating false positives and false negatives, we included tandemly linked GlideSP and ExptGMS scores in the test. This hybrid approach was termed GlideSP + ExptGMS and involves the process of selecting the top 10% molecules based on their ExptGMS scores, and then ranking them according to their GlideSP scores.
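The tandem GlideSP + ExptGMS procedure described above can be illustrated with a minimal sketch. It assumes that a smaller ExptGMS score indicates a better match (as stated for MatchScore in “Methods”) and that a more negative GlideSP score is better; the function name, dictionary keys, and rounding of the 10% cutoff are hypothetical choices made for illustration, not the authors' implementation.

```python
def glide_plus_exptgms(compounds, keep_fraction=0.10):
    """Rank compounds by GlideSP score after an ExptGMS pre-filter.

    compounds: list of dicts with hypothetical keys 'id', 'exptgms', 'glide'.
    Assumption: lower is better for both scores.
    """
    # Keep the best-matching fraction according to ExptGMS (at least one compound).
    n_keep = max(1, int(len(compounds) * keep_fraction))
    shortlist = sorted(compounds, key=lambda c: c["exptgms"])[:n_keep]
    # Re-rank the shortlist by docking score (most negative first).
    return sorted(shortlist, key=lambda c: c["glide"])
```

In this arrangement ExptGMS acts purely as a filter, so the final order within the shortlist is decided by the docking score alone.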

For the evaluation framework, two key indexes were considered: (1) enrichment of active compounds, measured by the number of active compounds in the top 10, 50, and 100 ranked molecules, and (2) diversity of the top-ranked active compounds measured using the average Tanimoto similarity of each pair of selected active compounds. We measured both enrichment and diversity because virtual screening methods should identify a variety of scaffolds in addition to a large number of active compounds.

The results were analyzed using a two-dimensional scatter plot. The highest enrichment was achieved by 2D similarity comparison, but at the cost of a significant loss in diversity (Fig. 2, Supporting Information Fig. S1 and Supplementary Data 1). Considering both enrichment and diversity, ExptGMS outperforms most of the benchmark approaches. More importantly, GlideSP + ExptGMS enriched more active compounds in both the top 10 and top 50 than GlideSP alone, indicating that ExptGMS is complementary to GlideSP. Because this pilot test used single-resolution ExptGMS, the observed complementarity is not strong, but it supports the direction of our approach. In addition, the significant drop in performance of ExptGMS shape-only relative to ExptGMS confirms the effectiveness of introducing ED intensity to the grid.

**Fig. 2: Comparison of electron density-based grid matching score (ExptGMS) with benchmark technologies using 85 targets from DUD-E dataset.**

The complementarity of ExptGMS to GlideSP was also demonstrated in the case studies (Fig. 3). As shown in Fig. 3a, an inactive compound with a good GlideSP score was eliminated due to a poor match to the ExptGMS grid. This molecule had a docking score of −7.3, with Rewards and HBond scores accounting for −2.9 and −1.1, respectively. Because the GlideSP scoring function assigns high weights to empirical terms, molecules having empirically recognizable interactions with pockets tend to score well. However, from the perspective of ExptGMS, this molecule failed to occupy a strong ED blob, resulting in a poor ExptGMS score. In addition to this false-positive elimination case, we have also listed two false-negative elimination cases. Figure 3b shows an active molecule that fitted well with the ExptGMS grid but did not have any empirically favored interactions, and therefore had a low GlideSP score of −4.8. Furthermore, as shown in Fig. 3c, an active molecule whose carboxyl group occupied the ED originally contributed by a water molecule in the crystal structure achieved a good ExptGMS ranking. Based solely on its low GlideSP score, this compound would have been eliminated. This case demonstrates the effectiveness of preserving the solvent ED information in the ExptGMS grid.

**Fig. 3: Case study of electron density-based grid matching score (ExptGMS) supporting GlideSP in eliminating false-positive and false-negative results.**

In summary, our pilot testing on 3.0 Å resolution ExptGMS confirmed our hypothesis that ExptGMS contains signals useful for the improvement of active compound enrichment. To further maximize the effectiveness of such signals, we considered an ExptGMS with multiple resolutions.

Performance of multi-resolution ExptGMS on DUD-E dataset

Like the experimental ED maps from which they are built, ExptGMS grids can be constructed at different resolutions. As shown in Fig. 4a, an ExptGMS grid with a specific resolution can be constructed using the ED map at that resolution. Furthermore, the curve in Fig. 4b illustrates that decreasing resolution results in a more uniform distribution of grid values, suggesting a higher degree of tolerance for conformational matches with ligand candidates. Such characteristics affect the recall of compounds that differ significantly from the reference ligand topology, and may consequently improve enrichment.

**Fig. 4: Electron density-based grid matching score (ExptGMS) grids with different resolutions.**

To quantify the ability of ExptGMS at varying resolutions to enrich active compounds, we extended the aforementioned pilot testing on 3.0 Å resolution ExptGMS to four additional resolutions: 2.5 Å, 3.5 Å, 4.5 Å, and 5.5 Å. For clarity, we arranged all 85 tested targets in a circle and colored each target with a resolution-specific color if the active compound enrichment of GlideSP + ExptGMS at that resolution outperformed that of GlideSP alone. Figure 5a displays the union of these colored targets across different resolutions, covering ~75% of the targets, while the intersection of these colored targets accounts for only one-third of the total. This observation suggests the potential of using multi-resolution ExptGMS to achieve superior performance.

**Fig. 5: Performance of electron density-based grid matching score (ExptGMS) with varying resolutions on 85 targets from the Directory of Useful Decoys–Enhanced (DUD-E) dataset.**

The question arises as to why ExptGMS grids with different resolutions can complement each other in enriching active compounds. One possible explanation is that ExptGMS grids with different resolutions tend to score ligands from different perspectives: low-resolution grids focus on scaffold-level information, whereas high-resolution grids focus on R-group or atomic-level information. This distinction arises from an intrinsic characteristic of X-ray or electron diffraction-based density, where decreasing the resolution results in a more uniform intensity distribution with fewer details expressed in the density map. To illustrate this point, we present a case involving PDB ID 2HV5. Here, an active compound exhibits a similar binding mode and scaffold to the co-crystallized ligand of the protein (Fig. 5b). This active ligand (yellow) can be ranked in the top 100 by using ExptGMS at 3.5 Å but not at 2.5 Å. To highlight the difference between the ExptGMS grids at these two resolutions, we selected grid points with strong intensities (i.e., over 3 σ) and showed them side by side in Fig. 5c. The 2.5 Å grid appears more fragmented, containing more high-intensity blobs (red grid points) than the 3.5 Å grid. When scoring the original co-crystallized ligand (cyan), the fragmented blobs in the 2.5 Å grid exhibit a higher degree of matching with the ligand than the 3.5 Å grid (Fig. 5d). However, when scoring the active compound sharing a similar scaffold but with different substitution groups, the 3.5 Å grid shows a better match than the 2.5 Å grid. Figure 5e illustrates that the penalty introduced by the fragmented blob (#1) in the 2.5 Å grid is waived in the 3.5 Å grid, and the strong blob (#2) in the 2.5 Å grid spreads across a wider region, fitting more accurately with the scaffold profile of the compound.

Multi-resolution ExptGMS-powered machine-learning model

To further demonstrate the value of multi-resolution ExptGMS, we developed a straightforward machine-learning model using Gradient Boosting Decision Tree (GBDT) for signal integration. We did not select a more complicated model because we focused on testing the value of the data. Our GBDT model is a classification model that was trained and tested using 85 targets from the DUD-E dataset. Specifically, the training set contained 73 targets, and the test set contained 12 targets. To prevent information leakage, the training and test sets (Supporting Information Table S1) were divided such that no target in the test set shared a sequence identity greater than 30% with any target in the training set. The activities reported in the DUD-E dataset were used as labels.

The GBDT model was trained in different versions, in which the selected features provided by benchmark technologies were used. As shown in Table 1 (details in Supplementary Data 2 and 3), the combination of GlideSP and ExptGMS exhibited the highest enrichment and good diversity of active compounds. The addition of multi-resolution ExptGMS improved the enrichment of active compounds in top 10 and top 50 by more than 20%, compared to the use of GlideSP alone.

Table 1 Performance of GBDT models trained using different features on the DUD-E test set (N = 12).

The confidence intervals for the active compounds in the top N were obtained using the bootstrapping method. Specifically, samples were randomly selected with replacement from the test dataset until the selected sample size matched the size of the test dataset. Considering all the selected samples, the average numbers of active compounds in the top 10, 50, and 100 results were calculated, respectively. By repeating this process 200 times, a distribution of results was generated. From this distribution, the mean value and percentile confidence interval were computed.
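The bootstrapping procedure above can be sketched as follows. This is a minimal illustration under stated assumptions: the resampled unit is taken to be one per-target count of active compounds in the top N, the interval is a 95% percentile interval, and the percentile is read off by simple index truncation. The function name and these choices are hypothetical; the text specifies only resampling with replacement and 200 repetitions.

```python
import random
import statistics

def bootstrap_mean_ci(per_target_counts, n_rounds=200, alpha=0.05, seed=0):
    """Bootstrap the mean number of actives in the top N across test targets.

    Returns (mean of the bootstrap distribution, (lower, upper) percentile bounds).
    """
    rng = random.Random(seed)
    n = len(per_target_counts)
    means = []
    for _ in range(n_rounds):
        # Resample with replacement until the sample size matches the test set size.
        resample = [rng.choice(per_target_counts) for _ in range(n)]
        means.append(statistics.fmean(resample))
    means.sort()
    # Percentile bounds taken by index truncation (an assumption; no interpolation).
    lower = means[int((alpha / 2) * (n_rounds - 1))]
    upper = means[int((1 - alpha / 2) * (n_rounds - 1))]
    return statistics.fmean(means), (lower, upper)
```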

In addition to enhancing the enrichment of active compounds within the top N ranked molecules, we sought to assess the impact of ExptGMS on the classification of active and decoy compounds. For evaluation, we utilized the area under the receiver operating characteristic curve (AUROC). The GBDT model incorporating both the GlideSP score and ExptGMS features demonstrated a higher AUROC than the model that used only the GlideSP score as a feature (Table 1), reflecting the classifier’s improvement with the inclusion of ExptGMS. Nonetheless, it is important to acknowledge that this improvement is mild, and the absolute AUROC value remains relatively low, indicating the need to incorporate ExptGMS features into more sophisticated models in future research.

Application of ExptGMS in virtual screening of Covid-19 3CLpro inhibitors

To further validate the efficiency of ExptGMS in the real world, we applied this method for the virtual screening of 3CLpro inhibitors. Using the pocket structure extracted from the crystal structure of SARS-CoV-2 3CL protease (PDB ID 7VU6), GlideSP-based molecular docking was performed against an 8-million-compound library compiled by consolidating commercially available compounds. Subsequently, the 3 Å resolution ExptGMS score was calculated for the conformations obtained from the molecular docking. 24 molecules were selected by intersecting the top 500 compounds ranked by ExptGMS score with the top 1000 compounds ranked by docking score. These 24 compounds were evaluated using wet-lab tests to determine their inhibitory rates and IC₅₀ values. The top 24 compounds, ranked solely by docking scores, were also tested to serve as controls. It is important to mention that no visual inspection or manual selection was involved in the selection of the aforementioned 48 compounds.

The structures, binding modes, and IC₅₀ values of the tested compounds are presented in Fig. 6 and Supporting Information (Supplementary Tables S2 and S3 and Supplementary Figs. S2 and S3). Among the 24 molecules selected using ExptGMS and GlideSP, nine molecules exhibited an inhibition rate greater than 50%, and four molecules exhibited IC₅₀ values of less than 25 µM, with the best one reaching 1.9 µM. In contrast, among the top 24 ranked molecules obtained using GlideSP alone, only three exhibited an inhibition rate greater than 50%, and only one of them exhibited an IC₅₀ of around 10 µM.

**Fig. 6: Active inhibitors of Covid-19 3CLpro.**

In conclusion, ExptGMS significantly enhanced the enrichment of active compounds in our Covid-19 3CLpro-inhibitor screening study.

Construction of ExptGMS database and online service

Despite the value of multi-resolution ExptGMS demonstrated in the above study, the construction of ExptGMS grids is not straightforward for end users. To facilitate the use of our approach by academic users, we processed multi-resolution ExptGMS grids for over 17,000 proteins and created a web-based server that can be accessed at this link (https://exptgms.stonewise.cn/#/create). Prior to working, users are required to select grids by specifying the PDB code from a drop-down list, upload the ligand poses, and upload the pocket structure which will be used to align the ligands with the grids. Typically, our service takes about one hour to complete the ExptGMS scoring for 100,000 compounds. ExptGMS grids at 2.5, 3.0, 3.5, 4.5, and 5.5 Å resolutions will be used to score the docking conformations. The ExptGMS scores at each resolution are written in the output SDF file. If a “docking_score” attribute is available in the uploaded SDF file, it will be combined with multi-resolution ExptGMS scores and submitted to our GBDT model, and the predicted probability of being an active compound will be added to the output SDF file.

Discussion

Our study on ExptGMS demonstrates that the use of experimental ED can improve the enrichment of active compounds in molecular docking-based virtual screening. In this section, we discuss two topics: (1) how to further leverage multiple-crystal information, if available; and (2) what limitations ExptGMS currently has, and how they can be overcome in the future.

Given that most of the popular targets have more than one available crystal structure, it is necessary to discuss whether ExptGMS can benefit from using multi-crystal information. An intuitive approach involves creating an ExptGMS grid for each crystal structure, followed by an averaging procedure. We tried this approach on four crystal structures of RAC-alpha serine/threonine-protein kinase (AKT1) and tested the performance of the multi-crystal averaged ExptGMS using active compounds and decoys of this target in DUD-E (Fig. 7). As shown in Table 2, the multi-crystal averaged ExptGMS significantly outperformed the single-crystal ExptGMS. Such an analysis provides a good starting point for investigating the optimal strategy for using ExptGMS to improve ensemble docking.

**Fig. 7: Construction of multi-crystal averaged ExptGMS grid for AKT1.**

Table 2 Performance of multi-crystal averaged ExptGMS vs. single-crystal ExptGMS.

ExptGMS has three limitations. First, the current use of ExptGMS relies on the binding conformation produced by the docking programs, which limits the value of ExptGMS if the binding pose is incorrect. An example is shown in Fig. 8, where an incorrect binding pose can be intuitively corrected by aligning the molecule to the ExptGMS grid. Real-space refinement technology¹⁹ used in crystallography can serve as a good starting point for the development of ExptGMS-based high-speed binding-pose-search engines. Second, the construction of ExptGMS depends on the availability of experimental ED. However, this condition cannot always be satisfied. For example, it is common to use apo-protein structures processed by molecular simulation for virtual screening, especially for studies in which allosteric pockets are considered. In this scenario, an experimental ED is not available. One plausible approach to address this challenge is to conduct co-solvent molecular dynamics²⁰ to find a fragment-sized binder in the potential pocket and use computational ED to construct the ExptGMS grid. Using an AI generative model¹³ to create a filler ED in the pocket is an alternative approach. Third, the current version of ExptGMS lacks information to support the estimation of the interactions between ligands and pockets. This explains why the combination of GlideSP and ExptGMS performs better than ExptGMS alone; GlideSP supplies this interaction estimation. A clue to address this challenge can be found in a previously reported study on identifying NCIs between ligands and pockets by studying the saddle points of the ED¹². Incorporating NCI-related saddle points into ExptGMS grids and assigning them appropriate weights could be a potential solution. An alternative solution could be the introduction of an electrostatic surface potential (ESP)-matching score into ExptGMS grids. A previous study²¹ discussing this topic is a good starting point for future research.

**Fig. 8: Example showing the necessity of applying ExptGMS-based binding-pose searching.**

In summary, our research highlights the importance of data, unlike other computational approaches that focus on algorithms, and demonstrates the value of experimental ED in enriching active compounds in virtual screening. We hope that our study will contribute to the community as a novel data source and open a new door for future algorithmic studies.

Methods

Datasets

The DUD-E¹⁵ dataset was used in this study. After removing the targets lacking a qualified ED map in the PDB, 85 targets were retained. GlideSP²² docking poses and scores of all active compounds and decoys cited in DUD-E, for these 85 targets, were obtained from a previous study¹⁶.

Experimental ED map preparation and ExptGMS grid generation

The coordinates and map coefficients were downloaded from the RCSB PDB web server²³. The sigma (σ)-scaled 2F_o–F_c maps were synthesized at specific resolutions using Phenix²⁴, to cover the ligands and a 5 Å region around them. Specifically, the highest available resolution X-ray diffraction data in .mtz format were used as input for phenix.fft to generate electron density maps at various resolutions, which were specified by the parameter d_min. To create an ExptGMS grid based on an ED map, the map was first discretized into grids with a 0.3 Å interval. The value assigned to a grid point was the 2F_o–F_c ED intensity at that particular point. Grid points within the van der Waals radius of the pocket residue atoms were removed. Grid points with values of less than 0 σ were also removed. All the experimental maps involved in this study are electron density maps obtained through X-ray crystallography. No Coulomb potential maps from Cryo-EM were involved.
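The two filtering steps applied to the discretized map can be sketched as below. This illustration assumes the map has already been sampled on a 0.3 Å lattice by other means (the Phenix step is not reproduced here), and that "within the van der Waals radius" is an inclusive distance test; the function name and data layout are hypothetical.

```python
import math

def filter_grid_points(sampled_points, pocket_atoms):
    """Apply the two removal rules to pre-sampled grid points.

    sampled_points: list of ((x, y, z), sigma_scaled_intensity) on a 0.3 A lattice.
    pocket_atoms:   list of ((x, y, z), vdw_radius) for pocket residue atoms.
    """
    kept = []
    for xyz, value in sampled_points:
        # Rule 1: drop points whose 2Fo-Fc intensity is below 0 sigma.
        if value < 0.0:
            continue
        # Rule 2: drop points inside the van der Waals sphere of any pocket atom.
        if any(math.dist(xyz, atom_xyz) <= radius for atom_xyz, radius in pocket_atoms):
            continue
        kept.append((xyz, value))
    return kept
```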

Alpha sphere-based pocket shape preparation and grid generation

An alpha sphere-based pocket shape was produced using FPocket version 4.0²⁵. To generate a grid based on pocket shape, the region filled with alpha spheres was covered with grid points at intervals of 0.3 Å. Each grid point was assigned a value of one. Grid points within the van der Waals radius of the pocket residue atoms were removed.

ExptGMS scoring function

A scoring function was designed to measure the degree of matching between the ligand conformations and ExptGMS grids [Eq. (1)].

$$\mathrm{MatchScore} = \mathrm{S}_{vac} - \mathrm{S}_{occ} + \mathrm{P} \tag{1}$$

where $\mathrm{S}_{vac}$ represents the contribution of vacant grid points that have intensity values but are not occupied by any ligand atoms; $\mathrm{S}_{occ}$ represents the contribution of grid points occupied by ligand atoms; and P represents the contribution of ligand atoms with no nearby grid points. A smaller MatchScore indicates a better match.

S_occ is defined using Eq. (2):

$$\mathrm{S}_{occ} = \sum_{m \in M} \sum_{\mathbf{r} \in R_m} w_m \, \rho(\mathbf{r}) \tag{2}$$

where M represents all heavy atoms in the ligand; R_m represents grid points located within a radius of 0.4 Å around a given atom m; ρ(r) indicates the intensity value at grid point r; and $w_m$ represents the electron number of atom m with a ceiling value of 9.

S_vac is defined using Eq. (3):

$$\mathrm{S}_{vac} = \sum_{\boldsymbol{v} \in V} \rho(\boldsymbol{v}) \tag{3}$$

where V indicates vacant grid points with no ligand atoms within a radius of 0.4 Å; and ρ(v) represents the intensity value of grid point v.

P is defined using Eq. (4):

$$\mathrm{P} = n_{out} \left( \frac{\mathrm{S}_{occ}}{n_{in}} \right) \tag{4}$$

where $n_{in}$ and $n_{out}$ denote the number of ligand atoms with and without grid points found within a radius of 0.4 Å, respectively.
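Equations (1) to (4) can be combined into a short reference sketch. It is a brute-force illustration rather than the authors' implementation, and it makes three assumptions that the text does not specify: the 0.4 Å radius test is inclusive, a grid point near several atoms contributes once per atom to S_occ, and a ligand with no atom near any grid point (n_in = 0, where Eq. (4) is undefined) is given the worst possible score.

```python
import math

def match_score(atoms, grid, radius=0.4, max_weight=9):
    """Compute MatchScore = S_vac - S_occ + P for one ligand conformer.

    atoms: list of ((x, y, z), electron_number) for ligand heavy atoms.
    grid:  list of ((x, y, z), intensity) for ExptGMS grid points.
    """
    s_occ = 0.0
    n_in = n_out = 0
    occupied = [False] * len(grid)
    for atom_xyz, electron_number in atoms:
        weight = min(electron_number, max_weight)  # w_m with a ceiling of 9
        has_neighbor = False
        for i, (point_xyz, rho) in enumerate(grid):
            if math.dist(atom_xyz, point_xyz) <= radius:
                s_occ += weight * rho  # Eq. (2)
                occupied[i] = True
                has_neighbor = True
        if has_neighbor:
            n_in += 1
        else:
            n_out += 1
    # Eq. (3): intensity of grid points with no ligand atom within the radius.
    s_vac = sum(rho for (_, rho), occ in zip(grid, occupied) if not occ)
    if n_in == 0:
        return math.inf  # assumption: Eq. (4) is undefined here, treat as worst match
    penalty = n_out * (s_occ / n_in)  # Eq. (4)
    return s_vac - s_occ + penalty  # Eq. (1); smaller is better
```

For realistic grid sizes, a spatial index (for example a k-d tree) would replace the nested loop; the quadratic scan is kept here only to mirror the equations term by term.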

Molecule similarity and diversity

The Tanimoto index was used to measure the 2D similarity between two molecules based on their ECFP4 fingerprints. The diversity of a set of molecules was defined as:

$$\mathrm{Diversity} = 1 - (\text{average pairwise 2D similarity among the molecules}) \tag{5}$$

To measure 3D similarity, the Manhattan distance between the two molecules was calculated using their USRCAT descriptors¹⁷. ECFP4 and USRCAT calculations were performed using RDKit²⁶.
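Eq. (5) and the Manhattan distance can be illustrated without any cheminformatics dependency. The sketch below assumes each fingerprint is represented as a Python set of on-bit indices and each 3D descriptor as a plain list of floats; generating ECFP4 or USRCAT descriptors themselves is outside its scope, and the convention that two empty fingerprints have similarity 0 is an assumption.

```python
from itertools import combinations

def tanimoto(fp_a, fp_b):
    """Tanimoto index between two fingerprints given as sets of on-bit indices."""
    union = len(fp_a | fp_b)
    return len(fp_a & fp_b) / union if union else 0.0

def diversity(fingerprints):
    """Eq. (5): one minus the average pairwise Tanimoto similarity."""
    pairs = list(combinations(fingerprints, 2))
    if not pairs:
        raise ValueError("diversity needs at least two molecules")
    return 1.0 - sum(tanimoto(a, b) for a, b in pairs) / len(pairs)

def manhattan(desc_a, desc_b):
    """Manhattan (L1) distance between two equal-length descriptor vectors."""
    return sum(abs(x - y) for x, y in zip(desc_a, desc_b))
```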

MM/GBSA

The receptor structure was prepared using the Protein Preparation Wizard program. To calculate the single-point MM/GBSA binding free energy of the ligand–receptor complex, we used the Prime program. All residues within 4 Å of the ligand were treated as flexible during minimization. The Protein Preparation Wizard and Prime programs used in this study were sourced from the Schrödinger Suite (Release 2022-3). The force field used was OPLS4.

Machine-learning approach utilizing multi-resolution ExptGMS

The DUD-E dataset was split into two separate subsets, a training set containing 73 targets, and a test set containing 12 targets (Supplementary Table S1). The sequence identities between target proteins were calculated using the Basic Local Alignment Search Tool (BLAST)²⁷ from NCBI. To avoid data leakage during the machine-learning process, it was ensured that none of the targets in the test set shared more than 30% sequence identity with any sequence in the training set, thus minimizing the potential for sequence similarity bias.
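The leakage criterion can be checked mechanically once pairwise identities are available. The sketch below assumes the identities have already been computed and stored in a dictionary keyed by unordered target pairs, and that a pair with no reported alignment counts as 0% identity; the function name, data layout, and target labels used in any example are hypothetical.

```python
def leaking_pairs(identity, train_targets, test_targets, threshold=30.0):
    """Return (test, train) target pairs whose sequence identity exceeds the threshold.

    identity: dict mapping frozenset({target_a, target_b}) -> percent identity.
    Assumption: pairs missing from the dict had no significant alignment (0%).
    """
    return [
        (test, train)
        for test in test_targets
        for train in train_targets
        if identity.get(frozenset((test, train)), 0.0) > threshold
    ]
```

An empty result indicates that the split satisfies the 30% criterion for the supplied identities.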

A series of ExptGMS grids were generated using experimental ED maps of varying resolutions (2.5, 3.0, 3.5, 4.5, and 5.5 Å). For each ExptGMS grid, small molecules were scored using Eq. (1), and the scores were normalized to uniform features. Since the ExptGMS score may vary significantly among different targets, we employed by-target normalization in our approach. The mean and standard deviation of the ExptGMS scores for each target were calculated, and the original ExptGMS score was transformed to Z-score (Z = (x − μ)/σ, where μ represents mean value and σ represents standard deviation). Similarly, the alpha sphere matching, GlideSP, and MM/GBSA scores were also normalized using by-target Z-score, while the other features remained unchanged during our feature generation procedure.
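The by-target normalization can be sketched as follows. The text does not state whether the population or sample standard deviation was used, so the population form is assumed here; mapping a target with zero variance to Z = 0 is likewise an assumption, and the function name and row layout are hypothetical.

```python
import statistics
from collections import defaultdict

def zscore_by_target(rows):
    """Transform raw scores to per-target Z-scores, Z = (x - mu) / sigma.

    rows: list of (target, compound_id, score) tuples.
    """
    scores = defaultdict(list)
    for target, _, score in rows:
        scores[target].append(score)
    # Mean and population standard deviation per target (assumed convention).
    stats = {t: (statistics.fmean(v), statistics.pstdev(v)) for t, v in scores.items()}
    normalized = []
    for target, compound_id, score in rows:
        mu, sigma = stats[target]
        z = (score - mu) / sigma if sigma > 0 else 0.0
        normalized.append((target, compound_id, z))
    return normalized
```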

In our GBDT model, ED scores from different resolutions were treated as a single feature with different preferences. Instead of using a statistical value such as the mean or median as a unique representation, we trained a group of decision trees by combining ExptGMS scores with other features. These submodels were further ensembled using the gradient boosting method. To address the imbalance between positive and negative samples in the dataset, a label-balancing strategy was introduced during training, in which each sample was assigned a weight inversely proportional to the number of samples in its class. The GBDT model was implemented using Scikit-learn²⁸ with parameters documented in the Supporting Information (Supplementary Table S4 and Supplementary Fig. S4).
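One way to realize the label-balancing strategy is sketched below. The exact weighting formula is not given in the text; the n_samples / (n_classes × class_count) form used here is an assumption, chosen because it matches the common "balanced" heuristic, and the function name is hypothetical.

```python
from collections import Counter


def balanced_sample_weights(labels):
    """Weight each sample inversely to the size of its class.

    Uses n_samples / (n_classes * class_count), so every class
    contributes the same total weight to the training loss.
    """
    counts = Counter(labels)
    n_samples, n_classes = len(labels), len(counts)
    return [n_samples / (n_classes * counts[y]) for y in labels]


# Toy label vector: one active (1) among seven decoys (0).
labels = [1, 0, 0, 0, 0, 0, 0, 0]
weights = balanced_sample_weights(labels)
```

Weights of this kind can be supplied through the `sample_weight` argument that Scikit-learn estimators such as `GradientBoostingClassifier` accept in `fit`; whether the authors used that particular estimator is not stated here.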

Software for figures and tables

All structure and ED figures were made using PyMOL. Analyses were performed using Pandas²⁹, NumPy³⁰, and Scikit-learn²⁸. Scatter plots were constructed using the Matplotlib³¹ and Seaborn³² libraries.

Covid-19 3CLpro-inhibitor virtual screening and biochemical assay

The pocket used for virtual screening was obtained from SARS-CoV-2 3CL protease (PDB ID 7VU6) and prepared using the Protein Preparation Wizard program. Docking was performed with the Glide program against an in-house virtual compound library containing the structures of more than 8 million commercially accessible compounds. During screening, constraints were set so that each output ligand was required to form at least one hydrogen bond with the amide group of G143 or E166 in the pocket.

After docking, compounds were ranked based on their GlideSP score. The top 100,000 compounds were then subjected to an ExptGMS score calculation using a 3.0 Å ED map. By intersecting the top 500 compounds ranked by ExptGMS score with the top 1000 compounds ranked by docking score, we selected 24 molecules. These 24 compounds were evaluated using wet-lab tests to determine their inhibitory rates and IC₅₀ values. As controls, an additional 24 compounds ranked solely by docking scores were also tested.
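The consensus selection step can be sketched as an intersection of two ranked lists. This sketch is illustrative: compound names and scores are invented, it skips the initial top-100,000 docking prefilter, and it assumes that more negative docking scores and higher ExptGMS scores are better.

```python
def top_n(scores, n, lower_is_better):
    """Return the n best compound IDs from a {compound: score} mapping."""
    ranked = sorted(scores, key=scores.get, reverse=not lower_is_better)
    return ranked[:n]


def consensus_hits(docking, exptgms, n_dock, n_ed):
    """Compounds in both the top n_dock by docking and the top n_ed by ExptGMS."""
    top_dock = set(top_n(docking, n_dock, lower_is_better=True))
    # Keep the ExptGMS ranking order among the shared compounds.
    return [c for c in top_n(exptgms, n_ed, lower_is_better=False) if c in top_dock]


# Toy scores for four hypothetical compounds.
docking = {"a": -9.1, "b": -8.7, "c": -7.0, "d": -6.2}
exptgms = {"a": 0.4, "b": 0.9, "c": 0.8, "d": 0.95}
hits = consensus_hits(docking, exptgms, n_dock=2, n_ed=3)
```

In the screening described above, the corresponding cutoffs were 1000 for the docking ranking and 500 for the ExptGMS ranking.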

SARS-CoV-2 3CLpro (EC: 3.4.22.69) is a 3C-like proteinase that recognizes substrates containing the core sequence [ILMVF]-Q-↓-[SGACN]³³,³⁴. The inhibition potency of a potential inhibitor was determined by a fluorescence resonance energy transfer (FRET)-based assay using the FRET-compatible peptide substrate MCA-AVLQ↓SGFR-Lys(Dnp)-Lys-NH₂ (“↓” indicates the cleavage site). MCA fluorescence is quenched by the Dnp group until cleavage separates the two. The excitation maximum of MCA is 320 nm, and its emission maximum is 405 nm. The activity of 3CLpro was detected by measuring fluorescence. The protease inhibition rates of the compounds were measured as follows: each reaction mixture contained 0.15 µM 3CLpro (carrying a P132H mutation, 3CLpro^P132H) and 40 µM inhibitor in a total volume of 120 µL in 96-well black polystyrene, flat-bottom plates (Labselect, China). For IC₅₀ determination, the reaction mixtures contained 0.15 µM 3CLpro^P132H and different concentrations of inhibitor in a total volume of 120 µL. 3CLpro^P132H was preincubated with the compound for 30 min at room temperature. Subsequently, the FRET peptide substrate MCA-AVLQSGFR-Lys(Dnp)-Lys-NH₂ was added to the reaction mixtures to initiate the reaction. Fluorescence was recorded for 20 min at 10 s intervals using 340 nm excitation and 405 nm emission filters on a multimode microplate reader (Thermo Scientific™ Varioskan™ LUX). The IC₅₀ values were determined by curve fitting with a four-parameter equation in GraphPad Prism 8.
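The four-parameter equation is not written out in the text. A commonly used form is the four-parameter logistic, sketched below; the parameter names and the example values for bottom, top, and Hill slope are assumptions for illustration, and no curve fitting is performed.

```python
def four_param_logistic(conc, bottom, top, ic50, hill):
    """Predicted response (e.g. % enzyme activity) at inhibitor concentration `conc`.

    `conc` and `ic50` must share the same units; `hill` is the slope factor.
    """
    return bottom + (top - bottom) / (1.0 + (conc / ic50) ** hill)


# Illustrative parameters: activity runs from 100 % (no inhibitor) down to 0 %.
activity_at_ic50 = four_param_logistic(1.9, bottom=0.0, top=100.0, ic50=1.9, hill=1.0)
activity_at_40uM = four_param_logistic(40.0, bottom=0.0, top=100.0, ic50=1.9, hill=1.0)
```

By construction, the response at a concentration equal to the IC₅₀ lies halfway between the top and bottom plateaus.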

Reporting summary

Further information on research design is available in the Nature Portfolio Reporting Summary linked to this article.

Data availability

The data analyzed in this study are included in this published article and its Supplementary Information files. Three additional supplementary data files are provided: Supplementary Data 1 contains more details about Fig. 2, and Supplementary Data 2 and 3 contain more details about Table 1. Partial code is available from the corresponding author upon reasonable request. Our model is offered as a service for academic users at https://exptgms.stonewise.cn/#/create.

References

1. Clyde, A. et al. High-throughput virtual screening and validation of a SARS-CoV-2 main protease noncovalent inhibitor. J. Chem. Inf. Model. 62, 116–128 (2022).
2. Giordano, D., Biancaniello, C., Argenio, M. A. & Facchiano, A. Drug design by pharmacophore and virtual screening approach. Pharmaceuticals 15, 646 (2022).
3. Maia, E. H. B., Assis, L. C., de Oliveira, T. A., da Silva, A. M. & Taranto, A. G. Structure-based virtual screening: from classical to artificial intelligence. Front. Chem. 8, 343 (2020).
4. Bender, B. J. et al. A practical guide to large-scale docking. Nat. Protoc. 16, 4799–4832 (2021).
5. Feng, M., Heinzelmann, G. & Gilson, M. K. Absolute binding free energy calculations improve enrichment of actives in virtual compound screening. Sci. Rep. 12, 13640 (2022).
6. Miller, E. B. et al. Reliable and accurate solution to the induced fit docking problem for protein-ligand binding. J. Chem. Theory Comput. 17, 2630–2639 (2021).
7. Mishra, S. K. & Koca, J. Assessing the performance of MM/PBSA, MM/GBSA, and QM-MM/GBSA approaches on protein/carbohydrate complexes: effect of implicit solvent models, QM methods, and entropic contributions. J. Phys. Chem. B 122, 8113–8121 (2018).
8. Guedes, I. A. et al. New machine learning and physics-based scoring functions for drug discovery. Sci. Rep. 11, 3198 (2021).
9. Dong, L., Qu, X. & Wang, B. XLPFE: a simple and effective machine learning scoring function for protein-ligand scoring and ranking. ACS Omega 7, 21727–21735 (2022).
10. Mehrabi, P. et al. Liquid application method for time-resolved analyses by serial synchrotron crystallography. Nat. Methods 16, 979–982 (2019).
11. Riley, B. T. et al. qFit 3: protein and ligand multiconformer modeling for X-ray crystallographic and single-particle cryo-EM density maps. Protein Sci. 30, 270–285 (2021).
12. Ding, K. et al. Observing noncovalent interactions in experimental electron density for macromolecular systems: a novel perspective for protein-ligand interaction research. J. Chem. Inf. Model. 62, 1734–1743 (2022).
13. Wang, L. et al. A pocket-based 3D molecule generative model fueled by experimental electron density. Sci. Rep. 12, 15100 (2022).
14. Kasai, H. et al. X-ray electron density investigation of chemical bonding in van der Waals materials. Nat. Mater. 17, 249–252 (2018).
15. Mysinger, M. M., Carchia, M., Irwin, J. J. & Shoichet, B. K. Directory of useful decoys, enhanced (DUD-E): better ligands and decoys for better benchmarking. J. Med. Chem. 55, 6582–6594 (2012).
16. Shen, C. et al. Beware of the generic machine learning-based scoring functions in structure-based virtual screening. Brief. Bioinforma. 22, bbaa070 (2021).
17. Schreyer, A. M. & Blundell, T. USRCAT: real-time ultrafast shape recognition with pharmacophoric constraints. J. Cheminforma. 4, 27 (2012).
18. Wang, Y. et al. TF3P: three-dimensional force fields fingerprint learned by deep capsular network. J. Chem. Inf. Model. 60, 2754–2765 (2020).
19. Afonine, P. V. et al. Real-space refinement in PHENIX for cryo-EM and crystallography. Acta Crystallogr. D. Struct. Biol. 74, 531–544 (2018).
20. Ghanakota, P. & Carlson, H. A. Driving structure-based drug discovery through cosolvent molecular dynamics. J. Med. Chem. 59, 10383–10399 (2016).
21. Zhao, L., Pu, M., Wang, H., Ma, X. & Zhang, Y. J. Modified electrostatic complementary score function and its application boundary exploration in drug design. J. Chem. Inf. Model. 62, 4420–4426 (2022).
22. Friesner, R. A. et al. Glide: a new approach for rapid, accurate docking and scoring. 1. Method and assessment of docking accuracy. J. Med. Chem. 47, 1739–1749 (2004).
23. Burley, S. K. et al. RCSB Protein Data Bank (RCSB.org): delivery of experimentally-determined PDB structures alongside one million computed structure models of proteins from artificial intelligence/machine learning. Nucleic Acids Res. 51, D488–D508 (2023).
24. Liebschner, D. et al. Macromolecular structure determination using X-rays, neutrons and electrons: recent developments in Phenix. Acta Crystallogr. D. Struct. Biol. 75, 861–877 (2019).
25. Le Guilloux, V., Schmidtke, P. & Tuffery, P. Fpocket: an open source platform for ligand pocket detection. BMC Bioinforma. 10, 168 (2009).
26. RDKit: open-source cheminformatics. http://www.rdkit.org (2021).
27. Mount, D. W. Using the basic local alignment search tool (BLAST). CSH Protoc. 2007, pdb.top17 (2007).
28. Pedregosa, F. et al. Scikit-learn: machine learning in Python. J. Mach. Learn. Res. 12, 2825–2830 (2011).
29. McKinney, W. Data structures for statistical computing in Python. In Proc. 9th Python in Science Conf. (eds van der Walt, S. & Millman, K. J.) 56–61 (2010).
30. Harris, C. R. et al. Array programming with NumPy. Nature 585, 357–362 (2020).
31. Hunter, J. D. Matplotlib: a 2D graphics environment. Comput. Sci. Eng. 9, 90–95 (2007).
32. Waskom, M. L. Seaborn: statistical data visualization. J. Open Source Softw. 6, 3201 (2021).
33. Jin, Z. et al. Structure of M(pro) from SARS-CoV-2 and discovery of its inhibitors. Nature 582, 289–293 (2020).
34. Zhang, L. et al. Crystal structure of SARS-CoV-2 main protease provides a basis for design of improved alpha-ketoamide inhibitors. Science 368, 409–412 (2020).

Acknowledgements

This study was funded by the National Key R&D Program of China (grant number 2022YFF1203004). This study was also supported by the Emergency Key Program of Guangzhou Laboratory (Grant No. EKPG21-30-2) and R&D Program of Guangzhou Laboratory (Grant No. SRPG22-011). This work was also supported by the Beijing Municipal Science and Technology Commission (No. Z211100003521001).

Author information

These authors contributed equally: Wenzhi Ma, Wei Zhang, Yuan Le.

Authors and Affiliations

Beijing StoneWise Technology Co Ltd., Haidian Street #15, Haidian District, 100080, Beijing, China
Wenzhi Ma, Yuan Le, Xiaoxuan Shi, Qingbo Xu, Yang Xiao, Yueying Dou, Xiaoman Wang, Wenbiao Zhou, Hongbo Zhang & Bo Huang
State Key Laboratory of Respiratory Disease, First Affiliated Hospital of Guangzhou Medical University, 510182, Guangzhou, China
Wei Zhang & Wei Peng
Innovation Center for Pathogen Research, Guangzhou Laboratory, 510320, Guangzhou, China
Wei Zhang & Wei Peng

Contributions

B.H. and H.Z. conceived the study. B.H. provided instructions for all experiments. W.M. processed the ED map and conducted ExptGMS construction and evaluation. W.Z. conducted the biochemical tests. Y.L. developed the GBDT model. X.S. developed the scoring function. Y.X., Q.X. and W.Z. conducted the 3CLpro virtual screening. Y.D. and X.W. built online services. W.P. provided instructions for the biochemical assays. W.Z. provided instructions for developing the machine-learning model.

Corresponding authors

Correspondence to Hongbo Zhang or Bo Huang.

Ethics declarations

Competing interests

The authors declare no competing interests.

Peer review

Peer review information

Communications Chemistry thanks Diogo Santos-Martins and James Fraser for their contribution to the peer review of this work. A peer review file is available.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Peer Review File

Supporting Information

Description of Additional Supplementary Files

Supplementary Data 1

Supplementary Data 2

Supplementary Data 3

Reporting Summary

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

About this article

Cite this article

Ma, W., Zhang, W., Le, Y. et al. Using macromolecular electron densities to improve the enrichment of active compounds in virtual screening. Commun Chem 6, 173 (2023). https://doi.org/10.1038/s42004-023-00984-5

Received: 18 April 2023
Accepted: 15 August 2023
Published: 22 August 2023
DOI: https://doi.org/10.1038/s42004-023-00984-5
