QSAR Classification Models for Predicting the Activity of Inhibitors of Beta-Secretase (BACE1) Associated with Alzheimer’s Disease

Ponzoni, Ignacio; Sebastián-Pérez, Víctor; Martínez, María J.; Roca, Carlos; De la Cruz Pérez, Carlos; Cravero, Fiorella; Vazquez, Gustavo E.; Páez, Juan A.; Díaz, Mónica F.; Campillo, Nuria E.

doi:10.1038/s41598-019-45522-3

Download PDF

Article
Open access
Published: 24 June 2019

QSAR Classification Models for Predicting the Activity of Inhibitors of Beta-Secretase (BACE1) Associated with Alzheimer’s Disease

Ignacio Ponzoni ORCID: orcid.org/0000-0002-6923-9592^1,2^na1,
Víctor Sebastián-Pérez^1,3^na1,
María J. Martínez^1,2^na1,
Carlos Roca³,
Carlos De la Cruz Pérez³,
Fiorella Cravero⁴,
Gustavo E. Vazquez⁵,
Juan A. Páez⁶,
Mónica F. Díaz ORCID: orcid.org/0000-0002-6680-8067^4,7 &
…
Nuria E. Campillo³

Scientific Reports volume 9, Article number: 9102 (2019) Cite this article

11k Accesses
36 Citations
16 Altmetric
Metrics details

Subjects

Abstract

Alzheimer’s disease is one of the most common neurodegenerative disorders in elder population. The β-site amyloid cleavage enzyme 1 (BACE1) is the major constituent of amyloid plaques and plays a central role in this brain pathogenesis, thus it constitutes an auspicious pharmacological target for its treatment. In this paper, a QSAR model for identification of potential inhibitors of BACE1 protein is designed by using classification methods. For building this model, a database with 215 molecules collected from different sources has been assembled. This dataset contains diverse compounds with different scaffolds and physical-chemical properties, covering a wide chemical space in the drug-like range. The most distinctive aspect of the applied QSAR strategy is the combination of hybridization with backward elimination of models, which contributes to improve the quality of the final QSAR model. Another relevant step is the visual analysis of the molecular descriptors that allows guaranteeing the absence of information redundancy in the model. The QSAR model performances have been assessed by traditional metrics, and the final proposed model has low cardinality, and reaches a high percentage of chemical compounds correctly classified.

Therapeutic peptides: current applications and future directions

Article Open access 14 February 2022

Lei Wang, Nanxi Wang, … Caiyun Fu

Neurofilaments as biomarkers in neurological disorders — towards clinical application

Article 12 April 2024

Michael Khalil, Charlotte E. Teunissen, … Jens Kuhle

Discovery of potent inhibitors of α-synuclein aggregation using structure-based iterative learning

Article Open access 17 April 2024

Robert I. Horne, Ewa A. Andrzejewska, … Michele Vendruscolo

Introduction

Alzheimer’s disease (AD) is a chronic and irreversible brain disorder, which mostly affect to age people. This neurodegenerative disease is characterized by steady cognitive impairment, short-term memory loss, and problems with language. AD constitutes one of the main causes of death in the world, representing the majority of cases of dementia¹. Actually, it is estimated that 47 million people live with dementia around the world, and according to projections it is expected that the number of cases grows to more than 131 million by 2050². Besides, dementia has an enormous economic impact. During 2016, the total estimated global cost of dementia was US$818 billion, and it will become a trillion-dollar disease by 2018.

Ample experimental studies about the hallmarks of AD conclude that deposition of amyloid plaques in the brains of Alzheimer’s patients constitutes one of the crucial causes of the disease progression^3,4. The essential component of the amyloid plaques is the amyloid-β protein (Aβ)⁵, which is generated by subsequent cleavage of the amyloid precursor protein (APP) by two proteolytic enzymes β- and γ-secretase^6,7. The whole biochemical mechanism of proteolytic cleavage depends on the protein-protein interactions between β-site amyloid cleavage enzyme 1 (BACE1) and APP⁸. Due to several failures in clinical trials, there is currently some controversy with the use of BACE1 inhibitors. The possible reasons of the failure of BACE1 inhibitors are mainly three. One of them is that BACE1 inhibitors fail as they prevent amyloid production later in the course of illness and may be more effective if used earlier. The second one is about the complexity of AD, given the multifaceted nature of AD, it is unrealistic to expect that BACE1 inhibitors will be work alone^9,10. And the last reason is the latter reason seems to be related to the side effects, since the conclusions of different studies show that to block completely the activity of BACE1 is not advisable due to severe side effects¹¹. But even with these critical points, this way, the inhibition of BACE1, is a promising therapeutic strategy for instance BACE1 inhibitors with a multitarget profile¹². It is evident that still more effort is needed today in this field.

During last years, several quantitative structure-activity relationships (QSAR) models have been developed in order to predict potential inhibitors for protein BACE1^{13,14,15,16,17}. QSAR methods correlate molecular structure to different biological properties such as activity or ADMET properties, providing a relevant data to help during the development of drug design projects¹⁸. A key step in QSAR studies is the definition or codification of the chemical structure by a diversity of molecular descriptors, such as constitutional, topological, thermodynamic, functional groups, quantum mechanical, geometrical, etc. Nowadays, the development of new cheminformatics software allows to calculate thousands of molecular descriptors¹⁹, but usually only a small subset of the calculated descriptors brings necessary information for generating the QSAR model of interest. Consequently, the accuracy of these models depends on the correct analysis and selection of computed descriptors as independent variables for the QSAR model definition. For all these reasons, the design of an effective QSAR model constitutes a challenging problem²⁰.

Other kind of studies has employed the QSAR models to predict, using a new approach like Fragment Based-QSAR and Group Based-QSAR. This kind of studies consists of counting different chemical fragments and groups in the desired compounds or leads in order to optimize the QSAR models^15,21,22. Goyal et al.²³ used a group of 20 derivatives of the family of dihydropyridines (DHP) due to their good activities against BACE1 enzyme. Using Vlife MDS software²⁴, 705 physicochemical descriptors were obtained and reduced to 311 independent variables. Several selection methods such a step-wise and searching algorithm among others were employed to correlate the biological information to descriptors information. Finally, statistical methods like partial least square (PLS), multiple regression and other techniques were used to obtain models with good correlation and predictive values.

On the other hand, several authors have performed QSAR approaches based on target properties that are correlated to the most important ligand-target interactions¹⁴ used PoseView^25,26 and Ligand Explorer software²⁷ to find the most important ligand-target interactions of structures deposited in the PDB Bank²⁸ visualizing and adjusting the best distance between the atom’s interactions. In this sense, two descriptors were obtained by the last procedure to predict the activity of BACE1 inhibitory compounds, the hydrophobic contacts at 4–5 Å and the number of hydrogen bonds between ligand and target.

Finally, Gupta et al.¹⁷ published a QSAR model integrated by four molecular descriptors that encoded 3D features of the compounds. These descriptors were chosen by combining a ranking approach, which computes the Pearson correlation between each descriptor and the biological activity, and a forward/backward selection approach. Using these descriptors, different QSAR models are inferred by using multiple linear regressions. As a final point, it is interesting to mention that Gupta employs a methodology for computing the molecular descriptors equivalent to the applied in our study, for this reason, we have decided to compare our QSAR models with those obtained by Gupta´s group.

In this paper, we present novel QSAR models for predicting putative inhibitors for protein BACE1. Our proposal combines the application of several machine learning methods, model hybridizing strategies, backward elimination and visual analytics, in order to choose the most informative subset of molecular descriptors for building the QSAR models. Another distinctive aspect of our approach is the characterization of the QSAR modelling problem as a classification system, which helps to achieve a straightforward interpretation of the predictions. The QSAR classification models obtained from our experiments have been analysed, hybridized and compared, from a chemical and mathematical perspective, with Gupta’s models to compare the strengths and weaknesses of proposed models as virtual screening methods.

Results

Database

A database with 215 molecules was assembled, where the half maximal inhibitory concentration (IC₅₀) values of the compounds were extracted from the literature and web servers (the complete database is shown in the Table S1 of the Supporting Information). All the compounds used on the dataset were obtained from different sources like pharmaceutical industries or web servers like PDB Bank and CheMBL²⁹ among others. In order to obtain robust models with a wide applicability domain, the dataset is formed by compounds chemically diverse, with different scaffolds, reflecting a wide chemical space. All of them have been tested experimentally on the BACE1 enzyme with a reported IC₅₀ value.

Is important have a representative chemical space in QSAR studies, consequently, is necessary to build a structural diverse set of compounds in order to achieve it. Following this aim, the dataset was analysed characterizing its drug-like properties from a physicochemical point of view. Thus, two different approaches have been used to analyze our datasets diversity, from the point of view of structural and drug-like.

For the drug-like analysis of the compounds, we used a Qikprop’s module: Small-Molecule Drug Discovery Suite in Schrödinger³⁰, which through a set of mathematical functions and structural analysis allows to predict physical-chemical properties such as polar surface area, logP and others. Among the set of properties that the module is able to analyze; for Fig. 1, the molecular weight (MW) was chosen and compared to the coefficient QPlogBB and colored by the percentage of human oral absorption (%); these last two are physical-chemical properties predicted by the program.

Molecular weight property, included as one of the 4 Lipinski’s rules³¹ is one of the parameters analyzed for the dataset. In general, compounds with a molecular weight greater than 500 will not be good candidates as drugs-like compounds, this is considered as a determinant factor. However, in this case, the special topology of the BACE1 enzyme, which presents a large cavity around 10–15 Å, makes important for the inhibition to stablish several protein-ligand interactions in different key points of the pocket far from one another. For this reason, BACE1 drug discovery programs are based in bigger molecules in order to obtain low nanomolar inhibitors comparing with other targets.

The QPlogBB coefficient represents a prediction of the ability of the different compounds to cross the blood-brain barrier for the analysed compounds. Due to the location of the BACE1 enzyme, this property has to be taken into account when selecting compounds that are desired to present some type of inhibitory enzymatic effect in vivo. The calculated values of that parameter are in the following interval [−3.0/1.2], the compounds in the dataset present a wide distribution for this value in Fig. 1A. The more positive the value is, the more ability of the compound for crossing the blood-brain barrier. On the contrary, compounds that present negative values theoretically will not be able to cross it satisfactorily. As a result, it is observed that the majority of the compounds in the database is in the upper side of the figure. We can confirm the study with dopamine that present negative values because it is too polar and is not able to cross that barrier well.

Also the (%) Human oral absorption was analyzed due to the fact that drug-like compounds must be absorbed in an adequate rate by the human body as one of the key ADME (Absorption, Distribution, Metabolism and Excretion) properties. This value is obtained through a multiple linear regression of a series of parameters such as the number of rotatable bonds, and the permeability or solubility of the compounds at the time of predicting that property. In the Fig. 1A, we observe that the vast majority of the compounds present values of human oral absorption over 50%.

To conclude the characterization, Fig. 1B illustrates the drug-like behaviour of the dataset. The lower number of starts and violation number of Lipisnki rule, the better drug-like profile. Furthermore, most of these violations are in relation with MW, something expected due to the bigger size of molecules in medicinal chemistry programs for BACE1 inhibitors as stated before. Therefore, as no compounds present more than 2 violations of Lipinski rule of 5, it is possible to confirm that their properties make them likely orally active drugs in humans. A detailed supplementary statistical analysis of several properties is included in the Supporting Information (Table S2 and Figs S1–S6).

Data pre-processing

As it was mentioned before, the dataset is integrated by 215 molecules. The target variable for this study is BACE1 inhibitory activity (half maximal inhibitory concentration (IC₅₀) of each compound. The values are typically expressed as molar concentration. For classification purpose, it was defined one threshold (1000 nM) for discretization of the IC₅₀ values in order to define two classes of compounds: High Activity (HA) as inhibitors (IC₅₀ ≤ 1000 nM), and Low/Null Activity (LA/NA) as inhibitors (IC₅₀ > 1000 nM). Applying this criterion, the 215 molecules of this dataset are distributed in 126 HA compounds and 89 LA compounds.

Molecular descriptor calculation

DRAGON tool was the software used to compute the Molecular descriptors (MDs)³². In Table 1, a summary of the quantity of MDs calculated, clustered by family and type (0D, 1D, 2D, 3D and others), is detailed. A total number of 1867 MDs were finally incorporated to the BACE1 database after removing redundant descriptors with correlation coefficients bigger than 0.95.

Table 1 Number of molecular descriptors of each family computed for the database compounds.

Full size table

In silico experimental design

The employed protocol to develop QSAR models by feature selection is displayed in Fig. 2. In the first phase the IC₅₀ values are discretized using target discretization thresholds explained before. Next, these molecules were optimized to the configuration of minimum energy and, after that, 1867 molecular descriptors were computed using DRAGON software. After that, 25% of the molecules has been left apart for the last step of external validation, and the 75% of the remaining compounds were used for the feature selection and model construction steps. In the second phase, to select the subsets of molecular descriptors (MDs), we used three different approaches from the set of variables returned by DRAGON. The first approach uses DELPHOS tool, which run a machine learning method for selection of MDs in QSAR modelling³³. DELPHOS infers multiple alternative selections of MDs for defining a QSAR model by applying a wrapper method³⁴. In this case, twenty putative subsets had been computed. From them, we chosen two subsets, Subsets A and B (Table 2), since these subsets show the lowest relative absolute error (RAE) values reported by DELPHOS and small numbers of MDs.

Table 2 Molecular descriptors of DRAGON associated with the selected subsets.

Full size table

The second one was generated by WEKA tool³⁵, applying as feature selection method the Wrapper Subset Evaluator with Random Forest as classifier and Best First technique as Search Method. The selected subset is integrated by ten MDs and it was named Subset C. The most elevated cardinality of this subset is manageable but not desirable, because the physicochemical interpretation of resulting QSAR models usually became a cumbersome and time-consuming process. Besides, the QSAR models integrated by many variables usually suffer of poor generalization in statistical terms. The last one was provided by the scientific literature. In particular, the Subset D corresponds to the selection of four MDs recommended in Gupta et al.¹⁷.

Later, the performance of these four subsets has been evaluated by inferring QSAR classification models. All classifiers have been generated by WEKA software using alternative machine learning methods: the Neural Networks (NN), the Random Forest (RF), and the Random Committee (RC). Recent studies have shown that does not exist a more advisable strategy for learning the QSAR models from the subsets of descriptors³⁶. Random Forest and Random Committee are ensemble methods that combine different models with the aim to obtain accurate, robust and stable predictions. The first one implements an ensemble of decision trees where each tree is trained with a random sample of the data and the growth of these trees is carried out with a random selection of features. In a similar way, Random Committee allows building an ensemble of a base classifier that is chosen, for example, a neural network or a decision tree. On the other hand, Neural Networks are configurations of artificial neurons interconnected and organized in different layers to transmit information. The input data crosses the neural network through various operations and then the output values are computed. In this sense, we decided to test these several methods to infer the classifiers. The parameter settings provided by default for WEKA, were used in the experiments for each inference method. Several metrics were calculated using WEKA, regarding the performance assessment: the percentage of cases correctly classified (%CC), the average receiver operating characteristic (ROC) area, and the confusion matrix (CM). In all cases, the stratified sampling and 10-fold cross validation methods provided per default by WEKA were applied. The best QSAR models obtained per each subset is reported in Table 3, where the classifier with best performance is highlighted.

Table 3 Performances of the best QSAR classifiers obtained per each subset during external validation. The best model is highlighted in bold.

Full size table

In the third phase, the first step corresponds to a QSAR model hybridization experiments. These strategies that combine MD subsets obtained from different methodologies has been useful tested in other scenarios^{37,38,39,40,41} and for this reason it was also evaluated in this work. The main goal of these experiments is to improve the accuracy obtained for the best model by adding features included in remaining subsets. Following this idea, three hybridized subsets were defined by combining the subset D, which achieves the best performance in Table 3, with each other subsets in a pairwise fashion. Besides, a fourth hybrid subset was defined as the union of all subsets. These straightforward hybridizations are reported in Table 4.

Table 4 Hybridized subset obtained from the union of different subsets.

Full size table

Once the hybridized subsets were defined, the next step was the evaluation of them by inferring QSAR classification models employing identical experimental conditions and criteria applied to the original non-hybridized subsets (Table 5). Subset HS1 was the best hybridized subset considering such as accuracy as low cardinality.

Table 5 Performances of the best QSAR classifiers obtained per each hybridized subset during external validation. The best model is highlighted in bold.

Full size table

From this table, it is possible to observe that the QSAR model inferred from the pairwise hybridization between subsets A and D overcame the performance obtained by the models generated from these subsets by alone. However, at this point, it is valid to ask whether there is any subset of the hybridized subset HS1, different from subsets A and D (HS1) that can achieve models with even better performance. This is possible because some of the molecular descriptors provided by subsets A and D can be redundant or introduce some noise to the classification procedure affecting the generalization properties of the QSAR models.

To evaluate the relevance of each molecular descriptor a combinatorial reduction analysis, commonly known as backward elimination was performed. This analysis consists of eliminating a molecular descriptor of HS1 each time and calculating its performance. This process is repeated until not improvement of the performance is observed (see Tables S3–S5 of the Supporting Information). The higher performance was obtained by removing two MD with a cardinality subset of 6. The results of the best QSAR classifiers obtained in each step are included in Table 6, where the minus operator denotes the deletion of one MD from the subset. In the first step, eight subsets with cardinality seven were obtained by removing one molecular descriptor from HS1 at once, and their performances were tested following the same experimental conditions described before. The higher performance is obtained by removing the molecular descriptor MW (%CC = 85%), improving the HS1 accuracy (first row of the Table 6).

Table 6 Performances during external validation of the best QSAR classifiers inferred for HS1 reduced subsets in each step. The final model has 6 molecular descriptors, an 85% of cases correctly classified and a ROC curve of 0.88.

Full size table

In the second reduction step, seven subsets of cardinality six were obtained by alternative removing one MD at once from the reduced subset HS1-MW. The performance of the best QSAR model inferred from each reduced subset is reported in the second row of the Table 6, where the model with higher performance is obtained using HS1-MW-RDF080m. This model preserves the %CC de subset HS1-MW but using one MD less and increases the value of the ROC curve to 0.88. In the third backward elimination step, six subsets of cardinality five were obtained by alternative removing one MD at once from the subset HS1-MW-RDF080m. The performances of the best QSAR model inferred from each of these new reduced subsets are reported in the third row of the Table 6. In this case, the model with higher accuracy corresponds to HS1-MW-RDF080m-N-069, but the performance falls to 83%. For this reason, the final best model (HS1-MW-RDF080m) includes the final selection of MDs is integrated by the following subset of MDs: GGI7, H1e, H6m, Mor31p, N-069 and nCrs. As summary, Fig. 3 shows a performance comparison of best QSAR models obtained in each step of our experimental work. The percentage of corrected classified molecules is expressed as a ratio in order to improve the plot visualization.

It is important to note that this final subset does not fully include the subset A neither the subset D, because one descriptor of both subsets was removed during the reduction of the hybrid subset HS1. One logical question that can be formulated at this point it is why the traditional feature selection approaches (DELPHOS, WEKA and Gupta’s methodology) do not find directly this subset of cardinality six. This occurs because feature selection constitutes a NP-hard problem in computational complexity theory and, therefore, any algorithm used for addressing this problem will explore some part of the huge combinatorial space associated to all possible subsets of MDs. For this reason, the hybridization of subsets obtained by different FS strategies together with the backward elimination can be useful for improving the solutions provided by traditional FS methods.

Concerning to their physicochemical meanings, H1e and H6m are GETAWAY (Geometric Topology and Atom Weights Assembly) descriptors. H1e is an H index autocorrelation of lag 1 weighted by Sanderson electronegativity and H6m is an H index autocorrelation of lag 6 weighted by atomic mass. This type of descriptors is related to the influence of atoms in the determination of the molecular form. On the other hand, GGI7 belong to 2D autocorrelations descriptors class and is the topological charge index of order 7. Another descriptor is nCrs that represents the number of ring secondary C (sp3) and belongs to the Functional group counts. The descriptor N-069 belongs to the Atom-centered fragments class and indicates the number of substructures in which a sp3 nitrogen atom is connected by a simple bond to an electronegative atom or an aromatic substituent. Lastly, Mor31p is a 3D-MoRSe (Molecule Representation of Structures Based on Electron diffraction) descriptor that represents the signal 31 weighted by polarizability. This class of descriptors captures the three-dimensional structure of a molecule and expresses distinctive characteristics of it.

Finally, as last phase of the proposed methodology, it was analyzed the pairs correlation between these six descriptors by using VIDEAN³⁹. This tool uses visual analytical methods for molecular descriptors analysis in statistical terms. Figure 4 shows the relationship between the descriptors in the Kendall correlation mode. In this representation, the light orange and light blue tones of the edges (correlation) between the nodes (descriptors) are showing a low level of correlation. This result is the expected one, confirming that each descriptor is contributing unique information to the model.

Random experiments for assessing the risk of correlation by chance

In this section, we present two random experiments in order to assess the risk of chance correlation both in the final subset of MDs selected by our methodology and in the final QSAR model inferred from these MDs. Whenever in a QSAR modelling method a “best” combination of a few (m) descriptors is selected from a pool of many (M) descriptors in order to best fit given target variable, there is an enhanced risk of chance correlation^42,43. The risk is boosted (compared to using a predefined subset of descriptors) due to the number of possible models considered, which for increasing m and M speedily exponentially grows to huge orders of magnitude:

$${\rm{MCm}}={\rm{M}}!/({\rm{m}}!(M-{\rm{m}})!)$$

Among so many potential subsets of m descriptors, a few of these combinations are likely to fit the data reasonably well by chance, i.e., without having a true association with the response variable. Therefore, descriptor selection methods, led by a typical measure of fit, are blind with respect to presence or absence of such a true association. Nowadays this problem, also now as selection bias6, has become urgent because the number of molecular descriptors available in computer programs has been significantly increasing during last decades⁴⁴.

In order to address this problem, the first step consists into evaluating if the final subset of MDs selected by our methodology has an accuracy performance significantly higher than the achieved by MD subsets selected randomly. Once this risk can be left behind, the second step should be to determine if our proposed QSAR model, inferred from the final subset of MDs by using machine learning, truly learned how to segregate compounds among the different classes associated to the target variable. In other words, we want to assess if our QSAR model does not classify the compounds randomly.

For the first step, a feature selection randomization experiment (fs-randomization) has been executed. In this experiment, a thousand combinations of six MDs, the same cardinality of the final subset, selected by our methodology, has been randomly selected from the original pool of features integrated by 1867 MDs (see Fig. 5). Therefore, from each random subset, a QSAR model is inferred following the same experimental conditions and criteria used for learning our final QSAR model. Finally, the accuracies (as a percentage of corrected classified samples) of the QSAR models obtained from these random subsets had been computed. The average accuracy (%CC) obtained by the fs-randomization was 69%, and only three of the thousand random trials achieved a performance similar to our final subset (around 85%). Therefore, the performance of the MD subset selected by our methodology is located in the percentile 99 of the accuracies distribution obtained by the fs-randomization, showing a performance significantly higher than the random subsets.

For this second step, a y-randomization experiment has been executed. This method consists of randomly shuffled the values of the target variable (y-variable) both in the training set and external validation set, leaving the MDs values intact. Then, a new QSAR model is applied to these scrambled data following the same experimental conditions used to infer the original QSAR model. Every run will yield estimates of the accuracy of the QSAR model, which are recorded. If in each case the scrambled data give much lower accuracy values than the original data, then we can be confident about the relevance of the “real” QSAR model. This methodology is “probably the most powerful validation procedure” for assessing the risk of obtaining QSAR models by chance correlation⁴⁵, and combined with the fs-randomization allows us to achieve confident QSAR models⁴⁴. The average accuracy (%CC) obtained by the y-randomization was 53% (see Fig. 6), and none of the thousand random trials achieved a performance similar to our final subset (best random performance was 75%). Therefore, from these results, we can confidently discard the risk of chance correlation in the final QSAR model proposed in this work.

Materials and Methods

Preparation of databases

Ligand preparation

The BACE1 dataset on SMILES format were converted to 3D structures using LigPrep⁴⁶ software implemented on Maestro Suite⁴⁷. LigPrep is a 2D-to-3D conversion application that carries out the addition of hydrogen atoms and includes the generation of possible tautomers, stereoisomers and ring conformations using molecular mechanics force fields. The tool also calculates the ionization state of the molecule at a selected pH range. In order to perform our studies, possible states were generated at pH 7.3 with the aim of obtaining the most suitable ionization states of the molecules at physiological pH. The ionization states were assigned with Epik module⁴⁸. Also, no tautomers were generated and all the compounds were desalted. In this process, the search has been restricted to obtain only one low energy ring conformation, as well as one stereoisomer among all that can be found by the tool. The final step of the preparation is an energy minimization of the 3D conformers generated using the OPLS2005 force field⁴⁹.

Different ionization states and conformers of the same molecules were reduced to keep the most suitable 3D structure per initial compound. This preparation is a critical step to carry out further studies with these compounds, such as the physicochemical properties calculation to characterize the dataset.

Drug-like properties calculation

All the prepared molecules were studied using Qikprop application of the Small-Molecule Drug Discovery Suite in Schrödinger, an accurate software that predicts structurally significant 2D and 3D properties and pharmaceutically relevant characteristics of chemical compounds. Absorption, Distribution, Metabolism, and Excretion (ADME) properties were predicted using this tool where a total of 44 properties are calculated. The program also calculates properties like molecular weight, polar surface area, molecular volume, QPlogPo/w (predicted octanol/water partition coefficient), number of H-bond donors and acceptor groups, and violations related to the Lipinski’s Rule of 5 and Jorgensen’s Rule of 3 and allows to filter out compounds with clear-cut undesirable properties for drug discovery.

Statistical analysis of the database

All the data obtained in the previous step was further analyzed, using SPSS software⁵⁰. Statistical parameters were calculated for the database of compounds, such as modal value, average, median, standard deviation or variance for some key physicochemical parameters of the analysis. Furthermore, several histograms are shown in the Supporting Information where the frequency of the different values can be found grouped by properties.

Software used for processing molecular descriptors

DRAGON³² is a calculation of molecular descriptors tool. It provides almost 5,000 molecular descriptors of different types: 0D, 1D, 2D, and 3D. These molecular descriptors can be used to evaluate molecular structure-activity/property relationships of molecule databases. DRAGON required to calculate their molecular structure files, and also can deal with H-depleted molecules and 2D-structures.

Machine learning tools used for feature selection and classification models

DELPHOS is a descriptors selection tool that implements a wrapper multi-objective optimization technique based on two phases. In the first phase, a wrapper method and different machine learning algorithms are used to perform an exploration within the search space and find appropriate subsets of descriptors. Then in the second phase, a final subset selection is performed using the selections of the first phase and metrics for a more accurate prediction. In this way, DELPHOS allows identifying relevant subsets of descriptors related to the property under study^33,34,51.

VIDEAN is a visual analytics tool that combines statistical methods with interactive visualizations for helping in the selection of a subset of molecular descriptors for predicting a target property. It shows the relationships and interactions between the molecular descriptors and the target property in different information panels. A basic visualization used in this work consists in an undirected graph, where nodes represent each descriptor, the size of the node indicates the correlation between this node and the target property, and colored edges reflect the pairwise correlation between the descriptors that it connects. More detailed information about this software is available in Martinez et al. 2015³⁹.

Weka tool is a machine learning algorithms suit, for data mining tasks³⁵. The learning methods used in this work are:

Wrapper Subset Evaluator: method to evaluate a set of attributes using a learning technique. It uses cross validation to estimate the precision⁵².

Best first: is a method that performs searches in the space of subsets of attributes through a greedy hill climbing and using a backtracking strategy.

Neural Networks: a multilayer perceptron method that use back-propagation to classify instances. The network, is allowed to be monitored and modified during training time. The nodes are all sigmoid, except for numeric class.

Random Forest: allows constructing a Random Trees forest⁵³. The random trees considers K randomly chosen attributes at each node for build a tree. Performs no pruning, and allow class probabilities estimation (or target mean in the regression case) based on a hold-out set (back fitting).

Random Committee: this method allows to build an ensemble of randomized base classifiers. Each base classifier is built based on the same data, but using a different random number seed. The final prediction is a straight average of all predictions generated by each individual base classifiers.

Methods used for random experiments

These methods aim to evaluate the risk of random correlation in both the subset of molecular descriptors selected through a specific methodology and in the final QSAR model obtained from these descriptors.

fs-randomization (feature selection randomization): is a method that consists of randomly selecting a number n of molecular descriptors from the original features set, where n is the cardinality of the MDs subset that have been selected by a specific technique. Then, from these random MDs and with the original values of the target property, a new model is inferred following the same experimental criteria that were used to obtain the final QSAR model. Finally, the accuracy value (percentage of correctly classified samples) is reported. This method is applied repeatedly in order to obtain a distribution of values with statistical significance.

y-randomization: is a technique that randomly reorders the values of the target property (y-variable) both in the training set and the external validation set, without modifying the values of the MDs. To apply this technique, the n MDs of the final QSAR model are used. In this way, a new model is generated following the same experimental conditions that were used to obtain the final QSAR model, and the accuracy value is reported. Like fs-randomization, this process is repeated a fixed number of times in order to obtain a distribution of values with statistical significance.

Additional information on these and other techniques for randomizing experiments in QSAR modeling can be found in⁴⁴.

Conclusions

Alzheimer’s disease is one of the neurodegenerative disorders with stronger impact in elder population around the world. A promising target for its pharmacological treatment is the β-site amyloid cleavage enzyme 1 (BACE1). Several studies proposed that BACE1 inhibitors have high therapeutic potential for decelerating the long-term progression of AD, and during last decade several quantitative structure-activity relationships (QSAR) models have been proposed in the literature.

In this paper, a new QSAR model for virtual screening of potential inhibitors of BACE1 protein has been developed using a novel computational strategy. The main goal of the proposed strategy is to obtain accurate QSAR models integrated by a minimum number of molecular descriptors, because models with large number of descriptors suffer of poor generalizability and complex interpretability. QSAR approaches based on machine learning use feature selection methods for choosing the most informative subset of molecular descriptors, but all these algorithms only explore a fraction of the whole combinatorial space of potential subsets. Therefore, these methodologies cannot guarantee a QSAR model of minimum size. For this reason, our proposal combines different methodologies of feature selection, model hybridization approaches, backward elimination and visual analytics to improve the performance of the traditional QSAR methods.

Thanks to this methodology we have developed a robust QSAR model, improving around an 8% regarding the Gupta’s model in terms of both central performance metrics (percentage of corrected classified molecules and average ROC values) by preserving a low cardinality model. Additionally, the risk of chance correlation in the proposed QSAR model has been discarded by executing and analyzing both feature selection randomization and y-randomization experiments. Furthermore, the wide chemical diversity of the database used in our study compared to previous studies enhances the applicability of our model. Therefore, the results obtained by this novel strategy show that our approach contributed to achieve a QSAR model that can be a useful virtual screening method for prediction of BACE1 inhibitors. Nevertheless, as any QSAR model generated by machine learning, it is important to know that these classifiers preserve their levels of accuracy for molecules structurally similar to the chemical compounds used during the training of the model. For this reason, potential users of these models should be employ applicability domain methods. As a future work, we plan to evaluate this new methodology in the development of other classification models and continue testing our QSAR model for predicting BACE1 over other datasets.

Description of additional data files

The following additional data are available with the online version of this paper. “Supplementary Information” contains more information about the dataset, its statistical properties and additional results about intermediate calculations during the backward elimination analysis.

Data Availability

All data generated or analyzed are available upon request.

References

Burns, A. & Iliffe, S. Alzheimer’s disease. BMJ. 338, b158 (2009).
Article PubMed Google Scholar
Prince, M., Comas-Herreras, A., Knapp, M., Guerchet, M. & Karagiannidou, M. World Alzheimer report 2016: improving healthcare for people living with dementia: coverage, quality and costs now and in the future (2016).
Guo, T. & Hobbs, D. W. Development of BACE1 inhibitors for Alzheimer’s disease. Curr. Med. Chem. 13(15), 1811–29 (2006).
Article CAS PubMed Google Scholar
Cole, S. L. & Vassar, R. BACE1 structure and function in health and Alzheimer’s disease. Curr. Alzheimer Res. 5(2), 100–20 (2008).
Article CAS PubMed Google Scholar
Selkoe, D. J. Translating cell biology into therapeutic advances in Alzheimer’s disease. Nature. 399(6738), A23 (1999).
Article ADS CAS PubMed Google Scholar
Citron, M. Alzheimer’s disease: strategies for disease modification. Nat. Rev. Drug Discovery. 9(5), 387–98 (2010).
Article CAS PubMed Google Scholar
De Strooper, B., Vassar, R. & Golde, T. The secretases: enzymes with therapeutic potential in Alzheimer disease. Nat. Rev. Neurol. 6(2), 99–107 (2010).
Article PubMed PubMed Central Google Scholar
Zou, L., Yang, R., Zhang, P. & Dai, Y. The enhancement of amyloid precursor protein and beta-site amyloid cleavage enzyme 1 interaction: amyloid-beta production with aging. Int. J. Mol. Med. 25(3), 401–7 (2010).
Article CAS PubMed Google Scholar
Coimbra, J. R. et al. Highlights in BACE1 inhibitors for Alzheimer’s disease treatment. Front. Chem. 6 (2018).
Voytyuk, I., De Strooper, B. & Chavez-Gutierrez, L. Modulation of γ-and β-secretases as early prevention against Alzheimer’s disease. Biol. Psychiatry. 83(4), 320–327 (2018).
Article CAS PubMed Google Scholar
Chatila, Z. K. et al. BACE1 regulates proliferation and neuronal differentiation of newborn cells in the adult hippocampus in mice. eNeuro. 5(4) (2018).
Article PubMed PubMed Central Google Scholar
González-Naranjo, P. et al. Indazolylketones as new multitarget cannabinoid drugs. Eur. J. Med. Chem. 166, 90–107 (2019).
Article PubMed Google Scholar
Manoharan, P., Vijayan, R. S. & Ghoshal, N. Rationalizing fragment based drug discovery for BACE1: insights from FB-QSAR, FB-QSSR, multi objective (MO-QSPR) and MIF. studies. J. Comput.-Aided Mol. Des. 24(10), 843–64 (2010).
Article ADS CAS PubMed Google Scholar
Nastase, A. F. & Boyd, D. B. Simple structure-based approach for predicting the activity of inhibitors of beta-secretase (BACE1) associated with Alzheimer’s disease. J. Chem. Inf. Model. 52(12), 3302–7 (2012).
Article CAS PubMed Google Scholar
Huang, D. et al. Comprehensive 3D-QSAR and binding mode of BACE-1 inhibitors using R-group search and molecular docking. J. Mol. Graphics Modell. 45, 65–83 (2013).
Article ADS CAS Google Scholar
Chakraborty, S., Ramachandran, B. & Basu, S. Encompassing receptor flexibility in virtual screening using ensemble docking-based hybrid QSAR: discovery of novel phytochemicals for BACE1 inhibition. Mol. BioSyst. 10(10), 2684–92 (2014).
Article CAS PubMed Google Scholar
Gupta, K. Qsar studies on gallic acid derivatives and molecular docking studies of Bace1. enzyme–A potent target of Alzheimer disease. BIOEJ. 1(1), 11–27 (2014).
MathSciNet Google Scholar
Sullivan, K. M., Manuppello, J. R. & Willett, C. E. Building on a solid foundation: SAR and QSAR as a fundamental strategy to reduce animal testing. SAR QSAR Environ. Res. 25(5), 357–65 (2014).
Article CAS PubMed Google Scholar
Khan, A. U. Descriptors and their selection methods in QSAR analysis: paradigm for drug design. Drug discovery today. 21(8), 1291–302 (2016).
Article PubMed Google Scholar
Shahlaei, M. Descriptor selection methods in quantitative structure-activity relationship studies: a review study. Chem. Rev. 113(10), 8093–103 (2013).
Article CAS PubMed Google Scholar
Klebe, G., Abraham, U. & Mietzner, T. Molecular similarity indices in a comparative analysis (CoMSIA) of drug molecules to correlate and predict their biological activity. J. Med. Chem. 37(24), 4130–46 (1994).
Article CAS PubMed Google Scholar
Pandey, A., Mungalpara, J. & Mohan, C. G. Comparative molecular field analysis and comparative molecular similarity indices analysis of hydroxyethylamine derivatives as selective human BACE-1 inhibitor. Mol. Diversity. 14(1), 39–49 (2010).
Article CAS Google Scholar
Goyal, S., Dhanjal, J. K., Tyagi, C., Goyal, M. & Grover, A. Novel fragment-based QSAR modeling and combinatorial design of pyrazole-derived CRK3 inhibitors as potent antileishmanials. Chem. Biol. Drug Des. 84(1), 54–62 (2014).
Article CAS PubMed Google Scholar
VLifeMDS: Molecular Design Suite, Pune, India, 3rd edition (2004).
Stierand, K. & Rarey, M. Consistent two-dimensional visualization of protein-ligand complex series. J. Cheminf. 3(1), 21 (2011).
Article CAS Google Scholar
Schomburg, K., Ehrlich, H. C., Stierand, K. & Rarey, M. From structure diagrams to visual chemical patterns. J. Chem. Inf. Model. 50(9), 1529–35 (2010).
Article CAS PubMed Google Scholar
Moreland, J. L., Gramada, A., Buzko, O. V., Zhang, Q. & Bourne, P. E. The Molecular Biology Toolkit (MBT): a modular platform for developing molecular visualization applications. BMC Bioinf. 6, 21 (2005).
Article Google Scholar
Berman, H. M. et al. The Protein Data Bank. Nucleic Acids Res. 28(1), 235–42 (2000).
Article ADS MathSciNet CAS PubMed PubMed Central Google Scholar
Gaulton, A. et al. ChEMBL: a large-scale bioactivity database for drug discovery. Nucleic Acids Res. 40(D1), D1100–D1107 (2012).
Article CAS PubMed Google Scholar
QikProp, v. S., Schrödinger (2015).
Lipinski, C. A., Lombardo, F., Dominy, B. W. & Feeney, P. J. Experimental and computational approaches to estimate solubility and permeability in drug discovery and development settings. Adv. Drug Delivery Rev. 46(1–3), 3–26 (2001).
Article CAS Google Scholar
Dragon, Version 5.5, Talete srl (2007).
Soto, A. J., Martínez, M. J., Cecchini, R. L., Vazquez, G. E. & Ponzoni, I. DELPHOS: computational tool for selection of relevant descriptor subsets in ADMET prediction. 1st International Meeting of Pharmaceutical Sciences. (2010).
Soto, A. J., Cecchini, R. L., Vazquez, G. E. & Ponzoni, I. Multi‐objective feature selection in QSAR using a machine learning approach. QSAR Comb. Sci. 28(1112), 1509–1523 (2009).
Article CAS Google Scholar
Eibe, F., Hall, M. A. & Witten, I. H. The WEKA workbench. Online appendix for “Data Mining: practical machine learning tools and techniques”. (Morgan Kaufmann, 2016).
Eklund, M., Norinder, U., Boyer, S. & Carlsson, L. Choosing feature selection and learning algorithms in QSAR. J. Chem. Inf. Model. 54(3), 837–43 (2014).
Article CAS PubMed Google Scholar
Mansouri, K., Ringsted, T., Ballabio, D., Todeschini, R. & Consonni, V. Quantitative structure-activity relationship models for ready biodegradability of chemicals. J. Chem. Inf. Model. 53(4), 867–78 (2013).
Article CAS PubMed Google Scholar
Zakharov, A. V., Peach, M. L., Sitzmann, M. & Nicklaus, M. C. QSAR modeling of imbalanced high-throughput screening data in PubChem. J. Chem. Inf. Model. 54(3), 705–12 (2014).
Article CAS PubMed PubMed Central Google Scholar
Martinez, M. J., Ponzoni, I., Diaz, M. F., Vazquez, G. E. & Soto, A. J. Visual analytics in cheminformatics: user-supervised descriptor selection for QSAR methods. J. Cheminf. 7, 39 (2015).
Article Google Scholar
Ponzoni, I. et al. Hybridizing Feature Selection and Feature Learning Approaches in QSAR Modeling for Drug Discovery. Sci. Rep. 7(1), 2403 (2017).
Article ADS PubMed PubMed Central Google Scholar
Cravero, F., Martinez, M. J., Vazquez, G. E., Diaz, M. F. & Ponzoni, I. Feature learning applied to the estimation of tensile strength at break in polymeric material design. J. Integr. Bioinform. 13(2), 286 (2016).
PubMed Google Scholar
Topliss, J. G. & Costello, R. J. Change correlations in structure-activity studies using multiple regression analysis. J. Med. Chem. 15(10), 1066–8 (1972).
Article CAS PubMed Google Scholar
Topliss, J. G. & Edwards, R. P. Chance factors in studies of quantitative structure-activity relationships. J. Med. Chem. 22(10), 1238–44 (1979).
Article CAS PubMed Google Scholar
Rucker, C., Rucker, G. & Meringer, M. y-Randomization and its variants in QSPR/QSAR. J. Chem. Inf. Model. 47(6), 2345–57 (2007).
Article PubMed Google Scholar
Kubinyi, H. QSAR in Drug Design in Handbook of Chemoinformatics, (ed. Gasteiger, J.) 1532-1554 (Wiley-VCH, 2003).
LigPrep, version 3.1, Schrödinger (2015).
Maestro, version 9.9, Schrödinger (2014).
Epik, version 3.1, Schrödinger (2015).
Jorgensen, W. L. & Tirado-Rives, J. The OPLS [optimized potentials for liquid simulations] potential functions for proteins, energy minimizations for crystals of cyclic peptides and crambin. J. Am. Chem. Soc. 110(6), 1657–66 (1988).
Article CAS PubMed Google Scholar
IBM SPSS. Statistics for Windows, Version 22.0, IBM Corp (2013).
Soto, A. J., Cecchini., R. L., Vazquez, G. E. & Ponzoni, I. A wrapper-based feature selection method for ADMET prediction using evolutionary computing. Evolutionary Computation, Machine Learning and Data Mining in Bioinformatics. 188–189 (2008).
Kohavi, R. & John, G. H. Wrappers for feature subset selection. Artif. Intell. 97(1–2), 273–324 (1997).
Article MATH Google Scholar
Breiman, L. Random Forest. Mach. Learn. 45(1), 5–32 (2001).
Article MATH Google Scholar

Download references

Acknowledgements

This work is kindly supported by CONICET, grant PIP 112–2012–0100471, by UNS, grants PGI 24/N042 and PGI 24/ZM17, Ministerio de Economía y Competitividad (CTQ2015-66313-R) and by MINCYT, for their economic support given to IP in the “6° Programa de Movilidad Docente a Madrid”. We also acknowledge MECD, VSP grant FPU15/01465 and Banco Santander for VSP fellowship AY21/17-D-27 in the program “Becas Iberoamerica-Santander Investigación”. Finally, we also acknowledge support of the publication fee to the “Programa de Desarrollo de Ciencias Básicas” (Pedeciba) of Uruguay, Grant “Informática 2019”.

Author information

Ignacio Ponzoni, Víctor Sebastián-Pérez and María J. Martínez contributed equally.

Authors and Affiliations

Instituto de Ciencias e Ingeniería de la Computación (UNS-CONICET), Bahía Blanca, Argentina
Ignacio Ponzoni, Víctor Sebastián-Pérez & María J. Martínez
Departamento de Ciencias e Ingeniería de la Computación, Universidad Nacional del Sur, Bahía Blanca, Argentina
Ignacio Ponzoni & María J. Martínez
Centro de Investigaciones Biológicas. Consejo Superior de Investigaciones Científicas (CSIC), Ramiro de Maeztu 9, 28040, Madrid, Spain
Víctor Sebastián-Pérez, Carlos Roca, Carlos De la Cruz Pérez & Nuria E. Campillo
Planta Piloto de Ingeniería Química – PLAPIQUI (UNS-CONICET), Bahía Blanca, Argentina
Fiorella Cravero & Mónica F. Díaz
Facultad de Ingeniería y Tecnologías, Universidad Católica del Uruguay, Av. 8 de Octubre, 2738, Montevideo, Uruguay
Gustavo E. Vazquez
Instituto de Química Médica. Consejo Superior de Investigaciones Científicas (CSIC), Juan de la Cierva 3, 28006, Madrid, Spain
Juan A. Páez
Departamento de Ingeniería Química, Universidad Nacional del Sur (UNS), Bahía Blanca, Argentina
Mónica F. Díaz

Authors

Ignacio Ponzoni
View author publications
You can also search for this author in PubMed Google Scholar
Víctor Sebastián-Pérez
View author publications
You can also search for this author in PubMed Google Scholar
María J. Martínez
View author publications
You can also search for this author in PubMed Google Scholar
Carlos Roca
View author publications
You can also search for this author in PubMed Google Scholar
Carlos De la Cruz Pérez
View author publications
You can also search for this author in PubMed Google Scholar
Fiorella Cravero
View author publications
You can also search for this author in PubMed Google Scholar
Gustavo E. Vazquez
View author publications
You can also search for this author in PubMed Google Scholar
Juan A. Páez
View author publications
You can also search for this author in PubMed Google Scholar
Mónica F. Díaz
View author publications
You can also search for this author in PubMed Google Scholar
Nuria E. Campillo
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

I.P. and N.E.C. conceived the study, guided the experimental design and drafted the paper. V.S.P. and M.J.M. conducted the main experiments for inferring and validating the QSAR models, and organized the results. C.D.C.P. collected the dataset and make the computation of the molecular descriptors. V.S.P., C.R. and J.A.P. performed the drug-like properties calculation, similarity assessments, and also the discussion of the importance of the parameters. M.J.M. programmed the data pre-processing required for VIDEAN. F.C., G.E.V. and M.F.D. achieved the visual analytic study using VIDEAN software and the discussion about the statistical correlations among the molecular descriptors selected for the QSAR models. M.J.M., F.C. and M.F.D. designed and conducted the validation experiments based on generating random QSAR models. All the authors helped to design the QSAR methodology with suggestions and ideas for improving its performance, and reviewed and approved the submitted manuscript.

Corresponding authors

Correspondence to Ignacio Ponzoni or Nuria E. Campillo.

Ethics declarations

Competing Interests

The authors declare no competing interests.

Additional information

Publisher’s note: Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary Information

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Ponzoni, I., Sebastián-Pérez, V., Martínez, M.J. et al. QSAR Classification Models for Predicting the Activity of Inhibitors of Beta-Secretase (BACE1) Associated with Alzheimer’s Disease. Sci Rep 9, 9102 (2019). https://doi.org/10.1038/s41598-019-45522-3

Download citation

Received: 31 January 2019
Accepted: 30 May 2019
Published: 24 June 2019
DOI: https://doi.org/10.1038/s41598-019-45522-3

This article is cited by

AiKPro: deep learning model for kinome-wide bioactivity profiling using structure-based sequence alignments and molecular 3D conformer ensemble descriptors
- Hyejin Park
- Sujeong Hong
- Jae-Min Shin
Scientific Reports (2023)
Machine learning models for predicting the activity of AChE and BACE1 dual inhibitors for the treatment of Alzheimer’s disease
- G. Dhamodharan
- C. Gopi Mohan
Molecular Diversity (2022)
Drug repurposing for ligand-induced rearrangement of Sirt2 active site-based inhibitors via molecular modeling and quantum mechanics calculations
- Shiv Bharadwaj
- Amit Dubey
- Umesh Yadava
Scientific Reports (2021)
Structure-based design and classifications of small molecules regulating the circadian rhythm period
- Seref Gul
- Fatih Rahim
- Ibrahim Halil Kavakli
Scientific Reports (2021)
Artificial intelligence to deep learning: machine intelligence approach for drug discovery
- Rohan Gupta
- Devesh Srivastava
- Pravir Kumar
Molecular Diversity (2021)

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.

Subjects

Abstract

Similar content being viewed by others

Introduction

Results

Database

Data pre-processing

Molecular descriptor calculation

In silico experimental design

Random experiments for assessing the risk of correlation by chance

Materials and Methods

Preparation of databases

Ligand preparation

Drug-like properties calculation

Statistical analysis of the database

Software used for processing molecular descriptors

Machine learning tools used for feature selection and classification models

Methods used for random experiments

Conclusions

Description of additional data files

Data Availability

References

Acknowledgements

Author information

Authors and Affiliations

Contributions

Corresponding authors

Ethics declarations

Competing Interests

Additional information

Supplementary information

Rights and permissions

About this article

Cite this article

Share this article

This article is cited by

Comments

Search

Quick links