Introduction

Cancer is a life-threatening disease and remains a major health problem around the globe, which is the second most prevalent disease after cardiovascular disease1. Esophageal cancer2,3,4 and pancreatic cancer5 are the most common cancers in digestive and gastrointestinal tract. Conventional chemotherapy drugs play a role through cell toxicity. Due to the inability to accurately identify cancer cells and play a wide range of cytotoxic effects, some patients show obvious toxic and side effects with the prolongation of medication time, and some even produce drug resistance. These toxic and side effects limit the treatment of cancer. Therefore, the development of safe, low-toxic and efficient anticancer drugs has become the focus of drug researchers.

It is an essential compound library for the development of various new structural drugs from Chinese medicine. Catalpol is an important active ingredient of Chinese medicine, which has significant inhibitory effects on breast cancer6, gastric cancer7, lung cancer8 and colorectal cancer9. Due to the complexity of catalpol molecular structure, low activity intensity and poor drug-forming ability, it is necessary to modify the structure of catalpol10. The chemical structure of catalpol was shown in Fig. 1.

Figure 1
figure 1

Chemical structure of catalpol.

Numerous heterocyclic compounds have attracted the medicinal chemists′attention due to their anticancer activities, and the introduction of heterocyclic groups in various natural products can enhance their biological activity11,12,13,14. Pyrazole motif received an extensive attention due to its varied pharmacological utilities15,16, it emerges as a useful pharmacophore scaffold in the synthesis of potent anticancer agents17. For example, Ruxolitinib and Crizotinib are two examples of pyrazole based anticancer drugs which are currently available in the market (Fig. 2) 18.

Figure 2
figure 2

Pyrazole based anticancer drugs.

Based on the previous research foundation of our research group on iridoids19,20 and the anticancer activity of catalpol and pyrazole, a series of pyrazole modified catalpol compounds were synthesized by the principle of drug combination (Fig. 3). Different substituents were introduced on the pyrazole group in order to obtain catalpol derivatives with anticancer activities.

Figure 3
figure 3

Design of pyrazole modified catalpol derivatives.

Results and discussion

Chemistry

Catalpol is a polyhydroxyl compound, in which the reactivity of hydroxyl groups at different positions varied, thus, it is difficult to selectively modify specific hydroxyl groups at the particular position. In the design of the synthetic route of the target compound, the multi-functional characteristics of catalpol should be considered and appropriate reaction conditions should be selected in every step of the synthesis to avoid damaging the parent structure of catalpol. In the structure of catalpol, the glycosidic bond was easily destroyed by enzymatic hydrolysis, acid hydrolysis and alkaline degradation, and epoxides also can undergo ring-opening reactions with acid and strong bases. Furthermore, the functional group of cycloalkene ether bond in catalpol is extremely active, and it is prone to addition reaction, oxidation reaction, polymerization reaction, and the addition reaction of carbon–carbon double bonds and halogens can also occur under acidic conditions.

The experimental result showed various reaction products were generated from “one-pot” synthesis, which led to tremendous difficulty in separation and purification afterward (Fig. 4a). Therefore, the multistep synthesis approach would be a priority consideration for this experimental design. Here, a series of pyrazole modifed catalpol derivatives were synthesized by replacing C10-position hydroxyl groups of catalpol with the pyrazole (Fig. 4b).

Figure 4
figure 4

Synthesis of pyrazole modified catalpol derivatives. (a) One-pot synthesis of pyrazole modified catalpol derivatives. (b) Stepwise synthesis of pyrazole modified catalpol derivatives.

The reaction method of hydroxy halogenation on catalpol and redirected introduction of heterocyclic pharmacophore was explored. Imidazole, triphenylphosphine and iodine were selected to iodinate catalpol hydroxyl selectively. The reaction solution is refluxed at 70 °C to obtain new products 2a and 2aa. In order to obtain compound 2a selectively, the reaction conditions of catalpol hydroxyl iodination were screened (Fig. 5). In this reaction, the hydroxyl group in catalpol was poorly selected to generate two iodine products. In order to obtain a single product, the reaction conditions must be optimized.

Figure 5
figure 5

Synthesis method of catalpol hydroxyl iodide.

In the reaction, since the structure of catalpol contained two primary hydroxyl groups, it is difficult to obtain a single product in the experiment. Through molecular simulation docking, there is little difference in activity between the two primary hydroxyl groups. However, the temperature, the initial concentration of iodine impinges heavily on the product category, yield and diversity. According to the reported reaction mechanism, it is necessary to explore the reaction equivalence ratio, temperature, initial concentration of iodine, and solvent conditions.

According to the result of catalpol iodization reaction, shown in Table 1, the selectivity of iodine reaction was greatly affected by reaction temperature. The higher reaction temperature, the more diversified products are generated. When 1.5 eq and 3 eq of iodine are respectively dissolved in ultra-dry tetrahydrofuran refluxing at 70 °C, the selectivity is seemingly pretty poor. When 1.5 eq of iodine is used for the reaction at 70 °C, the catalpol keep unreacted. When 3 eq of iodine is used for the reaction at 70 °C, the reaction is completed in 0.5 h, so the reaction was affected by the initial concentration of iodine as raw material (Entries 1–2, Table 1). Considering that the selectivity of the reaction was affected not only by the reaction temperature, but also by the initial concentration of the raw material iodine, the reaction selectivity was increased by lowering the reaction temperature under 3 eq of iodine at room temperature. It is found that the reaction was completed in 1 h, but the reaction selectivity was still poor, and the yield of compound 2a was low; when the reaction was carried out with 3 eq of iodine at 0 °C, the reaction selectivity was relatively high with almost only the product 2a, although the catalpol still remained after prolonged time of reaction. When the temperature of the reaction system was increased or iodine as the feedstock was added, there was surplus of feedstock catalpol (entries 3–4, Table 1). The reactions were explored by different initial concentrations of iodine at 0 °C (entries 5–7, Table 1). The experimental results showed the maximum yield of compound 2a was 70% when the amount of iodine was 6 eq and the reaction time was 18 h. When tetrahydrofuran (THF) was used as the reaction solvent, the synthesis process was not a homogeneous reaction system and there is solid precipitation in the process of reaction, so N, N-dimethylformamide (DMF) was used as the additional reaction solvent (entries 8–10, Table 1). Meanwhile, iodine with different initial concentrations were selected for the reaction, and the white solid (Ph3PO) was found to be the by-product. The reason for the low reaction yield is that the reaction solvent DMF was used to dissolve the solid in the experiment, which slows down the reaction speed and affects the output of the product. The reactions were also explored by different initial concentrations of iodine at − 10 °C and − 20 °C (entries 11–13, Table 1). The results showed that the raw material catalpol was not completely reacted at each reaction temperature, and the reaction selectivity was the same as that at 0 °C.

Table 1 Exploration of the reaction conditions of catalpol hydroxyl iodination.

Through the exploration of reaction conditions of catalpol iodination, the optimal reaction conditions were finally determined as the ratio of n(catalpol):n(iodine):n(triphenylphosphine):n(imidazole) was 1:6:6:12, the solvent was ultra-dry tetrahydrofuran, the reaction temperature was 0 °C.

A series of C10-position pyrazole modified catalpol derivatives (3a–3m) were synthesized by C10-iodocatalpol treated with different substituted pyrazole derivatives under K2CO3 in DMF at 70 °C (Fig. 6). When the unsubstituted pyrazole was used, the reaction could react and obtain the yield of 70% (3a). When pyrazole was substituted by the electron-donating group (–CH3), the reaction can proceed well, whether in the C3- or C5-substituted (3b–3c) or C3- and C5-substituted pyrazoles (3d). Pyrazole modified catalpol derivatives can also be obtained in moderate yields when C4-position pyrazole was substituted by halogen atom (3e–3i). When the C4-position pyrazole was replaced by electron-withdrawing groups (–NO2, –CF3), this reaction can be obtained in high yields (3j–3m). Under this optimal reaction condition, both electron-donating and electron-withdrawing substituted pyrazole compounds can react with catalpol to produce a series of high-yield pyrazole modified catalpol derivatives.

Figure 6
figure 6

Synthesis of pyrazole modified catalpol derivatives.

Biological

The target compounds were evaluated for their anticancer activity against human cancer cell lines of different origins, including Eca109 (human esophageal cancer cell line) and EC9706 (human esophageal cancer cell line), PANC-1 (human Pancreatic cancer cell line), BxPC-3 (human pancreatic cancer cell line) and HPDE6-C7 (human normal pancreatic ductal epithelial cell line) via MTT assay.

Human pancreatic cancer cells BxPC-3 and PANC-1 and human normal pancreatic ductal epithelial cell HPDE6-C7 were purchased from the Cell Bank of Shanghai Institute of Biochemistry, Chinese Academy of Sciences. Human esophageal cancer cell line Eca109 and EC9706 used in the experiments were obtained from the American Type Culture Collection (ATCC, Manassas, VA).

In vitro esophageal cancer cells inhibitory activities

MTT method was used to investigate the inhibitory effect of the target compound on the cell formation of Eca109 and EC9706 cells cultured in vitro. The drug concentration was 2 mM. From the preliminary results in Table 2. It can be seen that catalpol has weak cell inhibition effect (Supplementary Fig. S3). And compound 3e has a strong inhibitory effect on Eca109 and EC9706 cells. For Eca109 cell, the cell survival rate was 62% at 24 h and 35% at 48 h after treatment with 2 mM of drug concentration. For EC9706 cell, the cell survival rate was 48% at 24 h and 31% at 48 h after treatment with 2 mM drug concentration (Supplementary Fig. S4).

Table 2 In vitro esophageal cancer cells viability of pyrazole modified catalpol derivatives.

In vitro pancreatic cancer cells inhibitory activities

MTT method was used to investigate the inhibitory effect of the catalpol and its derivatives on the cell formation of PANC-1, BxPC-3 and HPDE6-C7 cells cultured in vitro. The drug concentration was 1 mg/mL. From the preliminary results in Table 3. Catalpol and its derivatives possessed relatively weak inhibitory activity on two pancreatic cancer cell lines. It can be seen that catalpol has weak cell inhibition effect. At 72 h, for the PANC-1 cell, compounds 3g and 3k exhibited stronger inhibitory effect than catalpol, and for the BxPC-3 cell line, compounds 3d and 3k exhibited stronger inhibition than catalpol. The results showed that catalpol and its derivatives treatment had almost no effect on human normal pancreatic ductal epithelial cell (HPDE6-C7). Therefore, catalpol and its derivatives might not be a significantly toxic.

Table 3 In vitro pancreatie cancer cells inhibitory activities of pyrazole modified catalpol derivatives.

SAR for cancer cells

SAR for esophageal cancer cells inhibitory

According to the anti-esophageal cancer data of catalpol and its derivatives, the results revealed that the parazole structures play an important role in the structure–activity relationship (SAR) for esophageal cancer cells inhibitory activities. The SAR was discussed based on variable of “R” groups (Fig. 7). The variation in the activities rely upon variable features at R1, R2 and R3, among them, compound 3e with C3-position bromo (halogen) substituents of pyrazole at the C10-position of catalpol displayed the most activity. Secondly, the anti-esophageal cancer activity of R1 halogen substituted compounds was superior to other substituted compounds.

Figure 7
figure 7

Structure–activity relationship of catalpol and its derivatives on pancreatic and esophageal cancer cells.

SAR for pancreatic cancer cells inhibitory

According to the anti-pancreatic cancer data of catalpol and its derivatives, the results revealed that the pyrazole derivatives with substituents structures play an important role in the structure–activity relationship (SAR) for pancreatic cancer cells inhibitory activities. The SAR was discussed based on variable of “R” groups (Fig. 7). The variation in the activities rely upon variable features at R1, R2 and R3, among them, catalpol and its derivatives have a weak inhibitory effect on pancreatic cancer. The activity of catalpol derivatives with substituents on pancreatic cancer is slightly stronger than that of catalpol itself.

Mechanism

In order to study the synthesis mechanism and biological activity mechanism of catalpol, we made a preliminary exploration of computer-aided theoretical calculation.

Synthesis mechanism

The structure of catalpol contains six hydroxyl groups, of which two primary hydroxyl groups (C10-OH, C6′-OH) have different reactivity based on theoretical calculation, and the aglycone (C10-OH) is better than that of primary hydroxyl group on glucose (C6′-OH). According to the classical hydroxyl iodination reaction process21, we speculated the reaction mechanism of iodine-modified catalpol derivative was shown in Fig. 8. Imidazole, triphenylphosphine and iodine form intermediate A, which reacted with the catalpol to form the intermediate B, then N–P and O–H bonds were broken in the intramolecular to get intermediate C, then was followed by intermolecular nucleophilic substitution to obtain the final compound 2a.

Figure 8
figure 8

Iodination reaction mechanism.

Molecular binding studies

The mechanism of pharmacological activity in this study was preliminarily explored through molecular docking. AutoDock Tools 1.5.6 was used for molecular docking in this study. It has been confirmed that the expression of VEGFR-2 in cancer tissue is closely related to tumor angiogenesis, regulation, occurrence, development and metastasis of cancer 22. According to the previous literature research, it was found that VEGFR-2 was highly expressed in pancreatic cancer and esophageal cancer23,24. We selected a crystallographic structure of human VEGFR-2 protein tyrosine kinase (PDB ID: 4AGD) as a potential target, which is extracted from the RCSB PDB (RCSB Protein Data Bank) database. The obtained 13 compounds 3a–3m and the original protein ligand (ligand) were pretreated and stored in pdbqt format after structural optimization and charge calculation. The protein crystal was pretreated by deliberately removing water molecules and non-standard residues and replacing them with hydrogen atoms. The protein was set to be rigid, the small molecules were set to be flexible, and the conformation search was performed using Lamarck genetic algorithm, the center coordinate of the lattice box is x = 51.335 y = − 2.78 z = − 15.563, and the number of runs is 10.

The 13 compounds were connected with ligand obtained with the VEGFR-2 binding pocket. In the analysis of protein docking score, it was found that the synthesized pyrazole modified catalpol derivatives and VEGFR-2 could be docked well. The specific docking score results are shown in Table 4. By docking scoring, the best compounds 3e were selected to bind to the receptor protein and their interaction with the target enzyme was investigated. The free energy of binding of compound 3e to the target enzyme was found to be − 6.97 kcal/mol, which was significantly higher than that of catalpol (− 4.5 kcal/mo1). This demonstrates that the catalpol derivatives designed for synthesis all have strong target enzyme interactions. Using the scoring function in Discovery Studio 2016 Client, analysis of the binding pattern with the target enzymes showed that there are four main interactions between ligand and protein, namely hydrophobic binding alkyl interaction, hydrocarbon bond, hydrogen bond interaction, and π-alkyl interaction. Compound 3e mainly interacts with amino acid residues LEU840 to form hydrophobic binding alkyl, and forms hydrocarbon bonds with amino acid residues LYS920 and CYS919, while the hydroxyl group on the parent nucleus of catalpol derivatives forms hydrogen bonds with amino acid residues LEU840 and LYS838, which increases the combination of compound molecules and target enzymes. It can be seen that the binding action of compound 3e with 4AGD and the binding action mode of protein with the original ligand are similar. It can be speculated that compound 3e has potential inhibitory effect on VEGFR-2. The interactions were shown in Figs. 9 and 10.

Table 4 Scores of docking between target compounds and related proteins.
Figure 9
figure 9

The interactions between compound 3e and VEGFR-2.

Figure 10
figure 10

The interactions between ligand and VEGFR-2.

Conclusion

Catalpol is a polyhydroxy compound and it is challenging to modify its hydroxyl at the specific position. In this paper, a simple method for pyrazole heterocyclic modification at C10-position hydroxyl of catalpol was invented without damaging the parent of catalpol and a series of catalpol derivatives were synthesized by using the principle of drug combination. The anti-esophageal activity and anti-pancreatic cancer activity of these pyrazole modified catalpol derivatives were tested. The reaction mechanism and activity mechanism were preliminarily explored. Finally, the compound 3e has good anti-esophageal activity and is expected to develop into a new candidate drug.