Metabolite Profiling and Quantitation of Cucurbitacins in Cucurbitaceae Plants by Liquid Chromatography coupled to Tandem Mass Spectrometry

Cucurbitaceae is an important plant family because many of its species are consumed as food, and used in herbal medicines, cosmetics, etc. It comprises annual vines and is rich in various bioactive principles which include the cucurbitacins. These steroidal natural products, derived from the triterpene cucurbitane, are mainly the bitter principles of the family Cucurbitaceae. Their biological activities include anti-inflammatory, hepatoprotective, and anti-cancer activities. A total of 10 species belonging to 6 genera of the Cucurbitaceae family along with Cissampelos pareira (Menispermaceae) were included in this study. A comprehensive profiling of certain natural products was developed using HPLC-QTOF-MS/MS analysis and a distribution profile of several major natural products in this family was obtained. A total of 51 natural products were detected in both positive and negative ionization modes, based on accurate masses and fragmentation patterns. Along with this, quantitation of four bioactive cucurbitacins, found in various important plants of the Cucurbitaceae family, was carried out using multiple reaction monitoring (MRM) approach on an ion trap mass spectrometer. Cucurbitacin Q was found to be the most abundant in C. pareira, while Citrullus colocynthis contained all four cucurbitacins in abundant quantities. The developed quantitation method is simple, rapid, and reproducible.

various interesting biological activities and are widely distributed among the plants of Cucurbitaceae family, there is a need to establish metabolite distributions of important genera and species in this family.
Metabolic profile development requires prior identification of natural products for which HPLC-MS/MS is a fast and reliable method. It is often done without the use of chemically pure standards as the availability of a compound in question through synthesis, isolation or commercial sources is not always possible. However, the level of certainty in natural product identification through mass spectrometry varies. This depends upon whether the data was obtained in reference to purified standards or only an untargeted study was performed. Authentic identification through mass spectrometry is only possible when a purified compound is available. However, in case of a metabolomics study it is neither economical nor practically possible to have a large number of purified standards available. However, in cases where purified standards are not available, it is still possible to identify natural products through a sample based on MS/MS fragmentation data 14 .
Keeping in view the important bioactivities of cucurbitacins, quantitation of these compounds in various plants of the Cucurbitaceae family was carried out. Various reports on the quantitation of important cucurbitacins in plants such as the zucchini 15 , bottle gourd 16 and bitter melon 17 have appeared in the literature. A notable report is the quantitation and pharmacokinetics study of cucurbitacin IIa and cucurbitacin IIb from Hemsleya amabilis in rat plasma. These two compounds are considered major bioactive constituents in this plant and have promising antiproliferative activities 18 .
We present herein a comprehensive study focusing on the generation of metabolic profile of various plants of the Cucurbitaceae family, along with quantitation of four biologically active cucurbitacins in ten species belonging to six different genera of the Cucurbitaceae family, along with Cissampelos pareira belonging to the Menispermaceae family. C. pareira is also reported to possess antioxidant 19 , anti-inflammatory 20 , antiviral 21 , antidiabetic 22 , anticancer 23,24 and other activities 25 . There are studies in literature that report of the profiling of Cucurbitaceae family plants, such as profiling of phenolics and other polar components in watermelon 26 and zucchini 27 through LC-MS/MS. The study presented herein should serve as a stepping stone for more detailed metabolomics studies on important plants of the Cucurbitaceae family.

Experimental
Chemicals and reagents. Analytes 1, 2 and 4 namely cucurbitacin E 2-O-β-D-glucopyranoside (1), cucurbitacin I 2-O-β-D-glucopyranoside (2) and 22-deoxocucurbitoside B (4) were isolated from methanolic extract of Citrullus colocynthis fruits. The crude methanolic extract was fractionated using dichloromethane (DCM) and ethyl acetate. Compound 1 was isolated from the DCM fraction, while compounds 2-4 were isolated from the ethyl acetate fraction. Cucurbitacin (3) was isolated from DCM extract of C. pareira. The DCM extract was fractioned using hexanes and ethyl acetate. The structures of analytes are shown in Fig. 1. The compounds were characterized based on comparison of their 1 H-and 13 C-NMR spectral data with the data reported in literature 28,29 . Details of isolation are provided with supplementary information along with necessary spectroscopic data.
Formic acid, purchased from Daejung (Daejung Chemicals & Metals Co. Ltd., Korea), was used as an additive for the mobile phase. Methanol for mobile phase was purchased from Merck (Merck KGaA, Darmstadt, Germany). Type I water (ISO 3696) for the mobile phase was obtained from a Barnstead ™ GenPure ™ ultrapure water system (Thermo Fisher Scientific Inc., USA).
Instrumentation and experimental conditions. HPLC-ESI-MS/MS analysis for natural product identification was performed on a Bruker maXis II ™ HR-QTOF mass spectrometer (Bremen, Germany), coupled to a Dionex UltiMate ™ 3000 series HPLC system (Thermo Fisher Scientific, Waltham, MA, USA) fitted with a binary RS pump, column thermostat, and auto-sampler. Sample chromatography was performed on a Macherey-Nagel Nucleodur ® C18 Gravity column (3.0 × 100 mm, 1.8 µm), kept at 40 °C. 4-µL samples were injected while the mobile phase consisted of A (0.1% formic acid in H 2 O) and B (0.1% formic acid in MeOH). The mobile phase www.nature.com/scientificreports www.nature.com/scientificreports/ flow rate was set at 0.7 mL/min using a linear gradient of A and B starting at 10% B, increased to 90% B in 5.5 min, maintained at 90% for 1.5 min, and returned to 10% B in 1 min. The total run time was 10 min, including a 1 min holding time at the start and 1 min equilibration time at the end of the gradient.
Mass spectra were recorded using electrospray ionization employing the Bruker CaptiveSpray ™ ion source.
MS and MS/MS spectra were recorded separately both in positive and negative modes. Ion source parameters were set as follows (parameters for negative mode next to positive mode parameters): capillary voltage at 4500 V (−3500 V), end plate offset at 500 V, nebulizer gas 45.0 psi, drying gas at 12.0 L/min and drying gas temperature at 270 °C. All spectra were recorded in the mass range of m/z 100 to 2000, while the scan speed was set at 5 Hz for MS and 12 Hz for MS/MS spectra. Active exclusion feature of the instrument was used which enables the instrument to remove a precursor ion from further consideration after a set number of MS/MS spectra have been recorded for that particular precursor ion. The active exclusion number was set at 3, and the precursor reconsideration time was set at 30 s.
HPLC-MS/MS analysis for quantitation was performed on a Bruker amaZon speed ion trap mass spectrometer (Bremen, Germany), coupled to a Dionex UltiMate ™ 3000 series HPLC system (Thermo Fisher Scientific, Waltham, MA, USA) fitted with a binary pump, column thermostat and auto-sampler. Chromatographic separation of analytes was achieved on a reverse-phase C18 column (Agilent Poroshell 120 EC-C18 3.0 × 50 mm, 2.7 µm), kept at 40 °C. 2-µL samples were injected while the flow rate was set at 0.5 mL/min. A linear gradient was used for analyte elution starting at 10% B, increased to 95% B in 3.5 min, maintained at 95% for 1.5 min, and returned to 10% B in 1 min. The column was equilibrated for 1 min at the end of the gradient. Total run time for analysis was 8 min.
Mass spectra were recorded using electrospray ionization under positive mode employing the Bruker CaptiveSpray ™ ion source. Ion source parameters were set as follows: capillary voltage at 4500 V, end plate offset at 500 V, nebulizer gas 35.0 psi, drying gas at 8.0 L/min, and drying gas temperature at 250 °C. Mass spectra scan range was set at m/z 100 to 1000, while the number of spectral averages was set at 3. Ion charge control (ICC) was used for transferring a certain number of ions to the ion trap and set at 60,000, while accumulation time was set at 100 ms. Fragmentation time under collision-induced dissociation (CID) mode was set at 20 ms while fragmentation amplitude was optimized for each analyte to obtain the maximum abundance of fragment ions.
Method performance. All MS and MS/MS data were saved using both profile and line spectra to minimize the chance of instrumental noise being taken as a precursor ion. Mass spectra for all samples were recorded under both ionization modes (positive and negative) to counter check the authenticity of a molecular ion peak while active exclusion was used to minimize the chances of common contaminant peaks being placed under MS/MS fragmentation. Each sample was injected in triplicate. The developed quantitation method was assessed for accuracy and precision. Accuracy (% bias) and precision (% RSD) were assessed by analyzing three different QC samples with six replicates for intra-day, and 12 replicates on two different days (6 replicates/day) for inter-day analysis. Excellent accuracy and precision (<5%) were found for the developed method. The accuracy of analysis was calculated using the expected concentration (C E ) and the mean value of measured concentration (C M ) by using the following relation: Accuracy (bias, %) = [(C E -C M )/C E ] x100. Similarly, the relative standard deviation (% RSD) was used as an indicator of the analytical precision, and calculated from the standard deviation and mean value of measured concentrations by the following equation: Precision (RSD, %) = (Standard Deviation (SD)/C M )x100. Method performance was further evaluated through the analysis of fortified samples prepared by spiking additional amounts of analytes 1-4 at three levels of 50, 100, and 150 ng/mL, respectively, in the original sample solutions used for analysis. Details about method precision and validation along with calibration equations, LOD and LOQ values are provided with supplementary information (Supplementary Tables 1, 2

and 3).
Sample preparation. Shade-dried plant material (whole plants) were crushed in a blender. 1 g of each plant was weighed and extracted with 10 mL methanol through sonication for 20 min. Each sample was centrifuged for 15 min at 6000 rpm to settle large particles, and the supernatant was filtered through a 0.22 µm PTFE syringe-driven filter. 50 µL of the filtered extract was diluted to 1000 µL with methanol for LC-MS, and LC-MS/ MS analysis.
For quantitation, 1 mg of each standard compound was weighed and dissolved into 1 mL methanol to prepare standard stock solutions. These solutions were diluted with 50:50 water: methanol in a serial manner to prepare ten calibrant solutions ranging between 50-2000 ng/mL. The analysis of plant samples was performed using diluted plant extract. 50 µL of filtered plant extract was diluted to 1500 µL with 50:50 water: methanol for LC-MS/ MS analysis.
Spiked samples for method validation were prepared in a similar manner as the plant samples. 50 µL of filtered plant extract plus an amount of standard solution equivalent to spike concentrations of 50, 100, and 150 ng/mL was diluted to a final volume of 1500 µL with 50:50 water: methanol for three samples, and labelled as S1, S2 and S3, respectively.

Results and Discussion
LC-MS/MS optimization. The method for LC-MS/MS in the profiling study was optimized using a RP-C18 column in a way that the various sample components eluted in a 10 min runtime. No carryover was detected in the next blank sample run after the plant sample. Analysis were performed in both positive and negative ionization modes, and the mobile phase composition for both polarities was kept identical (0.1% formic acid in both solvents). However, to obtain a reasonable cycle time for the MS/MS analysis, the scan frequency of the instrument was kept at maximum (12 Hz), and active exclusion was used to avoid solvent contaminant peaks being www.nature.com/scientificreports www.nature.com/scientificreports/ placed under MS/MS fragmentation. Precursor reconsideration time was set at 0.5 min after careful examination of the peak widths. This ensured that no precursor ions are excluded from the analysis.
The mobile phase gradient for quantitative analysis was adjusted in order to elute all analytes in the shortest possible runtime and to have large enough differences in runtimes to avoid overlapping MRM transitions. Figure 2 shows that all analytes eluted from the column between 5.0-6.6 min of analysis with small peak widths, and without any observable peak tailing or fronting. The analysis was performed under positive ionization mode, and all of the analytes were observed as sodium adducts ([M + Na] + ). Due to the large structures and presence of various oxygen atoms (as hydroxyls) in the structures, all analytes showed a good affinity towards the formation of sodium adducts. Ammonium adducts were also observed along with sodium adducts when the mobile phase composition was changed from 0.1% formic acid to 20 mM ammonium acetate. However, the use of ammonium acetate decreased the instrument sensitivity. 0.5% acetic acid was also used as a mobile phase, but this resulted in a lower sensitivity as compared to 0.1% formic acid in positive mode, whereas chloride adducts were obtained in negative mode. 0.1% formic acid in negative mode also resulted into the formation of formate adducts. However, it was observed that all analytes in negative mode with different mobile phase compositions showed smaller instrumental response as compared to sodium adducts in the positive mode. Therefore, it was concluded that 0.1% formic acid in positive mode was the best mobile phase for analysis.
The observed sodium adducts were subjected to MS/MS fragmentation analysis in the ion trap and the fragmentation amplitudes were tuned for each analyte. All analytes showed good fragment yields in the fragmentation amplitude range of 0.90-1.55 V. Table 1 summarizes optimized MRM parameters for analytes 1-4. A standard mixture of analytes was prepared at a concentration of 50 ng/mL and analyzed under optimized chromatographic and MRM conditions. Excellent chromatographic peak shapes with good intensities were observed (Fig. 2). Figure 3 shows extracted ion-chromatograms, and product ion spectra of analytes 1-4 in the standard mixture at a concentration of 50 ng/mL.

Identification and quantitation of natural products.
A total of 51 compounds were putatively identified based on their high-resolution masses and fragmentation data in positive and negative ionization modes. Their identification was performed using a library of natural products previously reported from the plant species included in this study. The library was custom-built as follows. Plant names (with all synonyms) were queried in the Dictionary of Natural Products (DNP) Ver. 26.2 (Dec 2017), and all resulting hits were used to build a library of natural products. All the samples were screened against the prepared library using Bruker Compass TargetAnalysis Ver. 1.3 software, which compares the mass errors (ppm) and isotopic patterns of the compounds in the library with the observed mass spectra and ranks the probable compounds based on match score. The samples were then analyzed again for MS/MS spectra of compounds which were found using TargetAnalysis. Entries with higher ppm errors (>10 ppm) were discarded, and no MS/MS data analysis was performed. It was observed that mass errors were below 2 ppm in most cases. The fragment ions in the MS/MS data were analyzed using in-silico fragmentation. Fragments were generated by manually dissecting the molecules at various possible sites and comparing the theoretical fragments with those obtained from the data. Details about the compounds identified in positive and negative ionization modes are presented in Table 2.  www.nature.com/scientificreports www.nature.com/scientificreports/ The identification of natural products was performed through the acquisition of full range mass spectra, and it was observed that most analytes, under the positive ionization source conditions, were observed as sodium adducts while a few were observed as protonated adducts. The formed sodium adducts were observed to be stable as they did not exhibit extensive fragmentation under CID conditions. Fragments with high abundances were generated through the loss of H 2 O or acetate group (if present), while other fragments were only seen in low abundances. In the negative ionization mode, mass spectra contained formate adducts, while deprotonated molecules ([M-H] − ) were also seen. It was observed that the formate adducts, upon fragmentation, resulted in the loss of formic acid and generated deprotonated molecules which exhibited further fragmentation behaviour, such as the loss of water and acetate group.
Based on the ion intensities observed, heat maps were generated for both ionization modes (positive and negative) to show a distribution of various plant metabolites across 6 genera and 10 species of the Cucurbitaceae family (Figs 4, 5). Heat maps were generated using GraphPad Prism 7 on a PC running Windows 7 SP1.
The developed quantitation method was applied for the detection and determination of analytes 1-4 in 10 different plants of the Cucurbitaceae family, along with Cissampelos pareira which belongs to the family Menispermaceae. This plant is rich in alkaloids and finds some uses in the Indian and Chinese medicine. Many alkaloids from this plant exhibit cytotoxic 23,30 , anti-inflammatory 20 , antiplasmodic activities and this is why it has gained some attention as a natural remedy for malaria in Kenya due to its antimalarial properties 31 . Although this plant is well-known for its alkaloidal content, our research group recently isolated cucurbitacins F and Q from this plant. The isolation of cucurbitacin F and Q is quite surprising from C. pareira. The structures of compounds  www.nature.com/scientificreports www.nature.com/scientificreports/ were confirmed through 1 H-NMR and 13 C-NMR spectroscopy. The details of isolation for analytes 1-3, along with 1 H-and 13 C-NMR data provided with supplementary information. Due to the isolation of analyte 3 from C. pareira, it was important to include this plant in the list of profiling of cucurbitacins. The results of quantitation showed that analytes 1-4 in various plants (in this study) occur in a very large range of concentrations between 0.12-5153.6 mg/Kg. The results of quantitation study are summarized in Table 3.  www.nature.com/scientificreports www.nature.com/scientificreports/ Analytes 1-4 were found to be present in significant quantities in Citrullus colocynthis which is well known for its cucurbitacin content. This plant exhibits various important bioactivities such as antidiabetic, anticancer, anti-inflammatory, etc. 32,33 . The fruits of C. colocynthis have been used traditionally in the Indo-Pak region for its antidiabetic properties 34 . The results of the current quantitation study concur with the traditional use C. colocynthis fruits as it contains high concentrations of cucurbitacins. Cucurbitacins E, I and Q have been shown to possess antitumor and antidiabetic activities [35][36][37][38][39][40] . C. pareira contains a substantial amount of cucurbitacin Q as found in our study and this concurs with the antiproliferative potential of the plant 30,41 . This does not signify that the anticancer potential of this plant is only due to the presence of a large amount of cucurbitacin Q as it requires further studies. Another interesting concurrence is the presence of analytes 1-4 in moderate amounts in other plants of the Cucurbitaceae family. These plants exhibit antidiabetic, anticancer, antibacterial, and other activities [42][43][44] .

Conclusion
The present study has putatively identified fifty-one compounds in ten species in six different genera of the Cucurbitaceae family and C. pareira of the Menispermaceae family using high-resolution masses and fragmentation data. Mass spectrometric data of the identified compounds was used to produce a distribution profile these compounds in the analyzed Cucurbitaceae plants. A quantitation method for four bioactive cucurbitacins in the Cucurbitaceae plants was also developed in the current study. The developed quantitation method is simple, rapid, and sensitive. The results of this study are useful for natural product chemistry, food quality control, herbal products standardization, and drug discovery and development.   Table 3. Quantitation of analytes 1-4 in Cucurbitaceae plants. *nd = not detected.