Introduction

Plants have significantly contributed to the longevity of human life by serving as medicinal sources for the treatment and prevention of various diseases. In recent years, there has been a notable surge in the utilization of plants and their derivatives within the food industry and for nutraceutical applications1,2,3,4. The utilization of plants as cooked products or raw materials, either fresh or dried, as whole plants or in parts, has been a common practice. The Lamiaceae family plays a crucial role within the plant kingdom due to its significant importance as a source of medicinal, aromatic, spice, and edible plants5. This family comprises more than 220 genera across 4,000 species, many of which possess medicinal properties and contain essential oils6. Essential oils, which are concentrated liquids containing volatile chemical components with aromatic properties, are extracted from different parts of plants as secondary metabolites. These components possess essential ecological and biological properties. The quality and quantity of essential oil can be influenced by various factors, such as genetics, nutrition, sunlight, temperature, humidity, location, and harvesting time7,8. The Lamiaceae family includes the genus Hymenocrater, which consists of more than 21 species found worldwide. Within Iran, there are nine naturally occurring species, with five of them being endemic to the country. These species are H. incanus Bunge, H. oxyodontus Rech. F., H. yazdianus Rech. F., H. platystegius Rech. F., and H. calycinus (Boiss.) Benth9. One particular species, H. longiflorus, is endemic to the western regions of Iran and eastern Iraq10,11,12. It thrives at altitudes ranging from 1550 to 2300 m above sea level in the Iranian plateau. In Iran, this particular plant is commonly referred to as Oraman's tulip or Gole Arvaneh-Avarmani. However, in the western part of the country, locals often refer to it as Soor-Halale or surehahala11,12,13. H. longiflorus is a densely packed, highly fragrant plant with green leaves and purple or light purple flowers. In the Zagros region, locals utilize the aerial parts of this plant in raw or baked form for their anti-inflammatory properties, sedative effects, and to alleviate allergic skin reactions12. Both traditional medicine and recent pharmacological studies have recognized the plant as having a wide range of beneficial properties. These include antimicrobial, antifungal, antioxidant, cytotoxic, larvicidal, and mosquito repellent properties as well as home freshening abilities10,12,14,15. Previous studies have primarily focused on analyzing the chemical composition of this plant using a population collected exclusively from Paveh, Kermanshah with predominant compounds hedycaryol (22.2%), α-cadinol (20.43%), and β-bourbonnene (5.76%)16. Additionally, plants collected from Kurdistan-Iran, were found to contain α-pinene, β-caryophyllene, and β-ocimene, with percentages of 22.47, 18.05, and 14.92, respectively12. Another study conducted in Bayangan-Kermanshah reported the presence of γ-cadinol, α-pinene, and p-menth-1-en-8-ol, with percentages of 18.49, 10.16, and 9.82, respectively15. Furthermore, a specific investigation examined the expression patterns of specific genes involved in biosynthesis, including TPS 27, L3H, TPS2, TPS1, OMT, and GDH3, which are associated with the synthesis of 1,8-cineole, carvone, α-pinene, thymol, estragole, and β-citronellol. This study analyzed the gene expression under in vitro in drought stress conditions13. These findings contribute valuable insights into the chemical composition and genetic makeup of this plant. Despite the medicinal properties associated with H. longiflorus, there has been a lack of evidence regarding its morphological and phytochemical variations in Iran. Therefore, this study aims to investigate the morphological and phytochemical variations, as well as the essential oil composition, of 200 individuals of H. longiflorus plants grown in eight natural habitats. The study also intends to conduct multivariate analysis to further explore these variations and introduce of possible new chemotypes. A fundamental understanding of chemical compounds, as well as the morphological and antioxidant status of plants, is essential for identifying superior plant species for domestication and breeding programs.

Results and discussion

Considering that this plant is endemic to western parts of Iran and eastern regions of Iraq and no studies have been conducted on the populations of the plant and their genotypes, and on the other hand, due to the reported effects of antimicrobial, antifungal, antioxidant, cytotoxic, larvicidal, and mosquito repellent properties as well as home freshening abilities in this plant, we need to conduct further studies in this regard8,10,12,13. In this study, the chemical and physical diversity among 200 individuals from 8 populations of H. longiflorus plants was evaluated. Collection areas, genotypes numbers, geographical and topographical characteristics of H. longiflorus of 8 populations are shown in Table 1. In addition the design of this study is illustrated in Fig. 1.

Table 1 Collection areas, genotypes numbers, geographical and topographical characteristics of H. longiflorus populations.
Figure 1
figure 1

The different segments of plant and research stages design in the present study.

Morphological dimension analysis

The average morphological characteristics of each population along with their standard error are shown in Table 2. The plant height (PH) ranged from 50.32 to 69.65 cm, with the highest observed in population P8. The number of nodes per stem (NNS) and the number of flowers per stem (FNS) were highest in population P8, with values of 16.40 and 57.92, respectively. The internode distances ranged from 4.7 to 6.47 cm, with the maximum distance found in P4. Flower lengths varied from 1.95 to 2.45 cm, with the minimum and maximum values observed in P4 and P3, respectively. The highest leaf length (5.20 cm) and leaf width (3.87 cm) were recorded in P2. Additionally, other measurements fell within the following ranges: Bract length (0.06 to 0.21 cm), Bract width (0.09 to 0.19 cm), Peduncle length (2.79 to 3.89 cm), Upper lip length (0.65 to 1.00 cm), and Calyx length (1.58 to 1.86 cm). The wide data ranges for all traits indicate significant diversity among the populations. The plant in question is currently classified as an endemic species that thrives under natural conditions. Given its significant medicinal value, it is crucial to cultivate and breed the plant to meet the growing demand. To achieve this goal, it is imperative to have a diverse range of plants, which facilitates the selection of suitable candidates for breeding purposes. Consequently, the existing diversity of the plant can be instrumental in this regard, as it provides a rich pool of genetic resources to draw from17.

Table 2 Morphological characteristics (Means ± standard errors) of Hymenocrater longiflorus of 200 genotypes of eight studied populations.

Essential oil (EO) content, compositions and determination of chemotypes

The colors of the essential oils ranged from colorless with a mild scent to yellow, dark yellow, and greenish-yellow with a mild to very strong fragrance. The main reason for the change in color of essential oil samples could be the alteration of the existing compounds in the EOs among the evaluated populations. Color changes in EO samples collected from various populations have been reported in previous report18. The EO content varied among populations, ranging from 0.40 to 0.78 (V/W) with the lowest and highest amounts was obtained in populations P5 and P8, respectively (Fig. 2a). The highest EO contents were found in the region with the lowest elevation and a westward slope which received ample sunlight throughout the day and predominantly featured dry slopes. In this plant, a previous report has stated that the EO content was 0.4 v/w, which is in line with the lowest reported level of EO in the current study14.

Figure 2
figure 2

Phytochemicals and antioxidant activity in each population (a) essential oil content (b) anthocyanin content (c) leaves total phenol content (TPC) (d) flowers TPC (e) leaves total flavonoid contents (TFC) (f) flowers TFC (g) DPPHsc activity of leaves (h) DPPHsc activity of flowers.

According to the GC and GC–MS analysis, 26 compositions were identified, which constituted approximately 99–99.50% of the EOs among the populations (Table 3). The main chemical groups found in EOs of H. longiflorus included hydrocarbon monoterpenes (0.17–4.70), oxygenated monoterpenes (0.64–15.82), sesquiterpene hydrocarbons (0.03–22.56), oxygenated sesquiterpenes (1.12–68.43), phthalates (9.73–94.72), and other compound groups (0.94–7.39). The main predominant compounds in the population's EO include tau-cadinol (0.62–55.56), mono (2-ethylhexyl) phthalate (8.10–94.70), elemol (0.21–19.11), β- spathulenol (0.08–14.39), 4-terpineol (0.23–10.19) and β-eudesmol (0.21–9.94). The highest percentage of tau-cadinol was recorded in population P3, while mono (2-ethylhexyl) phthalate was predominant in population P5. Elemol was found to be the most prevalent in population P6, β-spathulenol in population P4, 4-terpineol in population P8, and β-eudesmol in population P4. The diversity of predominant compounds (tau-cadinol and mono (2-ethylhexyl) phthalate) and their percentage in the collected samples was highly variable due to the fact that the samples were gathered from different regions. This characteristic represents a positive aspect of plants collected from natural habitats, as valuable chemotypes for domestication and breeding purposes are identified. The high diversity of essential oil compounds has also been reported in other studies conducted on different plant species. The composition and quantity of essential oils in plants are influenced by various factors including sexual, seasonal, ontogenetic, genetic variations, as well as ecological and environmental characteristics18,19. Chromatograms of GC/MS for eight population are illustrated in Fig. 3.

Table 3 Chemical composition of essential oils of Hymenocrater longiflorus Benth. samples collected from eight natural habitats.
Figure 3
figure 3

GC/MS chromatograms of eight samples from collected population.

Previous studies have primarily focused on analyzing the chemical composition with only a single population that have shown that hedycaryol, α-cadinol, β-bourbonnene, α-pinene, β-caryophyllene, β-ocimene, γ-cadinol, p-menth-1-en-8-ol, 1,8-cineole, carvone, thymol, estragole, and β-citronellol were the major constituents of the plant12,13,15,16. The oxygenated sesquiterpene, tau-cadinol, have also been identified in higher proportions within the EO of Pulicaria crispa (53.5%) and Pulicaria arabica (38.6%) that have been found to possess insecticidal activity20. Furthermore, tau-cadinol, which serves as a major constituent of Ipomoea carnea (38.6%), has been recognized for its significant antibacterial properties21. Eupatorium odoratum EO, with the predominant compound tau-Cadinol (20.10%), has shown antimicrobial and antioxidant effects22. α-cadinol as isomeric form of tau-cadinol with (20.43%) was the second dominant component of H. longiflorus16. Mono (2-ethylhexyl) phthalate, in addition to being a plasticizer, possesses various biological properties including cytotoxicity, antioxidant, anti-inflammatory, antimicrobial and antiviral activity23. In this regard, chemotypes with only a few dominant constitute are the attention of different industries24. Research has demonstrated that certain types of algae have the ability to produce phthalates such as di-n-butyl phthalate and mono (2-ethylhexyl) phthalate as a defense mechanism against environmental stressors. These phthalates can be released into the surrounding environment, potentially impacting the delicate balance of aquatic ecosystems. Within algal cells, these synthesized phthalates are believed to be stored in cell membranes to help maintain the flexibility and resilience of the cells. These findings indicate that phthalate production may be a widespread occurrence, both on land and in marine environments23. Mono (2-ethylhexyl) phthalate has been reported as a predominant compound in all parts of the plants Cirsium japonicum at 16%, Salvinia natans at 29.3%, and Eichhornia crassipes, as well as in the flowers of Osmanthus fragrans at 26.5%, serving as a natural component. These compounds have shown beneficial effects such as reducing soil-borne diseases, improving soil quality, and promoting plant growth23. The process of selecting new plant crops for the pharmaceutical industry is market-driven and requires careful consideration of various factors. This includes identifying valuable natural bioactive compounds, conducting extensive extraction procedures, and developing products that meet market demands. It is a complex and time-consuming process that requires expertise and careful evaluation25. In the present study, P3 and P5 with dominant compounds tau-cadinol and mono (2-ethylhexyl) phthalate can be used for domestication and extensive cultivation after supplementary studies.

In order to classify and determine the chemotype of eight populations, the EO components were subjected to hierarchical cluster analysis (HCA) in line with stacked column charts (Fig. 4a–d). The dendrograms of the eight populations, as shown in Fig. 4d, were divided into three main groups, indicating a distinct chemotypes. Additionally, a stacked column chart of the 26 constituents, as well as the EO component classification, were provided alongside the dendrogram to demonstrate which constituents correlated with the chemotypes and populations Fig. 4b,c. In chemotype I, which consisted of populations P1 and P2, three major components were identified, including mono (2-ethylhexyl) phthalate, tau-cadinol, and α-elemol. Chemotype II, which only had one population (P5), was rich in mono (2-ethylhexyl) phthalate but lacked tau-cadinol and α-elemol. Chemotype III, which included populations P3, P4, P6-8, contained tau-cadinol, β-eudesmol, and 4-terpineol. Furthermore, in the clustering of compounds, it was found that three compounds, mono(2-ethylhexyl) phthalate (26), tau-cadinol (22), and elemol (20), had the greatest impact on the separation and differentiation of populations (Fig. 4e). Additionally, the principal component analysis (PCA) yielded similar results, where these three compounds (26, 22 and 20) had the greatest impact on the clustering. Specifically, compound 26 had the highest concentration in population P5, compound 22 was predominant in P3 and to some extent in P1, and compound 20 was observed in P4 and P6-8. These two principal components (PC1 and PC2) were able to explain more than 95% of the variations in population compositions in terms of phytochemical compounds and chemotypes determination.

Figure 4
figure 4

Multivariate analysis and stacked column chart of the 26 constituents among eight population (a) Hierarchical clustering of 26 EO constituents (b) stacked column of EO classification group’s percentage (c) stacked column of EO based on 26 EO constituents (d) classification and chemotyping of eight population based on EO (e) Di-plot of two first components based on PCA analysis of EO compositions.

Flowers anthocyanin content

The results of the analysis of variance showed that the populations of H. longiflorus had a significant difference in terms of total anthocyanin content (p < 0.01). The evaluated range of anthocyanin content in the populations varied from 1.89 to 3.75 mg cyanidin-3-glucoside equivalent C-3GE/g DW (Fig. 2b). The order of anthocyanin content, from highest to lowest, was observed in populations P1-3, P6, P8 > P4, P5, P7. However, there was no statistically significant difference between populations P1, P2, P3, P6, and P8. Although anthocyanins are most recognized as pigments contributing to coloration in fruits and flowers usually in attracting pollinators, however, high levels of these compounds can encourage the use of flowers as medicinal organs26.

Total phenolic content (TPC) and total flavonoid content (TFC)

Significant differences (p < 0.01) were observed in the levels of TPC and TFC in leaf and flower samples obtained from eight distinct populations. The TPC in the leaf samples exhibited a wide range, spanning from 7.89 to 117.18 mg gallic acid equivalent (GAE)/g DW (Fig. 2c). Notably, the population denoted as P6 exhibited the highest TPC, followed by P8, P1, P7, and populations P2 to P5. Conversely, the flower samples displayed a TPC ranging from 39.98 to 86.62 mg GAE/g DW, with the highest content observed in population P1 and the lowest in populations P8 and P6 (Fig. 2d). Furthermore, the TFC varied from 81.04 to 143.46 mg quercetin equivalent (QUE)/g DW in leaf samples and from 94.82 to 133.26 mg QUE/g DW in flower samples across the different populations (Fig. 2e,f). The order of TFC in leaf samples from populations, from highest to lowest, was as follows: P6 > P8 > P1 > P2-4, P7 > P5. Additionally, in flower samples, the highest and lowest amounts were observed in populations P5 and P6 respectively. It has been reported that phenolic compounds have antimicrobial and antioxidant properties, helping plants prevent infections from pathogens and pathogenic microorganisms. Additionally, their presence in plant tissues protects them against the toxic effects of reactive oxygen species27.

DPPH free radical scavenging activity

The antioxidant properties of leaf and flower samples from H. longiflorus populations were evaluated using the DPPH free radical scavenging activity method. The results showed a significant statistical difference (p < 0.01) in the fifty percentage inhibition concentration (IC50) values of DPPH scavenging activity (DPPHsc) among eight populations. The IC50 values ranged from 0.65 to 78.74 µg/ml in leaf samples and from 4.4 to 7.71 µg/ml in flower samples (Fig. 2g,h). To facilitate comparison of the samples with a standard antioxidant, the antioxidant power of ascorbic acid DPPHsc (IC50) was determined to be 3.24 ± 0.12 µg/ml. A lower IC50 value indicates higher antioxidant activity. Among the leaf samples, populations P6, P1, P7, and P8 exhibited the highest antioxidant potential, with lower IC50 values that were not statistically significant from each other. Similarly, among the flower samples, populations P1, P5, and P7 exhibited the highest antioxidant activity with lower IC50 values.

Correlations and multivariate analysis of combined data

Correlation analysis is a valuable tool for investigating relationships between traits, and it has practical implications in plant breeding and domestication. In Fig. 5, the correlation between traits is presented both quantitatively and visually, with large blue points indicating a positive correlation and large red points indicating a negative correlation. Among the morphological traits, the length and width of the bracts (BL, BW) exhibited a strong positive correlation (0.84). Moreover, these morphological traits showed a positive correlation with compound 24 of EO (BL = 0.40, BW = 0.46), while displaying a negative correlation with compounds 1 and 5. These correlations with phytochemical traits can contribute to cost-effective analysis in the selection process of these plants and serve as valuable indicators for marker-assisted selection (MAS)28. The negative correlation observed between DPPH leaf (DPPH L) IC50 and TFC (-0.73), TPC (-0.63), compound 20 (-0.90), EO (-0.85), and compound 9 (-0.73) suggests that these compounds exert a significant influence on the antioxidant activity of the leaves. Additionally, DPPH flower (DPPH F) IC50 exhibited a negative correlation with TPC (-0.79) and TFC (-0.78), but a positive correlation with flower anthocyanins (0.51), compound 15 (0.66), and compound 10 (0.57). These findings underscore the substantial impact of total phenols and flavonoids on the antioxidant activity of the flower organ. Notably, a significant positive correlation was observed between components 1, 2, 5, 8, 9, 10, 14, 17, 18, 20 and EO. Furthermore, the positive correlation between leaf flavonoids (TFC L) (0.63), leaf total phenolic content (TPC L) (0.65), and the EO content indicates that conditions promoting an increase in essential oil content also lead to elevated levels of phenolic compounds. The simultaneous increase of one group of metabolites with another group is of great interest to breeders, particularly in medicinal plants where an increase in all beneficial secondary metabolites may be desired. simultaneous breeding of several attributes is interested among the breeders29. Additional correlations between traits are depicted in Fig. 5.

Figure 5
figure 5

Simple correlation among 49 morphological and chemical traits obtained from 200 individuals from eight population of Hymenocrater longiflorus.

Canonical correspondence analysis (CCA) is a multivariate statistical technique used to explore the relationship between two sets of variables. In this study, CCA was employed to examine the association between phytochemical and morphological traits in 200 genotypes from 8 populations. The mean values of various morphological traits were compared with those of phytochemical components, including EO constituents and content, total anthocyanins, total phenols, and total flavonoids. Table 4 displays the results of the CCA, presenting seven canonical correlations (CC 1–7) along with their corresponding eigenvalues and the percentage of variance explained by each correlation. CC1, with an eigenvalue of 0.116, accounts for 60.13% of the total variance. CC2 and CC3 explain 16.27 and 10.83% of the variance, respectively. In this multivariate analysis, positive signs ( +) indicate positive correlations between variables, while negative signs (−) indicate negative correlations. The data more than 0.4 used for CCA for explain of associations.

Table 4 The results of the canonical correspondence (CC) analysis, presenting seven canonical correlations (CC 1–7) along with their corresponding eigenvalues and the percentage of variance explained by each correlation among phytochemical, morphological and studied eight populations.

In CC1, a positive association is observed between the combination of DPPH (L) and compound 26 with population 5, as well as the morphological traits stem diameter (SD) and internode distance (ID). Additionally, compound 20, along with other essential oil compounds (1–10 and 12–25) and TPC L in populations P6 and the node number per stem (NNS) trait, display positive correlations. Thus, high NNS values in populations may be linked to increase of these phytochemical traits. CC2 reveals a positive correlation between compound 1, 8, 9, 15, 16, and 26, as well as leaf total phenolic contents (TPC L) in populations P5 and P6, with the morphological traits PH and NNS. Conversely, compound 19, 21, and 22 in population P3 exhibit negative correlations with the morphological traits BL, BW, SD, PL, UUL, LLL, and LL.

Considering that CC1 and CC2 collectively explain 76.40% of the relationship between morphological and phytochemical traits, scatter plots were generated using these two canonical correlations (Fig. 6). The direction of the lines associated with phytochemical traits indicates their positive or negative correlation with other studied traits. The positive correlation between NNS and TPC (L), as well as compounds 8, 5, and 16, suggests a relationship with this morphological trait. Moreover, plant height (PH) shows a positive correlation with population P5 and the combination of essential oil 26 and DPPH (L). Conversely, most morphological traits exhibit an inverse correlation with essential oil compounds and total leaf phenols (TPC L), indicating that smaller plant stature is associated with higher levels of EO in populations.

Figure 6
figure 6

Canonical correspondence analysis of between morphological and phytochemical traits of Hymenocrater longiflorus populations.

To further explore the diversity of phytochemical and morphological traits among populations, cluster analysis using R software (circular cluster) was conducted (Fig. 7). The analysis divided the genotypes of all individuals in the eight studied populations into three main categories based on all the studied traits. This clustering indicates a significant variation in phytochemical and morphological traits among populations, surpassing the within-population variation. Therefore, the interbreeding between individuals from two distant populations as opposed to the interbreeding of individuals within populations can potentially lead to the emergence of new genotypes that are adaptable to different regions and contribute to the development of more effective plant breeding and cultivation.

Figure 7
figure 7

Hierarchical clustering of all combined traits among 200 individuals of Hymenocrater longiflorus.

The analysis of plant populations showed that the plant is a rich source of compounds tau-cadinol, and mono (2-ethylhexyl) phthalate. According to the EO groups, oxygenated sesquiterpenes played a major role in forming the majority of the EOs in studied populations. The role of oxygenated sesquiterpenes has been identified in insecticidal, antioxidant, and allelopathic activities, indicating that chemotypes containing these compounds could be new candidates for using industrially for this aims7,30. Furthermore, the study on populations demonstrated that the antioxidant activity of the flower and leaf extracts of the plant is high, suggesting their potential use as antioxidant compounds31. Examination of the plant extracts revealed that they are rich sources of flavonoids, phenolics, and flowers are particularly rich in anthocyanins. Therefore, the studied populations showed that the plant holds promise for domestication and improvement for widespread cultivation. The presence of mono (2-ethylhexyl) phthalate in the P5 population with a high percentage of over 94% can be a valuable source for this compound, as it possesses various biological properties, including cytotoxicity, antioxidant, anti-inflammatory, antimicrobial, and antiviral activities23. Three valuable chemotypes identified in the present study are as follows:

(I) mono (2-ethylhexyl) phthalate, tau-cadinol, and α-elemol; (P1 and P2).

(II) mono (2-ethylhexyl) phthalate; (P5).

(III) tau-cadinol, β-eudesmol, and 4-terpineol; (P3, P4, P6, and P8).

CCA and correlation plot analysis indicated a strong relationship between morphological and EO gradients. These properties could eliminate the need for expensive and time-consuming analytical quantification methods when selecting plants for domestication and breeding. In conclusion, the application of CCA, scatter plots, and cluster analysis provides valuable insights into the relationship between phytochemical and morphological traits in the studied populations. These findings contribute to our understanding of the diversity and interplay of these traits in the plant species under investigation.

Methods

Plant material

To obtain the plant material, the aerial parts of H. longiflorus were collected in full bloom in June from different parts of the Zagros Mountains, West of Iran. Sampling was done for academic purposes, with the permission of the University and in accordance with relevant institutional, national, and international guidelines and legislation. Sampling was in accordance with the IUCN guideline on research on endangered species and the Convention on the Trade in Endangered Species of Wild Fauna and Flora. Twenty five individual samples were randomly selected in each population from eight collection site. The species were identified by Mr. Hossein Maroofi, from Research Center of Agriculture and Natural Resources of Kurdistan, Sanandaj, Iran. Then was labeled with a voucher specimen (No. 1521) in the herbarium of the Department of Horticultural Science, Urmia University, Iran. The aerial parts were dried in the shade at room temperature for two weeks. To identify the habitats of these plants, local people were interviewed, previous articles and books were consulted (Table 1).

Essential oil isolation and GC/MS analysis

The EO content was determined by extracting 35 g of dried plant materials using the water distillation method with a Clevenger-type apparatus over a 3 h period and the EO content expressed in % (v/w). The obtained oils were then stored in securely sealed vials with foil covers at 4 °C prior to analysis. Gas chromatography-mass spectrometry (GC/MS) analyses were conducted using a Thermo Finnigan capillary gas chromatograph directly linked to the mass spectrometer system (model GC TRACE; TRACE MS plus). A non-polar fused silica capillary column (HP-5MS, 30 m × 0.250 mm, 0.25 μm film thickness) was utilized. The injector temperature was set at 280 °C. The GC oven temperature was programmed as follows: initially, the oven temperature was set at 40 °C for 2 min, then increased to 160 °C at a rate of 3 °C/min, and finally raised at a rate of 5 °C/min to 280 °C, where it was held for 2 min. Helium used as the carrier gas at a flow rate of 1 ml/min, and the ionization energy was set at 70 eV. Without any dilution, 1 μL of each essential oil sample was manually injected into the GC, and the split ratio was 1:6018. The injection conditions for both GC and GC–MS were identical. The percentages of chemical constituents were determined using the area normalization method, without considering response factors. Retention indices (RI) were calculated using the retention times of injected n-alkenes (C6–C24) under the same experimental conditions. Compound identification was achieved by comparing the RIs with those available in NIST and other relevant literature sources32,33, and the mass spectra of the compounds were identified using the installed libraries on X-Calibur (2.07) software.

Preparation of methanolic extracts

The leaf and flower samples from each region were first dried and finely powdered, with 1 g of each sample being used for extraction. Following this, 15 ml of 80% methanol extract was added to each gram of sample. The samples were then subjected to extraction in an ultrasonic bath for 30 min at 25 °C and 120 Hz waves using an Elmasonic E 120 Hz apparatus from Elma Schmidbauer GmbH, Germany. The resulting extracts were filtered with whatman filter paper before being stored at 4 °C.

Determination of total phenolic and flavonoid content

The total phenolic content (TPC) was determined using the Foline-Ciocalteu procedure, following the method described previously with some modifications34. Specifically, 10 µl of each extract were combined with approximately 1200 µl of Foline-Ciocalteu (10%), followed by the addition of 960 µl of sodium carbonate (7%) and 180 µl of distilled water. The solution was thoroughly mixed and then incubated in darkness for 30 min. Subsequently, the absorbance was measured at 760 nm using a spectrophotometer (Dynamica HALO DB-20, UK). For the determination of total flavonoids, the aluminum chloride colorimetric method was employed with slight adjustments27. Initially, 15 µl of the extract was mixed with 150 µl of NaNO2 (5%) and left at room temperature for 15 min. Then, 300 µl of AlCl3 (10%) and 1 ml of NaOH were added to the solution. After a 30 min. incubation period, the mixture exhibited a yellow color, and the absorbance was measured at 380 nm.

DPPH free radical scavenging assay

The radical scavenging activity of the extracts was determined using the DPPH assay, with slight modifications to the procedure reported by Yahia and coworkers35. Three different concentrations (2, 5, and 10 ppm) of methanol extracts from each region were mixed with 2 ml of 0.1 mM methanol DPPH solution. The mixture was gently shaken and left at room temperature for 15 min. The absorbance of the samples was measured at 517 nm using a UV–visible spectrophotometer, with methanol (80%) used as the blank and DPPH solution without extract used as the control. Ascorbic acid solution was used as the standard, with absorbance determined in the same way as the plant extracts.

The formula used to calculate the inhibition percentage is as follows:

$$ {\text{Inhibition}}\% \, = \frac{{A_{B} - A_{A} }}{{A_{B} }} \times 100 $$

Here, \(A_{A}\) represents the absorbance of the DPPH solution with extract, and \(A_{B}\) represents the absorbance of the DPPH solution without extract, which serves as the control.

Anthocyanin determination

Anthocyanins, which are a type of phenolic compound, were analyzed using the previous established method36. Dried flower samples (0.1 g) were ground into a fine powder and then mixed with 1.5 mL of acidic methanol (a solution of methanol and hydrochloric acid in a 1:99 ratio). The resulting mixture was centrifuged to separate the components. To measure the total anthocyanin content (TAC), the pH difference procedure was employed. This involved preparing two buffers with different pH values (pH = 1 and pH = 4.5). Subsequently, 2.5 mL of the pH 1 buffer was added to the extract, followed by the addition of 2.5 mL of the pH 4.5 buffer to 100 ppm of the extract. The absorbance of the resulting solution was measured at 520 nm and 700 nm using a spectrophotometer.

$$ {\text{TAC }}\left( {{\text{mg}}/100{\text{g}}} \right) = \frac{A \times MW \times DF \times 1000}{{\varepsilon \times C}} $$

Statistical analysis

The eight populations of H. longiflorus were classified and grouped based on ward distances by analyzing the essential oil composition data matrix using hierarchical cluster analysis (HCA) and principal component analysis (PCA) with PAST software (version 4.03). Additionally, canonical corresponding analysis (CCA) was conducted on the morphological, phytochemicals, and essential oil content and composition using the same software. Heat-map cluster and correlation analysis for combined data of all genotypes were obtained using RStudio (version 1.2.5019) URL http://www.rstudio.com/. Furthermore, analysis of variance with completely randomized design (CRD) (ANOVA) and means comparisons with Duncan's Multiple Range test (DMRT) were carried out using SAS software version 9.4.

Ethical approval

Plant sampling were comply with the IUCN Policy Statement on Research Involving Species at Risk of Extinction and the Convention on the Trade in Endangered Species of Wild Fauna and Flora.