Introduction

Blackberries belong to the Rosaceae family and the Rosoideae subfamily. Its genus is Rubus which is divided into 12 subgenera, with an estimated number of 750 species distributed worldwide1. The three largest subgenera of Rubus include Idaeobatus, Malachobatus and Rubus2. Among the Rubus genus, blackberries and raspberries are important fruit crops which are mostly produced as fresh fruits, processed food, frozen, juice, concentrate, jams, jellies or ice creams3. Raspberries and blackberries have a relatively short history of less than a century as cultivated crops and only a few of its generations have been collected from their wild progenitor species4. Fruits of blackberries are rich in anthocyanins and phenolic compounds that contribute to their high antioxidant capacity; therefore, the consumers’ interest is growing for blackberries because of their beneficial implications on human health5,6. The information on worldwide production, regions and quantity are not well known, even though blackberry production is rapidly increasing7,8 with an estimated 140,292 MT commercially harvested from 20,035 ha in 2005. Europe leads the world in its area under cultivation (7692 ha), while North America has the highest production (59,123 MT).

Restricted genetic diversity is a serious concern for future Rubus breeding programs, especially when selecting genotypes with superior quality fruit for processing and high yield. On the other hand, very little information is available on diversity in blackberry species, and few accessions are currently being utilized in blackberry breeding programs9. The evaluation of genetic and phenotypic diversity among wild and cultivated plant populations is important for the conservation of genetic resources, the identification of superior genotypes to be used as parents in breeding programs and for expanding the genetic background of blackberries10. There are various techniques available for the evaluation of plant phenotypic and genetic variability including morphological, bio-chemical and molecular markers. Morphological traits are the first to be used in the classification and evaluation of the germplasm11,12,13. In a report, Badjakov et al.14 analyzed 28 raspberry genotypes from the Bulgarian germplasm collection including 18 Bulgarian genotypes and breeding lines, 8 accessions from outside Bulgaria and two accessions of wild species, including R. occidentalis and R. adiene. The genotypes they studied were clearly clustered into two groups, corresponding to two pedigree groups. The investigation of genetic diversity of R. alceifolius in its native range in Southeast Asia as well as its introduced range in Madagascar and the neighboring parts of the Indian Ocean islands have confirmed greater genetic diversity in its native range15. Furthermore, Gharaghani et al.16 investigated the diversity of fruit quantity and chemical properties of R. sanctus Schreb. genotypes from two distinct regions of northern and southern Iran. They reported high variability as well as considerable differences in fruit characteristics between the genotypes collected from the two different parts of Iran.

There is no documented data available about the cultivation of Rubus spp. crops in Iran; however, the collection of blackberry fruits from wild stands is a common activity among Iranian locals of the Caspian region, and the sale of blackberries is a frequent sight along the roadsides of northern Iran. Based on a recent survey, Rubus genetic resources comprise 8 species in Iran including Rubus saxatilis L. (the only herbaceous type that belongs to the Cylactis Subgenus), R. caesius L., R. sanctus Schreber (syn. R. anatolicus (Focke) Hausskn.), R. hirtus Waldst. & Kit. (syn. R. lanuginosus Stev. ex Ser), R. hyrcanus Juz., R. dolichocarpus Juz. (syn. R. ochtodes Juz.), R. discolor Weihe and Nees (syn. R. armeniacus Focke) and R. persicus Boiss (syn. R. raddeanus Focke). All of these species belong to the subgenus Rubus, except for R. saxatilis L., the only herbaceous type, that belongs to the Cylactis Subgenus. Five inter-specific natural hybrids also have been reported from Iran17. However, apart from the reports on botanical studies, there is little information on pomology and nutritional values of the available wild Rubus genetic resources in Iran. Accordingly, the objectives of this study were to describe the variability of berry size, fruit bio-chemical characteristics and antioxidant activities in 57 genotypes of the Iranian wild blackberry, representing four species. Objectives also dealt with discovering useful correlations among the measured traits and determining the relationships between the genotypes and species by using cluster analysis. These were performed to provide necessary information on the domestication and breeding of this valuable crop.

Material and methods

Plant material

In this study, fruit samples were collected from first Iranian blackberry repository, established in 2010 at Shiraz University, Iran18. The geographic coordination of the established collection comprised a longitude of 52° 35′ 0.73′′, a latitude of 29° 43′ 43.56′′ and an altitude of 1791.4 (m). The average long-term annual rainfall was 386 mm, and the soil was silty-loam. To establish the repository genotypes which included four different species, R. sanctus, R. hirtus, R. caesius and R. persicus, were collected from 10 provinces across Iran (Fig. 1). The geographic information of the origin as well as the species of the genotypes under study is presented in Table 1.

Figure 1
figure 1

Approximate distribution of blackberry habitats in Iran that used in our study (the green colored provinces). The original map is obtained from d-map (https://d-maps.com/carte.php?num_car=5496&lang=en) and modified (colored) using paint software of Microsoft Windows 10).

Table 1 Geographic information of original sites in different parts of Iran, in which 57 studied blackberry genotypes, collected from.

This research was conducted in two growing seasons during 2014 and 2015, on 5-year-old canes propagated on its own root at 4 × 2 m spacing and a free training system. A total of 57 wild genotypes out of more than 100, were selected and tagged as experimental plant materials. Each genotypes represents three cane in the repository, which treated as three replicate. The orchard floor was managed using mechanical cultivation between rows and hand cultivation within the rows. Sever pruning (including removal of all canes from 30 cm above ground) were done manually in all of the experimental plants on early March. No fruitlet thinning was applied. Canes were irrigated using a drip irrigation system with 3-day intervals and the amount of irrigation water was adjusted based on the plants’ requirement during the growing season.

The regular nutrition program of the orchard consisted of an application of 20 t ha–1 of decomposed dairy cattle manure, 200 kg ha–1 of Di-Ammonium phosphate, 200 kg ha–1 potassium sulfate every other year, and also 100 kg of Ammonium sulfate annually divided in two applications of equal amounts in autumn (early November) and spring (mid-May) using a fertigation system; plus, one foliar spray of a mix of trace elements in early June. No chemical spry used for plant protection.

Fruits of each genotypes were sampled at its appropriate ripening time (Fully black color stage) and immediately transferred to the lab for further evaluation.

Determination of fruit physico-chemical properties

Fruit samples were harvested in their ripe stage (black color) and fruit characteristics including fresh weight, total soluble solid (TSS), titratable acidity (TA), total phenolic compound, ascorbic acid, total anthocyanin content and antioxidant activity were evaluated. Samples were in three replicates. In order to measure the berry weight, a sample of 20 fruits per replicate was randomly selected and weighed by using an electronic balance with 0.001 g precision, and then the weight was averaged and reported as g per berry. TSS was determined by using a manual refractometer (ATAGO, Japan). TA content was measured as citric acid equivalent by diluting 5 ml aliquot of fruits juice in 40 ml of distilled water, which was then titrated to pH 8.2 by using 0.1 N NaOH, and was finally expressed as g/100 g of fresh weight, according to descriptions by Zheng et al.19. The total phenolic compound was determined by using a spectrophotometer (Epoch microplate spectrophotometer, USA) based on the Folin–Ciocalteu colorimetric method20. For this purpose, the fruit extract (1 ml) was mixed with 1 ml of HCl (6 mol) and 5 cc of methanol (75%). Then, 1 cc of the solution was mixed with 5 ml of 10% Folin-Ciocalteu reagent and 15 ml of NaCO3 \(\left(\frac{7\mathrm{g}}{100\mathrm{ ml}}\right)\). It was then tested by spectrophotometry at 760 nm. The results were expressed as mg Gallic acid/100 g fresh weight. Gallic acid was used as a standard compound and the total phenols were expressed as mg/g Gallic acid equivalent by using the standard curve equation: y = 0.0061x + 0.0396, R2 = 0.9991, where x is the absorbance at 760 nm and y is total phenolic content in the different extracts.

Ascorbic acid was quantitatively determined according to 2, 6-dichlorophenol-indophenol (DCPIP) dye method21. The ascorbic acid in 0.1 g of fresh fruit was extracted by using 2% (w/v) m-phosphoric acid. The extract volume was increased to reach 100 ml, and then 1 cc of the solution was mixed with 9 cc of 2, 6-dichlorophenol-indophenol (50 μmol). The absorbance of the solution was measured at 515 nm by using the spectrophotometer (Epoch microplate spectrophotometer, USA). Ascorbic acid content was calculated according to the following formula: Ascorbic acid [mg/100 g] = Corrected Volume × F × 40, (F = M/g of Ascorbic acid equipollent of ml indophenol).

Total anthocyanin content of fruit extract was estimated by using the pH differential method proposed by AOAC International22. Absorbance was measured with the spectrophotometer at 510 and 700 nm, at the pH value of 1 and 4.5, where A = (A510–A700) pH 1 and (A510–A700) pH 4.5. The data were calculated by using the extinction coefficient for cyanidin-3-glucoside and were expressed as mg cyanidin 100 g−1 fresh weight.

The antioxidant activity was evaluated by using the 1, 1-dipheny l-2-picrylhydrazyl (DPPH) free radical scavenging method. The DPPH free radical scavenging activity measurements were carried out according to the method of Ao et al.23. Briefly, 50 μl of the fruit extracts were added to 950 μl of DPPH radical and were vortexed for 30 s. They were kept at room temperature in darkness for 15 min. The absorbance of the samples was measured at 515 nm by using the spectrophotometer (Epoch microplate spectrophotometer, USA). For each sample, three separate determinations were carried out. The antioxidant activity was expressed as the percentage of decline in absorbance, compared to that of the control, and corresponding to the percentage of scavenged DPPH. The percentage of scavenged (% DPPHsc) was calculated by using the following formula: % DPPHsc = (A cont − A samp) × 100/A cont, where ‘A cont’ is the absorbance of the control, and ‘A samp’ is the absorbance of the sample.

Statistical analysis

Data were analyzed on the basis of a randomly complete block design (RCBD) with three replicates. Analysis of variance were carried out by SAS 9.1 software. Initially, the data were checked for differences between the two growing seasons. There were no significant differences between the 2 years and the interaction between year and genotype was not statistically significant (Table 2). Therefore, data pertaining to the 2 years were averaged and accordingly used for further analysis. Diversity index were determined as indicators of variability which was calculated using this formula24; Diversity Index = (Std. dev./Mean) × 100, Std. dev. = Std. error × √n, n = 57. The correlations between all morphological variables studied herein as well as between these variables and geographic data of original sites in which plant materials were sampled, calculated based on Pearson’s Simple Correlation Coefficient by using the SPSS software (version 17). Principal component analysis (PCA) was performed and first two components were used to create bi-plot illustrating the relationships of the accessions as well as the measured traits. Cluster analysis of the accessions was performed by incorporating all of the measured attributes into the analysis. This was carried out according to Euclidian distance measure based on Wards method by the R software. The different variables were scaled based on Z-score prior to cluster analysis. The graph that presents the classification is a dendrogram of dissimilitude with standardized Euclidean Distances as they group together the closest accessions in homogeneous groups.

Table 2 Analysis of variance of studied traits in 57 blackberry genotypes assigned to four wild species.

Results and discussion

Diversity of fruit characteristics

Berry weight

Fresh berry weight of all accessions ranged from 0.14 to 1.30 g. ‘Sepidan’ (Roodbal) and ‘Babolsar 2′ accessions belong to the R. sanctus, had the largest and smallest berry sizes respectively (Table 3). In general, a high level of variation (diversity index = 30.22) was recorded for berry size among the accessions and species studied (Table 4). Yilmaz et al.6 investigated the fruit weight of 9 cultivated blackberries and 16 blackberry genotypes collected from the wild (Rubus fruticosus L.) in Turkey. A range of 0.4–1.2 g of berry weight was reported for the wild genotypes and 1.2–5.4 g for the commercial cultivars. The range, as mentioned for the wild genotypes, is consistent but less broad than the range of berry weight reported in this study (0.14 to 1.30 g). On the other hand, Celik et al.25 reported a higher range of fruit weight (1.5–2.1 g) for wild blackberries in Turkey. These differences could arise from different species, varied numbers of the accessions under each case of study, as well as the diverse climatic conditions in each context11,26,30. Results of correlation analysis showed that berry weight has a significant, but negative correlation (r =  − 0.287) with latitude (Table 5), illustrating the effect of climatic and geographic parameters of the original sites in which the plant materials were collected.

Table 3 Variations of berry weight, fruit bio-chemicals and antioxidant activity in 57 blackberry genotypes assigned to four wild species (mean of 2014 and 2015).
Table 4 Descriptive Statistics of berry weight, fruit bio-chemical characteristic and antioxidant activity in 57 blackberry genotypes assigned to four wild species (mean of 2014 and 2015).
Table 5 Simple Pearson correlations between berry weight and fruit bio-chemical characteristic with longitude, latitude and altitude of original site of 57 blackberry genotypes assigned to four wild species (mean of 2014 and 2015).

Antioxidant activity

As presented in Table 3, the antioxidant activity varied from 88.08% to 40.21% among the accessions studied. The genotypes ‘Yasuj’ (Naregah) and ‘Nourabad’ (Bavan), both of which belong to the R. sanctus, were recognized as accessions with the highest and lowest rates of antioxidant activity, respectively (Table 3). The evaluation of antioxidant activity among the genotypes of Iranian blackberry species in this study showed a relatively high level of variability (Diversity index = 20.07) (Table 4). Diversity in the antioxidant activity of blackberry can be attributed to the various methods of cultivation, environmental conditions during harvest and, especially, genetic differences26,27. Although we didn’t find any significant association in correlation analysis between geographic parameters of the original habitats of used plant materials and measured values for antioxidant activity (Table 5), but according to Wang28, the growth of blackberry through cold nights and cool days can reduce the antioxidant capacity of fruits. However, it should be noted that organisms causing disease and pests can have an impact on the plants phenolic compounds synthesis, which could influence antioxidant activity. Gundogdu et al.11 reported the range of 30.855–48.900 μmol TE g–1 (TE = Trolox equivalent antioxidant capacity) for antioxidant capacity in blackberry cultivars from East Anatolia. Huang et al.29 estimated the antioxidant activity of blackberry compared to blueberry and strawberry, whereby it was proved that blueberry had the highest antioxidant capacity (14.98 mmol Trolox/100 g DW) followed by blackberry (11.48 mmol Trolox/100 g DW) and strawberry (4.44 mmol Trolox/100 g DW). Those results were in line with the results of the current study, however the methods used for measurements were different.

Anthocyanin

The fruits’ anthocyanin content in all accessions belonging to the four species ranged from 80.74 to 145.09 mg/100 g. Herein, the ‘Abidar’ (R. sanctus) and ‘Ashkvarat’ (R. hirtus) were recorded as genotypes with the highest and lowest average anthocyanin contents, respectively (Table 3).

Pantelidis et al.30 reported that blackberry genotypes have a higher anthocyanin content compared to raspberry and red gooseberry. According to Fan-Chiang and Wrolstad31, the anthocyanin content ranged from 70.3 to 201 mg/100 g with an average of 137 mg/100 g among the 51 genotypes of blackberry. Also, Moyer et al.20 reported the range of 70 to 201 mg/100 g and an average of 141 mg/100 g in the fruit, as a result of investigating 27 blackberry hybrids. All of those previous reports are in general agreement with the results obtained in this study. Interestingly there was a significant negative association between latitude of the original habitats of the studied Rubus genotypes and their fruit anthocyanin (r =  − 0.287). It should be noted that plants sampled in higher latitude evolved in Caspian see which experience rainy and cloudy weather and less light intensity.

TSS

Total soluble solids ranged from 7.9°Brix (in the ‘Talesh 2’ genotype) to 17.8°Brix (in the ‘Kazerun’ (Eslamabad) genotype) among the accessions of the four species studied (Table 3). The increase in TSS can be a result of longer photoperiods or higher intensities of light. TSS was probably dependent on temperature and field conditions, as blackberry plants that were located at higher latitudes received less light. For example, blackberries grown in Oregon had relatively higher TSS levels than those grown in Mexico. That is probably because the latter genotypes were adapted to longer photoperiods27. In line with mentioned example, considering the geographic parameters of the natural habitats in which studied Rubus genotypes collected (Table 5), there were a significant positive correlation between altitude and fruit TSS (r = 0.287), but a negative association between latitude and TSS (r =  − 0.387). Evidently, TSS is a better indicator of fruit maturity in most of the products, especially in blackberry, and has a very important role in the food industry32. Previously, Reyes-Carmona et al.27 recorded a TSS value of 16.1% in the ‘Evergreen’ cultivar from Woodburn as the highest TSS, and 7.5% in ‘Brazos’ from Ziracuaretiro as the lowest TSS among the wild blackberry genotypes grown in Mexico. Variation in TSS was also reported by Pantelidis et al. and Clark and Finn.

Total phenol

The fruits’ total phenol content, considering all genotypes of the species investigated, ranged from 109.5 mg/100 g to 129.1 mg/100 g. The genotypes ‘Anzali 2’ (R. hirtus) and ‘Sepidan’ (Roodbal) (R. sanctus) were considered to have the highest and lowest total phenols, respectively (Table 3). Among all of the evaluated fruit characteristics, the lowest diversity index was attributed to the total phenol content (4.86) which was six times less than the variation recorded for berry weight and four times less than that of the antioxidant activity (Table 4).

Pantelidis et al.30 reported the highest total phenol in blackberry compared to other crops such as raspberries, red currants, goose berries and cornelian cherries. According to Gundogdu et al.11, the phenolic compounds that were found chiefly in blackberry genotypes were catchin, ranging from 111.599 to 438.970 mg/100 g, and ellagic acid ranging from 10.610 to 51.506 mg/100 g in the fruits of blackberry cultivars from East Anatolia. Berry crops including blackberries are important sources of polyphenolic compounds for the human diet, and they have been examined for phenolic content frequently. Nonetheless, a high level of phenolic content is not a reliable sign of the general quality of products. Accordingly, more comprehensive experimental evidence is needed to confirm the beneficial effects of phenolic compounds in these crops33.

Titratable acid (TA)

Considering all accessions of the four species, TA content ranged from 0.36 to 0.83%. In this respect, the genotypes ‘Namak Abrud 2’ and ‘Jadeh Kandovan’ (of the R. sanctus) were ranked as genotypes with the highest and lowest TA contents (Table 3). The diversity index of TA in this study equaled 20.48 (Table 4). Clearly, the ratio of TSS to TA contribute greatly to the taste of blackberry and it is therefore very important in selecting genotypes for breeding programs34. An investigation on the total acid content and malic acid of the five blackberry genotypes revealed that they varied from 1.3 to 25.9 g kg−1 and from 0.6 to 11.0 g kg−1, respectively35. A similar study was carried out on blackberry genotypes in Italy and showed that the fruits’ citric acid content ranged from 1.1 to 16.7 g kg−1 and that the malic acid ranged from 4.0 to 15.8 g kg−113. Furthermore, Gundogdu et al.11 reported the highest citric acid (7.131 g kg−1) and succinic acid (2.021 g kg−1) in ‘Cherokee’ and ‘Bursa 2’ cultivars. There is a report on wild blackberry that shows the total acidity ranged from 1.02 to 4.22% in the ‘Evergreen’ (from Oregon) and Patzcuaro (from Mexico), respectively27. These results are quite higher than those obtained in this study. These differences could be attributed to varied climatic conditions, i.e. higher light intensities in Iran, as well as the different species or genotypes.

Ascorbic acid (AA)

Variations in the AA among all genotypes of the four species ranged from 9.56 mg/100 g to 20.92 mg/100 g. The genotypes ‘Sari 3’ and ‘Nourabad (Bavan)’ (of the R. sanctus) were recognized as genotypes with the maximum and minimum values of AA, respectively (Table 3). The estimated diversity index for the AA content was 17.68 (Table 4).

Blackberries are among the products with high ascorbic acid contents. They can have an important role in supplying this important vitamin to consumers. Reyes-Carmona et al.27 estimated the amount of ascorbic acid in different genotypes and reported that the ascorbic acid ranged from 0.80 meq/g in ‘Comanche’ to 0.82 meq/g in ‘Brazos’. No significant differences were observed in the ascorbic acid content, even in genotypes considered from different origins. Ochmian et al.36 reported an ascorbic acid content of 11 mg/100 g in the blackberry. Gundogdu et al.11 also recorded ascorbic acid contents between 10.288 and 25.399 mg/100 g among blackberry genotypes that originated from East Anatolia in Turkey, which is in agreement with the findings of the current study.

Correlation among the measured traits and also between geographic parameters and measured traits

Results of correlation analysis concerning the plant chemical data are presented in Table 6 and Fig. 3. Results showed that the antioxidant activity correlated highly and positively with all of the measured attributes including ascorbic acid (r = 0.927), anthocyanin (r = 0.752), total phenol (r = 0.681), TSS (r = 0.473) and berry weight (r = 0.541). However, the antioxidant activity did not show a significant correlation with titratable acidity. Fruit weight correlated positively with anthocyanin (r = 0.591), ascorbic acid (r = 0.508), total phenol (r = 0.412) and TSS (r = 0.264), but correlated negatively with titratable acidity (r =  − 0.281). The TSS also correlated positively with anthocyanin (r = 0.429), total phenol (r = 0.484) and ascorbic acid (r = 0.342). The values of ascorbic acid and the total phenol correlated positively with each other (r = 0.646).

Table 6 Person simple Correlations between berry weight, fruit bio-chemical characteristic and antioxidant activity in 57 blackberry genotypes assigned to four wild species (mean of 2014 and 2015).

According to Fig. 3 which represents the bi-plot of principle component analysis, the cosine of the angles between vectors shows the extent of correlation between traits which is in accordance to that of correlation analysis in almost all of studied traits. The acute angles (< 90°) represent positive correlations, whereas wide obtuse angles (90° <) show a negative correlation. The length of the vectors connecting traits to the origin shows the extent of variability.

Correlations between the bio-chemical characteristic and geographic information (Table 5) showed that, latitude has a significant negative correlation with TSS, anthocyanin and berry weight (− 0.387, − 0.366 and − 0.287, respectively). It worth to mention that the higher latitudes in which many of the genotypes sampled, are located in Caspian Sea region which experience more rain and cloudy weather than other regions. On the other hand, altitude has a positive and significant correlation with fruit TSS (0.287). However, several environmental factors (such as temperatures, light, rainfall) that was effected by altitude, longitude and latitude can influence on photosynthetic rate, fruit size, sugar-to-acid ratio, fruit quality and etc.7,37. Association between geographical parameters and some of plant features like leaf and fruit size has been reported previously by Sorkheh et al.24 in wild almond species from Iran.

The success of breeding programs is highly dependent on our knowledge of desirable traits and their outcomes through crosses. Established relationships between desirable traits can help breeders with parental partner selection in breeding programs. Correlations between traits indicate whether the selection of one trait has an effect on another. Strong correlations between traits could help breeders select the important traits indirectly. This can accelerate and facilitate breeding programs. Furthermore, a close relationship between traits may also facilitate or hinder gene introgression since the selection of a trait could favor the presence of another desirable trait.

Wang and Lin38 found a linear correlation between total antioxidant capacity and phenol content in both blackberries (r = 0.961) and raspberries (r = 0.911). Deighton et al.39 reported that there was an apparent linear relationship between antioxidant capacity and total phenols (r = 0.965), whereas anthocyanin content had a minor influence on antioxidant activity (r = 0.588) and ascorbic acid contributed only minimally to the antioxidant potential of Rubus juices. Correlations obtained between the chemicals in this study are in general agreement with the findings reported by Yilmaz et al.6, Guerrero et al.21, Huang et al.29, Maro et al.12 and Gharaghani et al.16. However, Rekika et al.40 did not find correlations between anthocyanin contents, antioxidant capacity and ellagic acid.

Cluster analysis

Cluster analysis of accessions was performed by considering all of the measured attributes (Fig. 2). According to the dendrogram, the genotypes being studied were divided into two main groups. The first cluster contains the majority of R. sanctus accessions, the only accession of R. persicus (‘Masule’), one accession of R. caesius (‘Aliabad Katul 2’) and two of R. hirtus accessions (‘Astara 2’ and ‘Anzali 2’). Cluster 2 was comprised of the other three accessions of R. caesius (‘Aliabad Katul 2’, ‘Fuman’ and ‘Gorgan 5’), all of the R. hirtus accessions (except two mentioned before) as well as some accessions of R. sanctus (most of which were from the Caspian Sea region in Golestan, Mazandaran and Guilan provinces, with a few exceptions from Fars provinces).

Figure 2
figure 2

Dendrogram for clustering 57 blackberry genotypes assigned to four wild species based on all evaluated traits (left column shows the abbreviations for the species and provinces (RS, RP, RH and RC represents Rubus sanctus, R. persicus, R. hirtus and R. caesius, respectively. A, M, K, F, H, CB, WA, KB, GU and GO represents Alborz, Mazandaran, Kurdistan, Fars, Hamadan, Charmahal-Bakhtiari, West Azarbayejan, Kohgiluyeh-Boyerahmad, Guilan and Golestan provinces, respectively).

In general, the results of cluster analysis showed a partial differentiation between the genotypes based on their species and, to lesser extent, according to their origin within the main clusters (Fig. 2). For example, ‘Ali Abad Katul 2’ of R. caesius and two of R. hirtus accessions were positioned among R. sanctus accessions in cluster 1and a minority of R. sanctus genotypes were clustered with accessions from other species in second clade. However, the genotypes from different origins were grouped together in some cases, but there were also many cases that genotypes of close regions placed in same cluster or sub-clusters. For example, 8 out of 12 accessions of R. sanctus from Fars province were placed in third sub-cluster of the first cluster. Two accessions of R. sanctus from West Azarbayejan were located in first sub-cluster and two others from Kurdistan were positioned in second sub-cluster of the first clade.

The genetic material of plants could have been exchanged between the regions studied herein, and this can partly explain the reason why the genotypes of different geographic origins could be grouped together. Nonetheless, genotypes of the same geographic origin may not necessarily be positioned in the same cluster. However, it should be mentioned that this analysis is based on a limited number of bio-chemical traits and their corresponding data. Therefore, it may not be as reliable as larger numbers of morpho-chemical traits or molecular data. Gharaghani et al.16 studied the genetic diversity of R. sanctus genotypes growing in two distinct climatic conditions including the Caspian Sea region which is situated in north of Iran, and in the southern regions of Zagros being located southwest of Iran. According to that study, 16 blackberry genotypes were grouped into two main clusters regardless of their origins. It should however be mentioned that they investigated the genotypes by collecting their fruits from plants as they were situated in their respective places of origin, and more of the fruit attributes were measured, including physical and chemical traits. However, the current research investigated the plants as they were transferred and established in a repository, i.e. all in the same environment. Badjakov et al.14 analyzed 28 raspberry genotypes including 18 Bulgarian genotypes and breeding lines, 8 accessions from outside Bulgaria and two accessions of wild species, including R. occidentalis and R. adiene. Their genotypes were clearly clustered into two groups corresponding with two pedigree groups. Guerrero et al.21 used similar methods of cluster analysis to group genotypes and cultivars of wild and cultivated berries including blueberry, raspberry, Chilean guava and red sarsaparilla, based on their antioxidant capacity in Chile.

Principle component analysis (PCA)

In PCA analysis, the first two PCAs determined 73.2% of the total variations of accessions in studied traits. The first PCA (PCA1) that was associated with TSS, total phenol, antioxidant capacity, vitamin C, anthocyanin and berry weight accounted for 56.2% of the variations, whereas, the second PCA (PCA2) with 17.0% share in the total variation was associated with titrateable acidity and to some extent berry weight (Fig. 3).

Figure 3
figure 3

Projection of studied blackberry accessions based on the first (PCA1) and second (PCA2) principal components and vectors of studied traits.

Based on the bi-plot of the first two PCAs, genotypes and traits classified into four quadrants. The projection of genotypes on the bi-plot showed that genotypes placed in the quadrant I had higher titrateable acidity. Accordingly, the genotypes scattered in the quadrant II had higher TSS, total phenol, antioxidant activity and vitamin C. The genotypes positioned in quadrant III represent high berry weight and anthocyanin content. Accessions in quadrants IV had lower quantity in almost all of studied traits (Fig. 3). This grouping is in agreement with that of cluster analysis, considering the fact that almost all of accessions in quadrants II and III (with a few exceptions) were positioned in the first clade of cluster analysis and the genotypes grouped in quadrants I and IV, represents the accessions of clade 2 in cluster analysis.

According to Fig. 3, the cosine of the angles between vectors shows the extent of correlation between traits. The acute angles (< 90°) represent positive correlations, whereas wide obtuse angles (90° <) show a negative correlation. For example, wide angles between vector for titrateable acidity and those of other trait shows the weak association of these traits, this is while, the very narrow angle between antioxidant activity and vitamin C content shows the high correlation of these attributes. These results are fully in accordance to that of correlation analysis in almost all of studied traits (Table 5). The PCA is a multivariate statistical analysis which can be used to determine the number of main factors with the purpose of reducing the number of effective parameters to discriminate genotypes27,41. Previously, PCA had been used to establish genetic relationships among cultivars and genotypes as well as to study the correlations among plant biochemical traits of different Rubus species in Iran38 and some other countries6,42.

Conclusion

Results showed that a vast majority of genetic diversity is available for berry weight, fruit bio-chemical traits and antioxidant activity in the Iranian wild Rubus species. This diversity has so far remained untapped for the development of new cultivars in breeding programs. Despite the profitable traits, however, some traits like the spiny bush trait often exist in the superior genotypes which make them less likely to be used directly as commercial cultivars. Strong correlations were detected among the traits being studied, especially between the anti-oxidant capacity and other measured chemicals like anthocyanin and total phenols, which could help breeders select these traits indirectly. This could facilitate and accelerate breeding programs. Cluster analysis showed a relatively good discrimination between the genotypes based on their species and, to some extent, according to their origin. However, in some cases, genotypes were grouped together even though they were from different origins. Moreover, it should be mentioned that this analysis is based on limited numbers of bio-chemical traits and their data. Further studies based on larger numbers of morpho-chemical traits or molecular data are needed to shed a stronger light on the relationships among the species and genotypes under consideration.