Pip shape echoes grapevine domestication history

Bonhomme, Vincent; Ivorra, Sarah; Lacombe, Thierry; Evin, Allowen; Figueiral, Isabel; Maghradze, David; Marchal, Cécile; Pagnoux, Clémence; Pastor, Thierry; Pomarèdes, Hervé; Bacilieri, Roberto; Terral, Jean-Frédéric; Bouby, Laurent

doi:10.1038/s41598-021-00877-4

Download PDF

Article
Open access
Published: 01 November 2021

Pip shape echoes grapevine domestication history

Vincent Bonhomme¹,
Sarah Ivorra¹,
Thierry Lacombe^2,3,
Allowen Evin¹,
Isabel Figueiral^1,4,
David Maghradze⁵,
Cécile Marchal³,
Clémence Pagnoux^1,6,
Thierry Pastor¹,
Hervé Pomarèdes⁴,
Roberto Bacilieri²,
Jean-Frédéric Terral¹ &
…
Laurent Bouby¹

Scientific Reports volume 11, Article number: 21381 (2021) Cite this article

1861 Accesses
7 Citations
11 Altmetric
Metrics details

Subjects

Abstract

The pip, as the most common grapevine archaeological remain, is extensively used to document past viticulture dynamics. This paper uses state of the art morphological analyses to analyse the largest reference collection of modern pips to date, representative of the present-day diversity of the domesticated grapevine from Western Eurasia. We tested for a costructure between the form of the modern pips and the: destination use (table/wine), geographical origins, and populational labels obtained through two molecular approaches. Significant structuring is demonstrated for each of these cofactors and for the first time it is possible to infer properties of varieties without going through the parallel with modern varieties. These results provide a unique tool that can be applied to archaeological pips in order to reconstruct the spatio-temporal dynamics of grape diversity on a large scale and to better understand viticulture history. The models obtained were then used to infer the affiliations with archaeobotanical remains recovered in Mas de Vignoles XIV (Nîmes, France). The results show a twofold shift between the Late Iron Age and the Middle Ages, from table to wine grape varieties and from eastern to western origins which correlates with previous palaeogenomic results.

Disentangling the origins of viticulture in the western Mediterranean

Article Open access 12 October 2023

Francesco Breglia, Laurent Bouby, … Girolamo Fiorentino

Seed morphology uncovers 1500 years of vine agrobiodiversity before the advent of the Champagne wine

Article Open access 27 January 2021

Vincent Bonhomme, Jean-Frédéric Terral, … Laurent Bouby

Morphotype broadening of the grapevine (Vitis vinifera L.) from Oxus civilization 4000 BP, Central Asia

Article Open access 29 September 2022

Guanhan Chen, Xinying Zhou, … Xiaoqiang Li

Introduction

Today, grapevine (Vitis vinifera L. subsp. vinifera) is economically one of the most important cultivated fruit species in the world¹. Its central economic and cultural role in the Mediterranean Basin goes back beyond the Greco-Roman era². Modern genetics and archaeobotany concur in locating the origin of domesticated grapevine in the Near East, south of the Caucasus^3,4. Its initial domestication is thought to have occurred during the Neolithic (between 6000 and 3000 BC) but the date is still debated. Chemical analyses of pottery vessels suggest that wine was already produced in the Caucasus area 8000–6000 years ago^2,5. From its Near-Eastern cradle, viticulture spread to most of the Mediterranean and eventually the rest of modern-day Europe, between 3000 BC and 500 CE². Viticulture could have started in Sardinia and Southern Italy as early as the late 2nd millennium BC⁶ and in Southern Spain by the beginning of the 1st millennium BC, in connection with the Phoenician influence⁷.

Grapevine has been dramatically modified and diversified since its early domestication. The most notable changes concern: the shift from dioecy in wild grapevines (Vitis vinifera subsp. sylvestris) to a hermaphroditic reproductive system for most of the varieties, the increase in berry and bunch sizes, the increase in sugar and acid content, and the variation in berry colour and shape^8,9. These changes are so significant that the phenotypic diversity of the domestic grapevine, including its morphological component, is much greater than that of its wild counterpart⁸. Several thousand varieties can be distinguished¹⁰, and are generally classified in two main groups: table (fruits consumed fresh or dried) and wine grapes.

Cultivated grapevine diversity is patrimonial and a direct product of its intertwined history with human societies. Because of this, cultivar diversity can help understand this shared history through the use of genetic or morphological markers.

This paper explores the global grapevine diversity through the analysis of seed morphology. It is known that seeds from wild and domesticated grapes differ in their form (i.e. size and shape); wild grapes produce roundish pips with short stalks and cultivated varieties produce more elongated pips with longer stalks⁸. Grape pips have long been a focal point in archaeobotanical studies, because of these well-known differences and because they often are the only remains that are preserved in archaeological contexts. Morphometrics, or the statistical description of shape, has a prominent place in the quantitative analysis of pips. Morphological characterization works on a highly integrated and well preserved datum, the pip shape, and its capacity to signal phenotypic resemblances, give major insights into domestication studies using modern and ancient material^11,12,13,14.

Various pip measurements have been used to distinguish wild and domesticated V. vinifera subspecies in the archaeological record^15,16. Recent quantifications of pip shape that use outline analyses have helped to improve the discrimination towards the identification of individual varieties^{17,18,19,20,21,22}.

The study of the whole phenotypic diversity in cultivated varieties reveals that two clear types can be discerned: table and wine varieties²³. Table grapes tend to have large berries, sometimes seedless, and have relatively thin skin while wine grapes are smaller, have higher concentrations of sugar, are generally seeded, and have relatively thick skins²⁴.

Negrul^25,26,27 proposed a comprehensive classification of all the known varieties into three major groups or proles. Proles orientalis is composed of table grape varieties, with big berries, typical of the Near and Middle East and regions of the Mediterranean basin. Proles occidentalis gathers wine grape varieties from Central and Western Europe. They are typically more resistant to low temperatures and have smaller, more acidic berries with lower sugar content. The varieties of Proles pontica (Balkans, Black Sea and Caucasus) have intermediate characteristics between orientalis and occidentalis. They are mostly used for winemaking or for both table/wine purposes.

In recent years, the global diversity of cultivated grapevines has mostly been studied with the use of nuclear microsatellite markers^28,29,30, thousands of SNP markers^3,30,31, or both³². These studies tend to identify a global structure confirming the three major groups described by Negrul²⁵, as well as an additional group of Iberian varieties, and a number of varieties with admixed/intermediate assignment.

Using 20 microsatellite markers (SSR), Bacilieri et al.³⁰ were able to identify Negrul’s three major groups, and a second level with two additional groups: “Iberian Peninsula and Maghreb” and “Table grapes from Italy and central Europe”. In Lacombe’s study²⁹, four distinct groups could be recognized using the same markers on a reduced set of varieties, in which closely related genotypes were excluded. In the fourth group, the Iberian cultivars were found to be associated to wine and table varieties from Asia Minor and the Caucasus. Laucou et al.’s findings³¹, which were based on an array of 18k single nucleotide polymorphisms (SNP), present a similar organization into four main groups, with the Iberian and Negrul’s groups, however, the majority of varieties were admixed.

This global structuring of V. vinifera, determined by its predominant use by humans and geographic origin, is a result of the long history of grape domestication and of the spread of viticulture. Negrul’s hypothesis proposes that the main groups of cultivars were domesticated from different populations of wild grapevines²⁵. The existence of secondary domestication events with local wild populations as opposed to mere introgression processes in other areas of the Mediterranean is still discussed, and may have helped shaped regional diversity^3,33,34. The modern diversity of cultivated grapes stems from thousands of years of selection and diffusion through cuttings and seeds combining spontaneous hybridization and somatic variation³¹.

Following morphometric analyses of grape leaves which demonstrate a weak correlation between leaf morphology and the East/West origin of grape varieties³⁵, we decided to use seed outline analysis to explore the structure of the diversity of cultivated varieties across the entire Eurasian and Mediterranean area. Our research aims to establish solid foundations on modern material, to further fuel archaeobotanical studies that can provide insights into past grapevine diversity and viticulture history. We used a representative collection of modern grapevine diversity in the form of a photographic pip shape collection, and we tested the reliability of discriminant models based on shape to infer: (i) destination use, (ii) geographical origins, (iii) conformity in the genetic structure found among varieties. Finally, we applied these models on archaeological remains as a first step into drawing finer-grained, morphological-based inferences about viticulture in the past.

Materials and methods

Reference collection of modern pips

This study includes 434 grapevine modern cultivars (Table A ESM). Their origins cover the entirety of Euro-Mediterranean diversity. Most of the cultivars were selected and sampled from the INRAE Grape Germplasm Repository (Marseillan-Plage, France). Additionally, autochthonous cultivars from the Caucasus area were sampled from the Saguramo Grape Repository (Jighaura, Georgia).

For each cultivar, 30 normally developed berries were randomly collected from a single, fully-ripe bunch. The final dataset comprised 12,346 pips.

Cofactors further used are presented in the Table A (ESM) and summarised in the Table B (ESM). They comprised: berry size, geographic origins, and destination use assessed from general bibliography^10,36. We also included genetic assignation^30,31. These two studies and the present one largely used the same set of varieties. However, because these sets were compiled at different time and with different aims, information may be missing, debated or unknown, for any given variety (Table A). For our analyses, only cultivars with well-defined information were used (Tables A, B). Berry size was observed and recorded over several years in the grapevine repositories and coded according to the International Organization of Vine and Wine descriptors³⁷.

Archaeological material

As a case study, we selected the site of Mas de Vignoles XIV (Nîmes, France) where large quantities of well-preserved, waterlogged, grape pips dating back to two time-periods (Late Iron Age/Early Roman and Medieval times) were found in association with other plant remains.

The site is located in the alluvial plain of the river Vistre and was excavated by INRAP³⁸. The site was occupied at different periods between the Neolithic and the early Middle Ages. The first traces of occupation are very sporadic, but from the end of the Bronze Age onwards, the area appears to undergo continuous changes in land use, mainly oriented towards agricultural and craft activities and including few traces of human habitation. During the second century BCE (late Iron Age or Republican period) a large farmhouse was identified nearby; the remains of its northern boundary were uncovered at Mas de Vignoles XIV. This farmhouse was later replaced by two small farms surrounded by areas devoted to agriculture and animal husbandry. The pips from this period were recovered from a well (PT14203, SU14258) and a ditch (FO14194, SU14152) that was part of a network of ditches delimiting the farming areas.

Archaeological remains from the Middle Ages are not abundant, nor well preserved and include very few traces of human habitation; this suggests that people lived further away, probably due to the unfavourable topography and edaphic conditions (depression area with high water table). The archaeological structures found include several ditches, wells, storage pits, animal enclosures and wooden constructions dedicated to farming activities ; the importance of animal husbandry is suggested by the abundance of cattle remains, which is unusual in a region mainly dominated by sheep/goat, but in agreement with local conditions³⁹. The presence of beetles associated with stable areas further reinforce the evidence of animal husbandry. Flax and hemp figure among the potentially cultivated plants, other than grapevine. The Vitis pips investigated come from a well (PT12024, SU12109 and 12111) radiocarbon-dated to the Early Middle Ages: Poz-48697: 1200 ± 30 BP (706–945 cal AD⁴⁰).

Nine pips from Mas de Vignoles XIV previously delivered aDNA results showing that varieties of different origins may have been cultivated during the Late Iron Age and the Middle Ages⁴¹.

Pip morphometric description

Each pip was photographed according to two orthogonal views (dorsal and lateral) by the same operator (TP). Outline coordinates (x; y) were extracted from these images and two markers (one at each tip of the pips) were used to normalize the position, size, rotation and first point of the outlines by registering them on “Bookstein coordinates”, that are (x = − 0.5; y = 0) and (x = 0.5; y = 0) coordinate points. For each view, elliptical Fourier transforms were used to convert the contour geometry into “Fourier coefficients”. Elliptical Fourier transforms are detailed elsewhere^42,43. The number of harmonics was chosen to gather 95% of the total harmonic power⁴³, which corresponds to five for both views. In terms of operator error (e.g. while positioning the pip), this is less harmonics, and thus a conservative choice, compared to previous recommendations of six harmonics for both views²². With four coefficients per harmonic, 40 coefficients were obtained and further used as quantitative variables describing the shape. Pip length was derived from outline coordinates. Pip length was shown to be the best predictor of all other lengths measured on pips²⁰ and here helped to analyse form, that is the shape plus size. When compared to manual measurements obtained in a subset of another study²⁰, error was centred and was, on average, below 1% (~ 1/20 mm). The correlation between the berry and the pip size previously shown²⁰ was here tested on a larger dataset using one-tail Wilcoxon rank tests (medium vs. small, large vs. small; Fig. A ESM). The final matrix analysed and used in models was thus [12346 × 41].

Statistical environment

Analyses were performed in R 4.0.2⁴⁴, with the packages Momocs 1.3.2⁴³ for everything morphometrics, MASS 7.3-51.6⁴⁵ for linear discriminant analyses, tidyverse 1.2.1⁴⁶ for general data manipulation and visualization, pvclust 2.2-0⁴⁷ for assessing uncertainties in hierarchical clustering and ape 5.0⁴⁸ for unrooted tree representation.

Visualizing and testing for use, geographical and genetic signals

First, a principal component analysis was calculated on the full matrix of coefficients to visualize how each level of each cofactor of interest were located in this synthetic morphological space (Fig. B ESM).

To test for a costructure between pip form and cofactors, two approaches were used: hierarchical clustering with robustness assessment, and cross-validation using permutational and balanced linear discriminant analyses. To assess geographical structuration the (putative) countries of origin of cultivars were organized in geographical groups (Table A ESM, Table B ESM).

For hierarchical clustering, the averaged coefficients for each level were used to calculate a distance matrix using correlation as the distance method, on which a hierarchical clustering using average (i.e. UPGMA) was calculated. Topologies obtained were presented as unrooted trees (Figs. 1, C ESM). The robustness of nodes was estimated through multiscale resampling⁴⁷ with proportions ranging from r = 0.5 to r = 1.4 with a 0.1 increment and 10² resampling with replacement. The “approximately unbiased p-values” were retained⁴⁹ and presented in Figs. 1 and C (ESM).

This method had the merit of simplicity but did not accounted for: (a) variability within groups, (b) overlapping between groups, and (c) unbalanced groups sizes since coefficients were averaged. Moreover, for archaeobotanical inference, we need predictive models along with their performance assessment. We thus combined this approach with linear discriminant analyses (LDA). Class accuracies (i.e. tree leaves) were presented as confusion matrices, and class clustering (i.e. tree nodes) were estimated using classes belonging to a node against all others. The index used is accuracy (the proportion of correctly classified pips), using leave-one-out cross-validation. To cope with unbalanced sample sizes, 10² permutations were used⁵⁰, and each sampled the minimal group size among all groups. The median value was reported on each node in Fig. 1. For each model, we also presented the confusion matrices obtained for each group against others (Fig. 2), as well as the distribution of accuracies obtained under this null model (here obtained through simulations but expected to follow a multinomial distribution; Fig. D ESM).

Assessing filtering predictions

In a “predictive” LDA, new statistical individuals are always assigned to the class with the highest posterior probability. In filtering out predictions based on posterior probabilities sample size is exchanged for identification confidence. The posterior probability cut-off value is often based on rule of thumb by picking an arbitrary threshold. Here we use a twofold approach, combining LDA, resampling and filtering in the spirit of⁵⁰. Across the 10² permutations, we calculated the class accuracies and the proportion of the original group sample size retained, as functions of cut-off value for posterior probability (Fig. E ESM and Table C ESM). We also explored the same relationship using the proportion of cases among permutations where each pip was attributed to a given class as a cut-off value (Fig. E ESM and Table D).

Inferences on archaeological material

Former studies have shown that archaeobotanical assemblages generally include an important proportion of wild type grape pips^16,22. For this reason, we used a first LDA_status to identify domesticated and wild type pips with the dataset published^17,51. Here, we used the same approach with 10² balanced permutations. For each pip, the majority rule applied. In the cited studies, these LDA achieved 95% accuracy (without filtering) in distinguishing between pips from wild grapevine individuals and those from domesticated varieties. Further inferences about archaeological pips identified as the domesticated type were then obtained using the 10² balanced models previously presented (Figs. 3 and F ESM): we inferred destination use (LDA_Use), geographical origins (LDA_Geo), the destination and geographical origins jointly (LDA_Geo×Use) and genetical grouping (LDA_SNP4, LDA_SSR5). Inferred proportions of each class are presented with three approaches: no filtering (Fig. 3), filtering out pips with a median posterior probability observed among permutations < 0.8 (Fig. F ESM); filtering out pips that were attributed to the same class less than 50% of the time (Fig. F ESM).

Results

Berry size

Our results show that berry size in modern grapes positively correlates with pip size (Fig. A ESM): varieties with medium-sized berries had longer pips than those with small-sized berries (Wilcoxon rank tests: W = 6,581,329, P < 10^–16). Similarly, large-sized berries had longer pips than those with medium-sized berries (W = 11,851,551, P < 10^–16).

Principal component analysis

The first two components of the PCA obtained from the full matrix of coefficients captured 73.2% of the total variance in form. For the sake of clarity, for each variety only the PC1-PC2 centroid was displayed (Fig. B ESM). With the exception of “Use”, which shows a clear positional difference between table and wine varieties, other cofactors of interests showed more subtle contrasts.

Form and destination use

After discarding mixed destination varieties, the remaining set included 3,106 pips (Table B ESM). The discriminant model achieved a good discrimination rate (wine = 80.2%; table = 78.9%—Fig. 2), far better than the results expected of chance alone (expected = 50%; max. observed for 100 permutations = 53%—Figs. 2 and D ESM).

Form and geographical origin

The first geographical model, based on the putative origin of cultivars according to our bibliography, included all regions except NEW WORLD. For each group, 480 pips were included. The resulting tree (Fig. C ESM) showed a clear geographical structuring with two separate clusters (EUR_IBER + EUR_WEST; EUR_ITAL + BALKANS + MED_SOUTH + MED_EAST) with EUR_EAST in between. Varieties gathered in the ASIA_CENT group were clearly set apart. For the other nodes, pvclust values were all > 94 and cross-validation > 64%. Class accuracies ranged between 23% (MED_EAST) and 63% (ASIA_CENT); aside from MED_EAST, all were better than chance alone (expected = 12.5%; max₁₀₀ = 25%).

A second geographical model was restricted to “core” historical regions (Fig. 1). In practical terms, the varietal sampling within these groups was more exhaustive and allowed to include 1,439 pips in permutations (Table B ESM). The resulting tree (Fig. 1) clearly distinguished between EUR_IBER + EUR_WEST and the other groups. All nodes presented pvclust values > 80 and cv values > 64%. Class accuracies ranged between 36% (EUR_WEST) and 50% (EUR_IBER), all better than chance alone (expected 20%; max₁₀₀ = 29%—Figs. 2 and D ESM).

With fewer groups, one usually expects better accuracies; since this was not the case, we suspected a latent effect. We thus built a third model including the same core geographical regions, and combined them with their destination use (Fig. 1). Due to the lower number of table varieties, only 120 pips were included in each permutation (Table B ESM). Despite this, a clear structure with two neat clades corresponding to destination use were observed. Apart from the clade representing BALKANS_Table and MED_EAST_Table, all nodes had pvclust values > 91 and cv values > 66% (Fig. 2). For the wine varieties, BALKANS and MED_EAST were clustered together (pvclust = 94; cv = 67%), then with ITAL (pvclust = 98; cv = 66%). Among wine varieties, another clade grouped EUR_WEST and EUR_IBER (pvclust = 99; cv = 73%). Class accuracies ranged between 29% (BALKANS_Wine, EUR_WEST_Wine, MED_EAST_Table) and 69% (EUR_IBER_Table), overall, far better than chance alone (expected = 10%; max₁₀₀ = 22%—Figs. 2 and D ESM).

Form and genetic structure

We then explored whether the genetic structure found using SSR³⁰ and SNP³¹ data was also echoed in pip form. The first model used SNP4 (Fig. 1) and included 360 pips in each permutation. The EAST_TABLE group was distinguishable from the three other groups (pvclust = 98; cv = 89%), and in the latter BALK_Wine and IBER_Wine clustered together (pvclust = 77; cv = 83%). Cross-validation values for each node were all > 83%. Class accuracies for each group ranged from 66% (BALK_Wine) to 82% (EAST_Table), much better than chance alone (expected = 25%; max₁₀₀ = 38%—Figs. 2 and D ESM).

The second model used microsatellite data (SSR5) and led to similar results. This model used only 120 pips for each permutation and distinguished two groups EAST_Table and ITACE_Table, and WCEUR_Wine + BALK_Wine + IBER_WT. All nodes presented pvclust values > 85 and cv values > 78% (Fig. 1). Class accuracies ranged from 53% (ITACE-Table and WCEUR-Wine) to 63% (EAST-Table), again, better than chance alone (expected = 20%; max₁₀₀ = 35%—Figs. 2 and D ESM). We built a final model excluding ITACE_Table (not shown), which allowed us to increase the number of pips to 420. The same topology was obtained among remaining groups, all pvclust values were > 77 and cv values > 70%, and nodes and class accuracies for groups ranged from 67 to 71%, much better than chance alone (expected = 25%; max₁₀₀ = 36%).

Sample size and filtering out based on posterior probabilities

As expected, filtering results based on posterior probabilities improves class accuracies at the cost of reduced sample sizes (Fig. E ESM, Tables C ESM and D ESM). The models with the lowest class accuracies before any filtering were the ones with the steepest slopes for the proportion of filtered out curves. Such simulations are useful since they show that with low class accuracies and without filtering on posterior probabilities, the benefit in the accuracy gain is quite low compared to the price to pay in terms of sample size reduction. For instance, when filtering at a posterior probability of 0.5, the absolute gain is only 11% (33% of relative gain) but 64% of the original sample size is filtered out. Also, the models with the smallest number of pips in each permutation showed the highest uncertainties for class accuracy estimates. Due to the resampling nature of these simulations, the higher the number of pips, the lower the variation expected for the estimates.

Application to the archaeological material

LDA_status trained on the reference material led to 95.2% accuracy for both wild and domesticated types (100 permutation using 2005 pips). When applied to archaeological material, LDA_status classified 81% pips (102/128) and 39% (74/204) pips as the wild morphological type for the Late Iron age (LIA) and the Middle Age (MA) phases. These pips were discarded from further analyses and the remaining sample sizes were thus 26 and 130 for LIA and MA phases. Overall, the different filtering approaches led to congruent results (Figs. 3, F ESM). We tried a high pass of 0.8 for posterior probabilities whose results on archaeological material (not shown) confirmed those obtained on modern material: sample sizes were dramatically reduced and results were much more dependent on the training set. The overall tendencies observed for the Mas de Vignoles XIV and based on pip shape were a shift from table to wine type, and from Southwest Asia to Western Europe, between the LIA and MA. The cofactors including use in their definitions (Use, Geo × Use, SNP4 and SSR5) all corroborated the predominance of the table type during the LIA and of the wine type during the MA. Similarly, for the geographical origins, all models provide evidence of the Eastern origins for the LIA and of Western European origins for the MA (Figs. 3, F ESM). The length of the pips classified in the domesticated-type of the LIA assemblage is greater than that of the MA. Consequently, the berry size that can be inferred is higher for the LIA, close to medium-size to large modern berries, while the size inferred for MA berries is very small (Fig. A ESM).

Discussion

The destination use (table/wine) and geographical origins of V. vinifera are echoed in the shape of modern grapevine pips and corroborate the structure found using genetic markers. The results here obtained from this modern material dataset pave the way for a more comprehensive archaeobotanical analysis of the grapevine historical agrobiodiversity and biogeography.

Analyses of genomic sequences brought direct insights into V. vinifera genealogies, kinships and, more generally, into the intraspecific structuring of the domesticated grapevine^{3,4,30,31,52,53}. Genetic markers are direct, sensitive and accurate proxies, but do not provide clear-cut groups within agrobiodiversity since variety amelioration is the product of a continuous and intertwined history and are rarely performed on the large spatio-temporal scale offered by morphometric studies. The two studies where genetic assignation were used demonstrate a high proportion of admixed varieties^30,31.

Pip shape, on the other hand, integrates genotypic, developmental and environmental factors. As a phenotypical trait, it is known to be a much more indirect proxy for measuring and uncovering agrobiodiversity structuring. Because morphology is prone to homoplasy, two identical shapes may not be directly genetically related and may instead reflect a potentially mixed signal of ancestry, similar environmental adaptations, as well as non-adaptive natural processes (i.e. drift). Our results are validated by molecular approaches rather than confirming them. Under certain conditions, for example where a strong population structure and divergent selection are present, phenotypical approaches may be superior to molecular ones for measuring agrobiodiversity⁵⁴.

Our results show that grapevine pip diversity is significantly structured by use and, to a lesser extent, by the geographical origins of varieties. Use had the best class accuracies; the pips of wine and table grapevine varieties have different form. This was shown in a previous study, which used a smaller set of varieties²⁰. Several other studies based on phenotypic^23,25 and genetic markers^3,30,31, have already concluded that grape varieties were structured, above all, according to their use as table or wine.

More importantly, our results also established a significant geographical correlation in the shape of pips. Between the two genetic models, the classifications from SNP4 gave better class accuracies for pips than those of SSR5. The fact that SNP4 was calculated using 10,000 SNP markers scattered along each chromosome, bolsters the findings of our study since the SSR5, only used 20 microsatellite markers. Despite having one class less, the SNP4 dataset may be more representative of pip variability since it was trained using more varieties, and thus more pips in permutations.

The large-scale structure in pip shape reflects the same blurred boundaries as those reported by genomic analyses. In genetic analyses, unassigned varieties are mostly attributed to human-assisted movement of cultivars across regions and inter-group breeding. In morphologic analysis, however, additional factors may be involved. i.e. environment and development constraints as well as homoplasy. Nevertheless, the morphology-based geographical tree indicates the clustering of eastern groups and western groups, and Italian varieties clustering with eastern ones. The same pattern is found for the tree combining geography and use, where despite a predominant Use structure, the Italian wine varieties are clustered with eastern ones.

Interestingly, incorrect assignations may also reveal meaningful information. Misclassified seeds fall primarily into groups that are closely related in terms of use and geographical origin. For instance, in the Geographical confusion matrix, the eastern Mediterranean group is most frequently misclassified with pips of the “Balkans” groups, and vice-versa. On the same confusion matrix, the “EUR-ITAL” group reflects its intermediate nature between western and eastern varieties. For these reasons, retrieving congruent results using shape alone was far from a foregone conclusion.

Finally, it is worth noting that the proles classification proposed by Negrul²⁵ was based on morphological criteria and was later confirmed by genetical studies^30,31. These different approaches are largely congruent and provide evidence that grapevine diversity is not only structured, but that its structuring is related to, and likely a product of, the history and the geography of viticulture.

Dedicated genotype-to-phenotype association studies could help decipher the mechanisms behind such correlation between the shape of pips and the cofactors of destination use and geographical origins. The destination use is directly related to the phenotypic traits and chiefly those of the berry (e.g. size, flavors, aromas, etc.). Berry trait loci have been reported in other studies^32,55, as have the covariation between the berry and the pip size and shape²⁰. With regard to geographical origins, we cannot exclude an indirect link with climatic conditions through crossing of varieties from different origins but we see no reason why a particular geographical origin may directly select a particular pip shape. Berry size likely has a correlation with geographical origin because of the relationship between use and geography. Negrul²⁵ already highlighted the ubiquity of large-berry table varieties in the Near-Middle East, that of small-berry wine varieties in Central-Western Europe, and intermediate varieties in his proles pontica (Balkans, Caucasus, Black Sea). Overall, it is likely that pip shape was not directly implicated in selection and that the subtle changes in its shape are probably neutral. One can thus reasonably hypothesize that any shape change is therefore caused by genetic drift and/or genetic linkage.

While indirect, both in origin and signature, pip shape is informative about variety origins and use. Shape is also often the only exploitable datum on archaeological remains, and these results are therefore of prime interest to help us better understand grapevine agrobiodiversity through both time and space^16,17,21.

Archaeobotanical inference is mostly actualistic: insights obtained from modern material generate inferences for archaeological remains. So far, the shape of pips has been used to distinguish between wild and domesticated types^15,16,22 and, more recently, to identify domesticated morphotypes that correlate with modern varieties^17,21. The identification of similar modern varieties amongst a diverse subset can be used to make infraspecific conclusions if their properties are compared to those of the already identified varieties^21,51.

In our study, we used a more extensive and more representative collection of modern seeds, the largest worldwide, to the best of our knowledge. The pioneer collection used in Terral et al.²² increased since then^17,51,56. Here, we moreover directly infer cofactors of interest without the intermediate identification of the most-resembling modern variety. This is an important result because it means that properties of varieties cultivated in the distant past can be inferred directly from the archaeological remains without having to compare them with modern varieties that may not be appropriate counterparts.

To increase the robustness of inferences, predictions are sometimes filtered out based on posterior probabilities^12,51. When cut-off thresholds are selected by the “rule of thumb” method, the trade-off between accuracy and sample size is often forgotten. Our results indicated that this approach should be used sparingly and only when classification accuracies were already proven to be better than classification by chance alone. One should also keep in mind that poorly classified pips may actually be “true” intermediate forms, and if filtered out, may accentuate the contrast between or within assemblages. Finally, when using linear discriminant analyses, particularly when groups have significantly unbalanced sample sizes, the use of permutations is preferable for obtaining better estimates of their accuracies⁵⁰.

We applied the models trained on modern material to the Mas de Vignoles XIV assemblages whose palaeogenomic information identifying the relationships between ancient grapes and modern diversity had already been obtained⁴¹. It is an ideal opportunity to combine aDNA approaches with and morphometrics strike force, although not on the same pips.

Our morphometric analyses of the Mas de Vignoles XIV assemblage demonstrate notable changes in viticulture between the Late Iron Age and Middle Age. The first one was the decreased prevalence of the wild type, from a predominant to a minority proportion. This decrease was previously observed in the study of several other archaeological sites in southern France¹⁶. It can be assumed that this wild type, which is phenotypically different from modern varieties, corresponds to a part of the diversity amongst historically cultivated varieties rather than being gathered wild berries¹⁶.

Changes were also observed in domesticated pips. The different models agree on a twofold trend “between” the LIA and the MA: a shift from table to wine varieties, and from eastern to western varieties. These changes are probably two facets of the same shift. The pip lengths are also much smaller for the Middle Age assemblages. This direct measure is congruent with the LDA results as well as those of a previous study, which showed that wine varieties have shorter pips than table ones²⁰. The SSR4 model, that should be considered more robust than the SSR5 as discussed above, similarly exhibits a contrast between these two historical phases: the predominance of Eastern table varieties in the LIA and of Western wine varieties in the MA. According to aDNA results, the three seeds studied from the LIA sample resemble table grapes as well as Eastern and Iberian wine varieties whereas medieval seeds are more similar to Western European wine varieties⁴¹. Although the seeds used were not the same as those of our study, they nevertheless corroborate our observations of a geographical shift towards Western wine varieties.

It is not possible to prove with certainty that the grape seeds found at Mas de Vignoles XIV came from locally grown grapevines, but its rural location may suggest this. While grapes were traded and could be transported over long distances in Roman times⁵⁷, archaeological findings have revealed traces of grapevine cultivation on the outskirts of the city of Nîmes, in the close vicinity of Mas de Vignoles XIV, from the second century BC onwards⁵⁸. Until recently, it was believed that by the late Iron Age the vines cultivated in the South of France were only intended for wine production⁵⁹. However, while vineyards appeared to be fairly extensive around the city of Nîmes during the early Roman period, wine cellars and wine production equipment were not very widespread in the excavated settlements⁵⁸. This observation could be consistent with the hypothesis that parts of vineyards were dedicated to the cultivation of table grapes. These new results allow us to imagine the possibility of a viticulture partly destined for table purposes using eastern varieties rather than native or locally domesticated varieties.

Data availability

Full datasets will be released upon acceptance. They are (privately) available there: https://figshare.com/s/7578a0740dfbac9e3742.

References

Alston, J. M. & Sambucci, O. Grapes in the world economy. In The Grape Genome (eds Cantu, D. & Walker, A.) 1–24 (Springer International Publishing, 2019) https://doi.org/10.1007/978-3-030-18601-2.
Chapter Google Scholar
McGovern, P. E. Ancient Wine The Search for the Origins of Viniculture (Princeton University Press, 2007).
Google Scholar
Myles, S. et al. Genetic structure and domestication history of the grape. Proc. Natl. Acad. Sci. 108, 3530–3535 (2011).
Article ADS PubMed PubMed Central Google Scholar
Riaz, S. et al. Genetic diversity analysis of cultivated and wild grapevine (Vitis vinifera L.) accessions around the Mediterranean basin and Central Asia. BMC Plant Biol. 18, 1–14 (2018).
Article Google Scholar
McGovern, P. E. et al. Early Neolithic wine of Georgia in the South Caucasus. Proc. Natl. Acad. Sci. 114, E10309–E10318 (2017).
Article PubMed PubMed Central CAS Google Scholar
Sabato, D. et al. Archaeobotanical analysis of a Bronze Age well from Sardinia: A wealth of knowledge. Plant Biosyst. Int. J. Dealing Aspects Plant Biol. 149, 205–215 (2015).
Google Scholar
Pérez-Jordà, G. et al. The emergence of arboriculture in the 1st millennium BC along the Mediterranean’s “Far West”. Agronomy 11, 902 (2021).
Article Google Scholar
Levadoux, L. Les populations sauvages et cultivées des Vitis vinifera L. Annales de l’amélioration des plantes 6, 59–118 (1956).
Google Scholar
This, P., Lacombe, T. & Thomas, M. Historical origins and genetic diversity of wine grapes. Trends Genet. 22, 511–519 (2006).
Article PubMed CAS Google Scholar
Galet, P. Dictionnaire encyclopédique des cépages (Libre & Solidaire, 2015).
Google Scholar
Jones, G., Valamoti, S. & Charles, M. Early crop diversity: A ‘new’ glume wheat from northern Greece. Veg. Hist. Archaeobotany 9, 133–146 (2000).
Article Google Scholar
Terral, J.-F. et al. Historical biogeography of olive domestication (Olea europaea L.) as revealed by geometrical morphometry applied to biological and archaeological material. J. Biogeogr. 31, 63–77 (2004).
Article Google Scholar
Fuller, D. Q. Contrasting patterns in crop domestication and domestication rates: Recent archaeobotanical insights from the old world. Ann. Bot. 100, 903–924 (2007).
Article PubMed PubMed Central Google Scholar
Bonhomme, V. et al. The first shoots of a modern morphometrics approach to the origins of agriculture. Web Ecol. 16, 1–2 (2015).
Article Google Scholar
Mangafa, M. & Kotsakis, K. A New Method for the Identification of Wild and Cultivated Charred Grape Seeds 409–418 (1996).
Bouby, L. et al. Bioarchaeological insights into the process of domestication of grapevine (Vitis vinifera L.) during Roman times in southern France. PLoS ONE 8, e63195 (2013).
Article ADS PubMed PubMed Central CAS Google Scholar
Pagnoux, C. et al. Inferring the agrobiodiversity of Vitis vinifera L. (grapevine) in ancient Greece by comparative shape analysis of archaeological and modern seeds. Veg. Hist. Archaeobotany 24, 75–84 (2015).
Article Google Scholar
Karasik, A., Rahimi, O., David, M., Weiss, E. & Drori, E. Development of a 3D seed morphological tool for grapevine variety identification, and its comparison with SSR analysis. Sci. Rep. 8, 6545 (2018).
Article ADS PubMed PubMed Central CAS Google Scholar
Orrù, M., Grillo, O., Lovicu, G., Venora, G. & Bacchetta, G. Morphological characterisation of Vitis vinifera L. seeds by image analysis and comparison with archaeological remains. Veg. Hist. Archaeobotany 22, 231–242 (2013).
Article Google Scholar
Bonhomme, V. et al. Eco-evo-devo implications and archaeobiological perspectives of trait covariance in fruits of wild and domesticated grapevines. PLoS ONE 15, e0239863 (2020).
Article PubMed PubMed Central CAS Google Scholar
Bonhomme, V. et al. Seed morphology uncovers 1500 years of vine agrobiodiversity before the advent of the Champagne wine. Sci. Rep. 11, 2305 (2021).
Article ADS PubMed PubMed Central CAS Google Scholar
Terral, J.-F. et al. Evolution and history of grapevine (Vitis vinifera) under domestication: New morphometric perspectives to understand seed domestication syndrome and reveal origins of ancient European cultivars. Ann. Bot. 105, 443–455 (2010).
Article PubMed Google Scholar
Boursiquot, J.-M., Faber, M.-P., Blachier, O. & Truel, P. Utilisation par l’informatique et traitement statistique d’un fichier ampélographique. Agronomie 7, 13–20 (1987).
Article Google Scholar
Kui, L., Tang, M., Duan, S., Wang, S. & Dong, X. Identification of selective sweeps in the domesticated table and wine grape (Vitis vinifera L). Front. Plant Sci. 11, 1–11 (2020).
Article ADS Google Scholar
Negrul, A. Origin and classification of cultivated grape. In The Ampelography of the USSR Vol. 1 (eds Baranov, A. et al.) 159–216 (Pischepromizdat, 1946).
Google Scholar
Negrul, A. Variabilität und Vererbung des Geschlechts bei der Rebe. Die Gartenbauwissenschaft X, 215–231 (1936).
Google Scholar
Negrul, A. Evolucija kuljturnyx form vinograda. In Akademii nauk SSSR XVIII 585–588 (1938).
Aradhya, M. K. et al. Genetic structure and differentiation in cultivated grape, Vitis vinifera L. Genet. Res. 81, 179–192 (2003).
Article PubMed CAS Google Scholar
Lacombe, T. Contribution à l’étude de l’histoire évolutive de la vigne cultivée (Vitis vinifera L.) par l’analyse de la diversité génétique neutre et de gènes d’intérêt (Centre International d’Etudes Supérieures en Sciences Agronomiques, 2012).
Google Scholar
Bacilieri, R. et al. Genetic structure in cultivated grapevines is linked to geography and human selection. BMC Plant Biol. 13, 1–14 (2013).
Article Google Scholar
Laucou, V. et al. Extended diversity analysis of cultivated grapevine Vitis vinifera with 10K genome-wide SNPs. PLoS ONE 13, 1–27 (2018).
Article CAS Google Scholar
Emanuelli, F. et al. Genetic diversity and population structure assessed by SSR and SNP markers in a large germplasm collection of grape. BMC Plant Biol. 13, 1–17 (2013).
Article CAS Google Scholar
Grassi, F. et al. Evidence of a Secondary Grapevine Domestication Centre Detected by SSR Analysis 1315–1320 https://doi.org/10.1007/s00122-003-1321-1(2003).
Arroyo-Garcia, R. et al. Multiple origins of cultivated grapevine (Vitis vinifera L. ssp. sativa) based on chloroplast DNA polymorphisms. Mol. Ecol. 15, 3707–3714 (2006).
Article PubMed CAS Google Scholar
Chitwood, D. H. et al. A modern ampelography: A genetic basis for leaf shape and venation patterning in grape. Plant Physiol. 164, 259–272 (2014).
Article PubMed CAS Google Scholar
Maul, E. & Töpfer, R. Vitis International Variety Catalogue (VIVC): A cultivar database referenced by genetic profiles and morphology. In BIO Web of Conferences Vol. 5, 01009 (2015).
OIV. OIV Descriptor list for grape varieties and Vitis species. (2009).
Pomarèdes, H. Nîmes (30) Mas de Vignoles XIV, Rapport final de fouille archéologique. (2018).
Forest, V. & Fabre, M. Étude archéozoologique, ostéologie, conchyliologie (périodes antiques et médiévales), in Pomarèdes H (Dir.) Nîmes (30) Mas de Vignoles XIV, Rapport final de fouille archéologique, INRAP. (2018).
Reimer, P. J. et al. The IntCal20 northern hemisphere radiocarbon age calibration curve (0–55 cal kBP). Radiocarbon 62, 725–757 (2020).
Article CAS Google Scholar
Ramos-Madrigal, J. et al. Palaeogenomic insights into the origins of French grapevine diversity. Nat. Plants 5, 595–603 (2019).
Article PubMed Google Scholar
Kuhl, F. F. P. & Giardina, C. C. R. Elliptic Fourier features of a closed contour. Comput. Graphics Image Process. 18, 236–258 (1982).
Article Google Scholar
Bonhomme, V., Picq, S., Gaucherel, C. & Claude, J. Momocs: Outline analysis using R. J. Stat. Softw. 56, 24 (2014).
Article Google Scholar
R Development Core Team. R: A Language and Environment for Statistical Computing (R Foundation for Statistical Computing, 2021).
Google Scholar
Venables, W. N. & Ripley, B. D. Modern Applied Statistics with S (Springer, 2002).
Book MATH Google Scholar
Wickham, H. et al. Welcome to the Tidyverse. J. Open Source Softw. 4, 1686 (2019).
Article ADS Google Scholar
Suzuki, R. & Shimodaira, H. Pvclust: An R package for assessing the uncertainty in hierarchical clustering. Bioinformatics 22, 1540–1542 (2006).
Article PubMed CAS Google Scholar
Paradis, E. & Schliep, K. Ape 5.0: An environment for modern phylogenetics and evolutionary analyses in R. Bioinformatics 35, 526–528 (2019).
Article PubMed CAS Google Scholar
Shimodaira, H. Approximately unbiased tests of regions using multistep-multiscale bootstrap resampling. Ann. Stat. 32, 2616–2641 (2004).
Article MathSciNet MATH Google Scholar
Evin, A. et al. The long and winding road: Identifying pig domestication through molar size and shape. J. Archaeol. Sci. 40, 735–743 (2013).
Article Google Scholar
Bouby, L. et al. Tracking the history of grapevine cultivation in Georgia by combining geometric morphometrics and ancient DNA. Veg. Hist. Archaeobotany https://doi.org/10.1007/s00334-020-00803-0 (2020).
Article Google Scholar
Bianchi, D., Brancadoro, L. & De Lorenzis, G. Genetic diversity and population structure in a Vitis spp. core collection investigated by SNP markers. Diversity 12(3), 103 (2020).
Article CAS Google Scholar
Raimondi, S. et al. DNA-based genealogy reconstruction of Nebbiolo, Barbera and other ancient grapevine cultivars from northwestern Italy. Sci. Rep. 10, 1–16 (2020).
Article ADS CAS Google Scholar
Pressoir, G. & Berthaud, J. Population structure and strong divergent selection shape phenotypic diversification in maize landraces. Heredity 92, 95–101 (2004).
Article PubMed CAS Google Scholar
Guo, D. L. et al. Genome-wide association study of berry-related traits in grape [Vitis vinifera L.] based on genotyping-by-sequencing markers. Hortic. Res. 6, 1–13 (2019).
Article CAS Google Scholar
Pagnoux, C. et al. Local domestication or diffusion? Insights into viticulture in Greece from Neolithic to Archaic times, using geometric morphometric analyses of archaeological grape seeds. J. Archaeol. Sci. 125, 105263 (2021).
Article Google Scholar
André, J. L’alimentation et la cuisine à Rome. (2009).
Pomarèdes, H., Bel, V., Monteil, M., Séjalon, P. & Vidal, L. Le paysage périurbain à Nîmes (Gard, France) de la Protohistoire au Haut-Empire (VIe av. n. è.- IIe s. de n. è.). In Le paysage périurbain pendant la Protohistoire et l’Antiquité en Méditerranée occidentale 287–318 (2009).
Bouby, L., Marinval, P. & Terral, J.-F. From secondary to speculative production? The protohistorical history of viticulture in Southern France. In Plants and People: Choices and Diversity through Time (eds Chevalier, A. et al.) 175–181 (Oxbow Books, 2014).
Google Scholar

Download references

Acknowledgements

This research is funded by French National Agency, (ANR-16-CE27-0013) “Vignes et vins en France du Néolithique au Moyen Âge. Approche intégrée en archéosciences” (PI: LB). AE has received funding from the European Research Council (ERC) under the European Union’s Horizon 2020 research and innovation program (grant agreement No. 852573). We are grateful to the INRA Vassal-Montpellier grapevine collection (Marseillan-Plage, France) and the Saguramo Grape Repository (Jighaura, Georgia), which provided all the pips from cultivated varieties, and for that of the OSU-OREME (https://oreme.org/), which helped the constitution of the wild grape pip collection. All relevant permits or permissions have been obtained to obtain the plants and the studies conducted comply with local and national regulations or guidelines. We warmly acknowledge Aya Alphs for her help with proofreading.

Author information

Authors and Affiliations

ISEM, University of Montpellier, CNRS, EPHE, IRD, Montpellier, France
Vincent Bonhomme, Sarah Ivorra, Allowen Evin, Isabel Figueiral, Clémence Pagnoux, Thierry Pastor, Jean-Frédéric Terral & Laurent Bouby
UMR AGAP Institut, Univ Montpellier, CIRAD, INRAE, Institut Agro, 34398, Montpellier, France
Thierry Lacombe & Roberto Bacilieri
Grapevine Biological Resources Center, INRAE, Unité Expérimentale Domaine de Vassal, University of Montpellier, Marseillan, France
Thierry Lacombe & Cécile Marchal
INRAP Méditerranée, Center of Villeneuve-les-Béziers, Villeneuve-les-Béziers, France
Isabel Figueiral & Hervé Pomarèdes
National Wine Agency of Georgia, Tbilisi, Georgia
David Maghradze
École Française d’Athènes, Athens, Greece
Clémence Pagnoux

Authors

Vincent Bonhomme
View author publications
You can also search for this author in PubMed Google Scholar
Sarah Ivorra
View author publications
You can also search for this author in PubMed Google Scholar
Thierry Lacombe
View author publications
You can also search for this author in PubMed Google Scholar
Allowen Evin
View author publications
You can also search for this author in PubMed Google Scholar
Isabel Figueiral
View author publications
You can also search for this author in PubMed Google Scholar
David Maghradze
View author publications
You can also search for this author in PubMed Google Scholar
Cécile Marchal
View author publications
You can also search for this author in PubMed Google Scholar
Clémence Pagnoux
View author publications
You can also search for this author in PubMed Google Scholar
Thierry Pastor
View author publications
You can also search for this author in PubMed Google Scholar
Hervé Pomarèdes
View author publications
You can also search for this author in PubMed Google Scholar
Roberto Bacilieri
View author publications
You can also search for this author in PubMed Google Scholar
Jean-Frédéric Terral
View author publications
You can also search for this author in PubMed Google Scholar
Laurent Bouby
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

Conceptualization: L.B., J.F.T., V.B., T.L., R.B.; Data curation: S.I., T.L., T.P., V.B., C.P., R.B., J.F.T., L.B.; Formal analysis: V.B., A.E.; Funding acquisition: L.B., A.E.; Investigation: V.B., S.I., T.P., J.F.T., L.B.; Methodology: V.B.; Project administration: L.B., J.F.T.; Resources: T.L., C.M., D.M., I.F., H.P.; Software : V.B.; Visualization: V.B.; Writing—original draft: V.B., L.B.; Writing—review and editing: V.B., L.B., J.F.T., A.E., T.L., R.B., I.F.

Corresponding authors

Correspondence to Vincent Bonhomme or Laurent Bouby.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Supplementary Information 1.

Supplementary Information 2.

Supplementary Information 3.

Supplementary Information 4.

Supplementary Information 5.

Supplementary Information 6.

Supplementary Information 7.

Supplementary Information 8.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Bonhomme, V., Ivorra, S., Lacombe, T. et al. Pip shape echoes grapevine domestication history. Sci Rep 11, 21381 (2021). https://doi.org/10.1038/s41598-021-00877-4

Download citation

Received: 03 May 2021
Accepted: 15 October 2021
Published: 01 November 2021
DOI: https://doi.org/10.1038/s41598-021-00877-4

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.