Introduction

Characterization of the crop growth response spatial variability plays a key role in achieving site-specific management that optimizes grower inputs on areas much smaller than the entire field. In viticulture the knowledge of “vigour” defined by Winkler et al. (1974)1 as the condition expressed in the rapid growth of the parts of the vine, is very useful information for farmers to monitor canopy development and grape production parameters2. Since the late 1970s, satellite and aircraft platforms have been widely used in viticulture3 for the monitoring of “vigour” spatial variability through the calculation of vegetation indices, based on the canopy spectral response to solar radiation in the visible and near infrared spectrum. However, those platforms offer a limited spatial resolution that does not allow a correct analysis of pure canopy pixels, by removing the inter-row component4. In the last decade, the continuous advance in Unmanned Aerial Vehicle (UAV) technologies led to an increasing of spatial ground resolution5. Combining powerful photogrammetric software and image analysis methodologies it is possible to perform a geometric analysis at single vine level for canopy height, thickness and volume estimation6. As hypothesized by previous works7,8, this made it possible to identify new effective solutions to monitor vine vegetative and productive characteristics, with respect to traditional vegetation indices, such as the widely diffused Normalized Difference Vegetation Indices (NDVI). In literature, the use of UAV remote sensing for applications in vineyards is well assessed, however only a few works support remote data with ground truth agronomic measurements made with traditional methods. Those works focus on diseases with ground quantified symptoms9,10, water stress with leaf stomatal conductance and leaf water potential5,11,12,13,14, vegetative development by means of pruning weight and leaf area index (LAI)4,5,15,16, leaf compounds as carotenoid content and net photosynthesis17,18 and grape quality compounds and yield5,19,20,21,22,23. Exploring new approaches for agronomic parameters estimation is a topic of great interest as it would allow to overcome the limitation of traditional time-consuming and destructive measurements, frequently subjective or little representative. With regards to yield prediction, many studies have achieved meaningful results, but the relationships identified have not always been clarified by linear equations. In this context, new Machine Learning (ML) techniques based on non- and semi-parametric structures have become suitable to solve nonlinear and complex problems24,25. The most commonly used ML techniques are Support Vector Machine (SVM), Decision Tree (DT), Random Forest (RF), Gaussian Process Regression (GPR) and Artificial Neural Network (ANN). Jeong et al. (2016)26 found that RF was highly capable of predicting crop yields in wheat, maize and potato, outperforming multiple linear regression. Romero et al. (2013)27 applied several ML methods for the classification of yield components of wheat and showed that the association rule mining method obtained the best performance. Khaki et al. (2020)28 described a yield model based on convolutional neural networks (CNNs) and recurrent neural networks (RNNs), which successfully generalized the yield prediction to untested locations for maize (RMSE = 24.10 bushels/acre, validation correlation = 75.04%) and soybean (RMSE = 6.35 bushels/acre, validation correlation = 77.84%) yield. In the field phenotyping area, Herrero‑Huerta et al. (2020)29 presented a research focussed on capability of ML techniques to perform grain yield prediction in soybeans by combining data from multispectral and RGB cameras equipped on UAV platforms, achieving an accuracy of over 90.72% by RF and 91.36% by eXtreme Gradient Boosting (XGBoost). Zhou et al. (2020)30 compared the model performances for predicting wheat grain yield and protein content between the ML algorithms based on spectral reflectance bands and plant height and the traditional linear regression based on vegetation.

Indices. The research reported that the linear regression model based on the enhanced vegetation index (EVI) provided highest performance capable of predicting the yield with a RMSE = 972 kg/ha, while the RF model based on reflectance bands was capable of predicting the protein content with an RMSE of 1.07%. Unfortunately, there is a lack of this kind of research in viticulture. A recent paper22 proposed a methodology based on the combination of spectral (NDVI) and geometric (Fc) UAV indices to establish a relationship with the final yield by using ANN techniques. That research provided interesting results with linear approach models (mean R2 = 0.7, mean RMSE = 1.0 k g/vine, mean RE = 23.9%) in the flight closer to harvest in September, but higher predictive accuracy was found with ANN approach (R2 = 0.9, RMSE = 0.5 kg/vine, RE = 12.1%). However, low correlation was obtained applying the ANN model to the following season providing a yield overestimation (R2 = 0.3, accuracy indicator not reported). That result highlighted the main limitation of the methodology in the seasonal variability, avoiding the opportunity to obtain a global model that returns accurate results for any year. Our work aims to evaluate innovative and accurate forecast methods based on ML techniques applied on high resolution UAV remote sensing data for the prediction of key role agronomic parameters in a sector such as viticulture barely explored with those techniques. Given the significant impact of climate change and terroir on vine physiological response, the added value of the research is represented by the use of a huge dataset considering both the temporal factor by examining 3 very different vegetative seasons, and spatial factor on 3 experimental sites with very different characteristics. Another strength and innovation of the work is the fact of validating the predictive models identified on all the plants present in the study vineyards and not just a few sample plants, thanks to a protocol for the extraction of remote sensing indices as input of the models at the single vine level. The starting point of our research is the performance evaluation of spectral (NDVI) and geometric (canopy thickness and volume) UAV indices with respect to productive (yield and total soluble solids) and vegetative (pruning weight) parameters estimation. The evaluation of these correlations was then deepened by applying an in-depth analysis of the potential of the ML-based models. Finally, considering the importance of yield prediction for farmers21,31, the overall goal of this research is the validation of a novel yield forecast method based on a UAV image acquired several weeks before harvest.

Materials and methods

Experimental vineyard

The research was undertaken during the growing seasons 2017, 2018 and 2019 in a 1.4 ha vineyard (355 m above sea level) planted in 2008, owned by Castello di Fonterutoli farm and located near Castellina in Chianti (Siena, Italy) (43° 25′ 45.30″ N, 11° 17′ 17.92″ E) (Fig. 1a). Sangiovese cv. (Vitis vinifera) vines were trained with a vertical shoot-positioned trellis system and spur-pruned single cordon. The vine spacing was 2.2 m × 0.75 m (inter-row and intra-row) and the rows were NW–SE oriented on a slight southern slope.

Figure 1
figure 1

Experimental vineyards: Caggio (a), Belvedere (b) and Solatio (c) sites. The images were created using RGB imagery acquired by UAV during the flight campaign in the 2017 season.

For the validation, two other vineyards were used as independent dataset (only 2019 data), identified as Belvedere (Fig. 1b) and Solatio (Fig. 1c), located in the same area characterized by different conditions with respect to the calibration site. Table 1 summarizes the vineyard features.

Table 1 Experimental vineyards.

Ground measurements

At the beginning of the study, the analysis of multispectral imagery collected in the 2017 season, was used to characterize vigour spatial variability and plan the experimental design (Fig. 2). Representative vine vigour zones were chosen: one with high vigour in the north (HV), a second with low vigour in the middle (LV), and a third with intermediate vegetative behaviour in the south (MV). Within each zone, a sampling area of about 30 × 30 m (0.1 ha) was identified, in which 18 sample plants were identified, for a total of 54 plants within the vineyard. Ground truth measurements related to productive and vegetative parameters were performed every year on each sample vine. At harvest time, yield (kg/vine) and total soluble solids or sugar content (°Brix) were measured for each sampled vine, using a field scale and a hand-held optical refractometer respectively. As indicator of vine vigour, total shoot fresh mass (kg/vine) was determined in the field for each vine in the dormant period following the growing seasons.

Figure 2
figure 2

NDVI map created using UAV images acquired during the flight campaign in June 2017 with representative high vigour (HV), medium vigour (MV) and low vigour (LV) zones.

UAV platform and imagery acquisition

Remote sensed images were acquired using a prototype UAV platform consisting of a modified multirotor Mikrokopter (HiSystems GmbH, Moomerland, Germany) (Fig. 1b), described in a previous paper of the authors21. The UAV was equipped with an ADC Snap (Tetracam, Inc., Gainesville, FL, USA) multispectral camera, which provides 1.3 MP images in the green (520–600 nm), red (630–690 nm) and NIR (760–900 nm) bands. The use of that camera yielded a ground resolution of 0.03 m/pixel at 50 m above ground level (AGL) flight quote. A series of flight campaigns were performed at the beginning of August of each season in veraison phenological stage (9 August 2017, 8 August 2018 and 6 August 2019).

Climatic characterization

The climate analysis of each growing season (2017–2018–2019) was conducted by the calculation of some bioclimatic indices, such as sum of daily temperature (min, average and max), number of days with Tmax > 35 °C and cumulative rain in the summer period (June–August), which are based on air temperature and rain parameters collected by a traditional agrometeorological weather station located close to the study vineyard. The elaboration of the bioclimatic indices made it possible to discuss some results relating to the differences in productive and vegetative parameters over the three monitored years.

Image analysis

Multispectral images acquired by UAV were pre-processed using Agisoft Metashape Professional photogrammetric software (Agisoft LLC, St. Petersburg, Russia), which allows to export the orthomosaic and the digital elevation model (DEM) of the entire vineyard. A vicarious calibration based on the absolute radiance method was chosen, given that the digital number (DN) value for each pixel has a direct relationship (linear model) with the radiance detected by the sensor. For this radiometric calibration process, images from three OptoPolymer (OptoPolymer—Werner Sanftenberg, Munich, Germany) homogeneous and Lambertian surface panels, with 95%, 50% and 5% reflectance, were acquired for each flight.

The filtering procedure of the pure vine canopy pixels was assessed using DEM method with Matlab v.2019a (Mathworks, Natick, MA, USA) as described in Cinat et al. (2019)32. The NDVI was computed according to the following Eq. (1):

$$NDVI = (NIR - RED)/(NIR + RED)$$
(1)

where NIR and RED are the spectral reflectance in near infrared and red bands, respectively.

Geometric variables related to canopy thickness and volume were calculated using the 2.5D methodology with Matlab v.2019a (Mathworks, Natick, MA, USA) described in Di Gennaro and Matese (2020)6. Starting from the DEM of vines enclosed within a polygon grid, canopy thickness and vine number are extrapolated through the binarized image of the canopy extracted from each polygon grid. Following that approach, the missing plants present in each vineyard were identified and counted to estimate the yield of the vineyards based on the real vine number (RVN). This operation was fundamental, since a vineyard, once established, commonly loses vines each year due to diseases or abiotic stress. Considering that the number of missing plants can exceed 20%, the decrease in production based on the average production per plant multiplied by the theoretical total number of plants (defined by the vine spacing) can cause a high degree of overestimation.

The NDVI filtered (NDVI_f), canopy thickness (thick) and the canopy volume (Voldem) parameters were extracted for each vine within the vineyard and used as validation dataset. In detail, the real vine number detected by UAV approach respect to original number at planting time were 6264 versus 7273, 7780 versus 9185 and 10,231 versus 14,733 for Caggio, Solatio and Belvedere respectively.

Yield prediction models based on machine learning approach

Matlab’s Regression Learner app was used to train regression models to predict data yield, total soluble solids and pruning weight parameters. Users can perform automated training to search for the best regression model, including linear regression models, regression trees, Gaussian process regression models, support vector machines, and ensembles of regression trees (Matlab v.2019a). In this work the following regression methods were applied on the full dataset with the aim of taking into account the intra-annual climate variability considering three very different vintages. SVM regression is a nonparametric technique because it relies on kernel functions. Gaussian nonlinear kernel function was also used in this work. DTs are amongst the most intuitively simple classifiers. RF is an ensemble classifier, as it uses many DTs to overcome the weaknesses of a single DT. Boosted DTs are also an ensemble method using DTs. GPR models are nonparametric kernel-based probabilistic models. Exponential kernel function was used in this work. The selected models were applied to identify the performances of the regressions (R2, RMSE, MAE, RMSE% and MAE%) with the parameters measured on the ground, considering the filtered NDVI, canopy thickness and volume separately, but also by combining the filtered NDVI factors and canopy thickness, and combining all the factors processed by data acquired by UAV. The RMSE and the MAE indicators were assessed with the following formula:

$$RMSE = \sqrt {\frac{1}{n}\sum\nolimits_{j = 1}^n {{{(Predicte{d_j} - Observe{d_j})}^2}} }$$
(2)
$$MAE = \frac{1}{n}\sum\nolimits_{j = 1}^n {\left| {Predicte{d_j} - Observe{d_j}} \right|}$$
(3)

while MAE% and RMSE% were calculated as ratio between MAE and RMSE versus average of observed values.

Having identified the yield as parameter of greatest productive interest, the validation step was assessed only on this parameter by applying the best performing models to estimate the yield (quintals per hectare). The validation was done on Caggio site (Fig. 1a) using a new dataset of remote sensed parameters extracted from all the plants of the vineyard on which the previously developed models (using 54 sampled vines) were applied. Moreover, validation was performed also on two other independent dataset (Belvedere and Solatio vineyard Fig. 1b and 1c). In detail, the validation was made considering the total production of the vineyards measured at harvest. The ML models were tested using a fivefold cross validation. In addition, the model performance indicators were assessed also without cross validation. The models were then applied to a larger dataset, represented by the cumulative production of all plants present in each of the 3 vineyards.

The yield data estimated with the methodology suggested in this paper were compared with the data estimated by the farmer and production data collected during the harvest. The traditional method of yield estimation used by the company was performed a few days before the survey by UAV, and requires visual inspection in representative areas of the vineyard according to the know-how of the agronomist who makes a rapid observation by counting the number of bunches per plant on some vines. Once an average value per plant has been identified, the agronomist multiplies the number of bunches by the average bunch weight, specific to each variety.

Results and discussion

The results obtained from the research are presented in the following subsections, according to the order defined in the Materials and Methods section.

Climatic characterization

Table 2 reports several bioclimatic indices, obtained from meteorological data representative of the study site collected by a weather station located on the farm during the three growing seasons (2017–2019).

Table 2 Climatic characterization of the 2017–208–2019 seasons by calculation of the sum of daily temperature (min, average and max), number of days with Tmax > 35 °C and cumulative rain in the summer period (June–August).

Considering that the summer period is a critical phase for grape production, Table 2 describes very different seasons. In particular, the summer of 2017 was characterized by higher daily maximum temperatures, with 41 days of extreme temperatures above 35 °C, and minimal rainfall (50 mm). In that period, 2018 instead had lower daily maximum temperatures, with the fewest days with extreme temperatures (12 days) and highest rainfall (152 mm). The 2019 season was more temperate, with intermediate values compared to 2017, which was extremely hot and dry, and 2018 cooler and with greater rainfall intensity.

Ground and remote characterization of vineyard variability

The results of the productive and vegetative characterization performed with destructive ground sampling and UAV images acquisition are summarized in Table 3. The results demonstrated the correct planning of the experimental design, since during the three seasons each vigour zone shows a trend with strong differences in the HV compared to LV zones, and intermediate values in the MV zone.

Table 3 Productive and vegetative characterization of vines within each vigour zone during 2017, 2018 and 2019 seasons (mean and standard deviation).

In the first columns related to ground destructive sampling, higher yield and total fresh biomass values are observed in HV than in LV zones, on the contrary total soluble solids have lower values in HV zone. The ground truth measurements demonstrate the strong climatic impact on vine physiological response. In particular, the hot and dry 2017 season led to a very low yield and minimum vegetative development. While regarding the sugar accumulation, the strong impact of summer abiotic stresses in that season caused a reduction of photosynthetic efficiency and sugar synthesis23,33, as a result, lower values than the 2019 season are observed. The 2018 season, with cooler temperatures and higher rainfall, showed higher production and biomass values and lower sugar accumulation in the bunches. The intermediate climatic conditions of 2019 led to yield, quality and vegetative development in line with historical averages.

In the last three columns, Table 3 shows the results of the spectral and geometric elaboration of the UAV data. The images processing outputs confirmed the same trend between the different vigour zones identified by ground-truth measurements. Specifically, the HV zones are characterized by plants with higher values of both NDVI and canopy thickness and volume, while lower values emerge in the LV and intermediate values in the MV areas. In detail, the extreme temperatures and minimum rainfall of 2017 season led to lower values of both NDVI and vegetative growth of the canopy. On the contrary, the peculiarities of 2018 generally translated into greater photosynthetic activity and increased canopy growth. During this season the farm did two canopy trimmings to control the higher shoots growth, but as a consequence lateral growth was stimulated increasing canopy density34,35. The rainier and cooler climate of 2018 combined with the double trimming, led to a high vegetative response which significantly reduced heterogeneity within the vineyard. As a consequence, minimal differences between HV and LV zones were observed on all UAV indices. With regards to 2019 season, spectral and geometric indices were in intermediate position between the other years.

Linear regression between ground measurements and UAV spectral and geometric indices

Figure 3 shows the linear regression results obtained between ground measurements of yield (kg/vine) and UAV spectral and geometric indices. As expected, the yield observed shows positive correlations with all UAV products. In 2017 the best correlation was obtained with the thickness (R2 = 0.80), but also the NDVI and volume parameters provide high performance (R2 = 0.69 and R2 = 0.68 respectively). 2018 presented the lowest correlations for all three parameters, and the thickness is still the most correlated with yield (R2 = 0.54). Regarding 2019, all three remotely sensed parameters showed high and significant correlations, higher for thickness and volume (R2 = 0.63) with respect to NDVI (R2 = 0.56). In general, during the 3 years all the UAV-based parameters presented high significant correlations with yield data (p value < 0.001), except for 2018 when NDVI and canopy volume show less significance (p value < 0.05). This can be explained as a function of the high vegetative response to climate conditions and canopy management that was observed at the beginning of August, in terms of both photosynthetic efficiency and vegetative development, highlighted by higher NDVI and geometric values respectively.

Figure 3
figure 3

Linear regression results between ground measurements of yield (kg/vine) and UAV remote sensed data, represented by NDVI filtered, canopy thickness (m) and volume (m3) in the HV (green dots), MV (yellow dots) and LV (red dots) zones. Significance: ***p < 0.001; **p < 0.01; *p < 0.05.

The linear regressions between sugar content (°Brix) and the UAV indices showed negative correlations, in line with what was expected from the vegetative-productive balance (Fig. 4). As observed for the yield, the geometric indices showed similar or better correlations with respect to the spectral index NDVI. In 2017 the best correlation was obtained with the volume (R2 = 0.66), while the thickness showed the lowest coefficient of determination (R2 = 0.26). The canopy thickness and volume presented good and similar correlations in 2018 (R2 = 0.51 and R2 = 0.48 respectively) with respect to the NDVI (R2 = 0.29). As for 2019, similar correlations were observed between the three UAV indices (0.38 < R2 < 0.41), with lower values for the NDVI. In general, there were lower R2 and significance than the regressions obtained with the yield, this may be due to the greater sensitivity of sugar concentration to climatic factors (rainfall, wind, etc.), which may occur even a few days before harvest.

Figure 4
figure 4

Linear regression results between ground measurements of total soluble solids (°Brix) and UAV remote sensed data, represented by NDVI filtered, canopy thickness (m) and volume (m3) in the HV (green dots), MV (yellow dots) and LV (red dots) zones. Significance: ***p < 0.001; **p < 0.01; *p < 0.05.

The results of the linear regressions between pruning weight measurements and UAV data are presented in Fig. 5, which shows positive and significant correlations. In both the 2017 and 2018 seasons, the NDVI presented higher coefficient of determination values (R2 = 0.73 and R2 = 0.67 respectively) compared to the geometric indices, which showed similar results ranging between R2 = 0.59 and R2 = 0.65. However, in 2018 the geometric indices provided performances similar to NDVI in biomass estimation. As for the 2019 season, canopy thickness was the most correlated index (R2 = 0.60) followed by canopy volume (R2 = 0.51), while the NDVI showed lower results (R2 = 0.44). Given that the vine considerably slows down vegetative development at the beginning of August with veraison, to concentrate resources on the grape ripening process, the biomass data observed at the end of the season is very representative of the biomass at the time of flight, explaining the excellent correlations with UAV data, with respect to the more dynamic behaviour of the total soluble solids content.

Figure 5
figure 5

Linear regression results between ground measurements of pruning weight (kg) and UAV remote sensed data, represented by NDVI filtered, canopy thickness (m) and volume (m3) in the HV (green dots), MV (yellow dots) and LV (red dots) zones. Significance: ***p < 0.001; **p < 0.01; *p < 0.05.

The results between remote spectral and geometric data presented are in line with those reported in the literature. The importance of vegetative growth in terms of vine canopy thickness with respect to spectral information is well assessed by other papers7,8,36. Hall et al. (2010)8 analysed the relationship between remote descriptors of vine status with the agronomic variables chosen in this work (yield, sugar content and pruning weight). That research identified higher correlation at veraison for the yield with canopy thickness (r = 0.58) reported as CA (canopy area), with respect to spectral data (r = 0.46) referred to as CD (canopy density calculated as mean NDVI values per plant), providing similar but lower correlations than our results. Considering sugar accumulation, that research confirmed the negative correlation with remote indices, although nevertheless lower with respect to this work. Furthermore, Hall et al. (2010)8 identified that CA showed higher correlation with pruning weight than CD. In this case, our results identified canopy thickness as a stable descriptor of biomass (0.60 > R2 < 0.62), while the NDVI presented different behaviour over the 3 seasons (0.44 > R2 < 0.73).

Machine learning prediction models

Tables 4, 5 and 6 shows the results of application of the ML models on the complete dataset containing the 3 years of data acquired on Caggio vineyard related to ground measurements and UAV indices. The results in terms of coefficient of determination and RMSE, showed the same trend that emerged from the linear regressions applied to the individual years (Figs. 3, 4, 5), namely that the UAV data were more correlated with yield and pruning weight than to the total soluble solids content. In all cases, the linear model provided poorer performance, thus making the selected models interesting to identify a methodology for key parameters estimation in viticulture. Regarding the regressions with the single spectral and geometric UAV data, the linear models showed an average determination coefficient of about 0.66 over the 3 years compared to ML-based models with average values of 0.81. A similar trend was found for pruning weight and total soluble solids with an average R2 = 0.59 and 0.32 respectively in the linear models, and R2 = 0.71 and 0.48 in the regressive ML models. Taking into consideration the regressions with aggregated values of spectral and geometric UAV parameters, i.e. the NDVI combination with canopy thickness, and the combination of all 3 parameters, Tables 4, 5, 6 always present more performing regressions than those obtained with the individual parameters. The results provided by the application of the different ML models provided similar values of R2 and RMSE as regards yield, small differences on the pruning weight, while on the total soluble solids content there were important differences such as the Tree fine model with R2 = 0.54 and RF boosted model with R2 = 0.32 compared to the NDVI.

Table 4 Regression models performance (R2, RMSE, MAE, RMSE%, MAE%) obtained from the complete dataset (2017–2018-2019) between yield and NDVI filtered (NDVI_f), canopy thickness (Thick), canopy volume (VolDem), NDVI filtered and canopy thickness (NDVI_f Thick), NDVI filtered, canopy thickness and canopy volume (NDVI_f Thick VolDem). Each cell presents 2 values, the first calculated without cross validation, and the second using fivefold cross validation. The best resulting models are bold typed.
Table 5 Regression model’s performance (R2, RMSE, MAE, RMSE%, MAE%) obtained from the complete dataset (2017–2018–2019) between pruning weight and NDVI filtered (NDVI_f), canopy thickness (Thick), canopy volume (VolDem), NDVI filtered and canopy thickness (NDVI_f Thick), NDVI filtered, canopy thickness and canopy volume (NDVI_f Thick VolDem). Each cell presents 2 values, the first calculated without cross validation, and the second using fivefold cross validation. The best resulting models are bold typed.
Table 6 Regression model’s performance (R2, RMSE, MAE, RMSE%, MAE%) obtained from the complete dataset (2017–2018–2019) between total soluble solids and NDVI filtered (NDVI_f), canopy thickness (Thick), canopy volume (VolDem), NDVI filtered and canopy thickness (NDVI_f Thick), NDVI filtered, canopy thickness and canopy volume (NDVI_f Thick VolDem). Each cell presents 2 values, the first calculated without cross validation, and the second using fivefold cross validation. The best resulting models are bold typed.

Considering the yield parameter as the most relevant for farmers, an in-depth analysis of the potential of the ML models for yield forecast was conducted. The most commonly used linear model was compared with the most performing ML model identified in Table 4. Furthermore, given the similar performances of canopy thickness and volume, it was decided to use the first parameter as it required lower computing power and processing time to be elaborated from UAV images. Table 7 therefore shows the results of the selected models with and without cross validation for the yield forecast on Caggio (2017–2018–2019) and Solatio and Belvedere (2019) vineyards.

Table 7 Yield models’ output (quintals per hectare) related to real plant number without missing plants (Yield RVN) using NDVI filtered (NDVI_f), canopy thickness (Thick), NDVI filtered and canopy thickness (NDVI_f Thick) compared to yield ground estimation and yield harvested. The best resulting models are bold typed.

The yield is strongly affected by the number of missing plants present in the vineyard for reasons related to senescence, mechanical damage or wood diseases, such as the widespread esca disease9, however it is not easy for the farmer to have the knowledge of that number, since a visual count walking along all the rows every year would be extremely time-consuming. Considering the accurate missing vine detection by our approach, the RVN yield data estimated by UAV imagery are considerably more accurate than the values estimated by the agronomist by means visual ground observation, confirming the good performance of the post processing workflow in the recognition and counting of missing plants along the rows.

The yield estimation by visual observations in the vineyard responded with good accuracy to the real production harvested. However, in 2019 season the ground estimation provided important errors on the three vineyards examined, probably due to low representative vines chosen for ground observation. As expected by previous results, both spectral and geometric indices provided similar yield estimation values, however it is observed that models based on a single NDVI input variable showed lower accuracy, than single use of the canopy thickness or combined with NDVI. Furthermore, models based on single NDVI didn’t present a clear behaviour, while the linear model demonstrated lower predictive performance than the exp GPR and SVM models, applied on single canopy thickness and combined with NDVI respectively. Considering the performance of these models, the single use of canopy thickness input variable led to higher accuracy than combined with NDVI, with the exception of the 2018 season. This could be a consequence of the greater stability of the geometrical indices, compared to the high sensitivity of the spectral data that is affected by the light environmental conditions and the heterogeneous structure of the top of the canopy, due to leaves with different angle and orientation and mixed light and shadow conditions within the same canopy. However, it was an anomalous year with two canopy trimmings, the second approximately 17 days before the UAV flight, which strongly stimulated the vine vegetative response resulting in a huge development of secondary shoots. The high values of both NDVI for the spectral response of the youngest leaves and the increase in canopy volume caused a yield overestimation in all the models. The experience gained in this work has highlighted the importance of monitoring time, in particular in a year with vegetative excesses such as 2018, the survey should have been done a few days after canopy trimming to avoid the impact of the strong vegetative response. The exp GPR model with canopy thickness values as input data was the best performing, with an average error over the 3 years in Caggio and for 2019 in Solatio and Belvedere of about 20.5% compared to the estimates made by visual observations on the ground that provided a 16.0% error. However, excluding the error obtained from the methodology suggested in the year 2018 (64.3%), the accuracy of the yield forecast methodology by UAV platform significantly increases, presenting an error of 9.6%, therefore a better estimate than the traditional method.

The most accurate models for yield forecast in Belvedere site was RF boosted selected after a k-fold cross validation, while the best accuracy in the other vineyards (Caggio and Solatio) was reached using models selected without cross validation. Indeed, the best overall score might not be the best model for yield forecast and sometimes a model with slightly lower overall score could be the better model.

In the literature there is only a similar work proposed by Ballesteros et al. (2020)22, which described a multitemporal analysis using spectral and geometric data elaborated from multispectral UAV sensing, for the development of a yield predictive model based on linear and ANN regressions. The linear regressions for the 2017 season showed that better correlations were identified in September close to the harvest, and the best model suggested is the one with multiple linear regression for all flight dates with final yield reaching R2 = 0.76 with the pure canopy pixel NDVI (NDVIWIV), then similar results with fraction vegetation cover (Fc) or both combined (NDVIWIV x Fc) (R2 = 0.71). That work therefore presented a different trend to what emerged from our research in which the filtered NDVI data (NDVI_f) and canopy thickness (Thick) provided a linear regression with similar coefficient of determination as single variables (R2 = 0.65 and R2 = 0.64 respectively), while better results when combined together (R2 = 0.71). Taking the RMSE data into consideration, the most accurate estimated yield values were provided by the NDVI_f xThick model identified in our work (RMSE = 0.73 kg/vine) compared to the NDVIWIV model of the cited work (RMSE = 0.81 kg/vine). With regard to the results obtained with the application of predictive ANN models applied on the combined spectral and geometric information factors, both researches showed better performing outputs in terms of both R2 and RMSE compared to traditional regressions. In this respect, the data obtained from the cited research, was slightly better than the results presented in this paper. This aspect is probably due to the fact that the analysis done by Ballesteros et al. (2020)22 was related to a single year and wasn’t affected by inter-annual variability, while our research takes into consideration three different seasons, which however make the model much more robust as subsequently demonstrated by the validation results on two other vineyards at the end of the 3 years of experimentation. The validation of the models analysed is much more structured than that described in the paper of Ballesteros et al. (2020)22, where a survey made the following year on the same experimental site was used, while there were 3 different study sites in this work. The models calibrated over 3 years were efficient on all vineyards with estimated production values close to those observed, while the research by Ballesteros et al. (2020)22 by applying the model calibrated in 2017 on the same study site the following year, provided a lower coefficient of determination (R2 = 0.32) and an overestimation with almost double yield values per vine. The results obtained from 3 years of experimentation on such different years, have allowed to overcome the limits in the cited work on the difficulty of identifying global predictive production models using only 1 year of calibration due to inter-annual yield and canopy variability.

Overall evaluation

The NDVI index becomes an excellent indicator of vegetative vigour taking into consideration intensive crops with horizontal development, such as cereals or horticulture, providing combined information on vegetative activity (photosynthesis) and growth density (biomass). The limit of this index emerges when the vigour of discontinuous tree crops, such as vines, olives and pome fruits has to be assessed. In this condition, the soil component with respect to canopy cover is relevant and above all the vertical development is key factor for vegetative growth assessment. In these crops, the leaf spectral response of the canopy top describes only a part of the vigour of the plant, as all the vegetative growth component is lost (canopy width, height and volume). Furthermore, there are a large number of factors that can affect the quality of spectral data: stable light conditions, sun radiation angle, radiometric correction know-how, leaf status such as age, mechanical damage and symptoms due to biotic and abiotic stresses.

Today, the high spatial resolution obtained thanks to the use of UAV opens new possibilities in the study and characterization of vigour, allowing filtering techniques to be applied and pure canopy pixel spectral data extracted. At the same time, the application of SfM algorithms on a UAV dataset with high overlap level led to a volumetric reconstruction of the canopy directly linked to vegetative growth and biomass. Following this approach, it becomes possible to use high resolution multispectral images to individually quantify the components of the photosynthetically active biomass (PAB), obtaining the spectral response of the canopy from 2D analysis and biomass data from 3D analysis, thus overcoming the limit of the traditional NDVI index in the characterization of vigour. In support of the geometric approach is added the fact that it can be obtained with common RGB cameras, with minimum costs and very high performance derived from the extreme spatial resolution6. In general, the geometric approach presented in this paper can be applied on many tree crops, however on some exceptions with full soil covering such as overhead trellis system21 and on extended crops such as horticultural crops or cereals, the NDVI index remains the only applicable solution for estimating production and vegetative parameters.

Conclusions

In the last decade there has been an exponential growth in research activity on the identification of correlations between vegetational indices elaborated by a multispectral camera mounted on UAVs and productive and vegetative parameters of the vine. Our results underlined the great potential of ML approaches to predict several weeks before harvest the yield, which is the agronomic parameter that mainly drive the economic return for the farmer. Compared to previous works, our results were supported by a robust dataset that examined both climatic variability working on 3 different years and different field conditions using 3 study sites. The model based on geometric data presents better results than the use of NDVI data; this opens up an extremely interesting perspective to the proposed methodology by focusing on RGB sensors compared to multispectral cameras. Among the main advantages, production estimates could be made using very simple and inexpensive instrumentation (< 1000.00 €), overcoming any problems given the need for spectral know-how on radiometric correction and data analysis, primarily for filtering the canopy with low-temperature sensors resolution from common multispectral cameras (< 3MP), but also in data interpretation due to the complex structure of the canopy as a consequence of the copresence of leaves fully exposed to the sun, partially or completely shaded, turned leaves and gaps that reveal the soil below. A limitation of the work is the identification of sample plants used to evaluate the performance of the methodology in the estimation of productive and vegetative parameters. In particular, the plants were not chosen spatially distributed within the vineyard but concentrated in areas representative of spatial variability. Consequently, it was not possible to use a geostatic analysis approach, however this study aims to evaluate traditional statistical methods of comparison commonly used in agronomic experiments.