Abstract
The productivity of beans is greatly influenced by the different edaphoclimatic conditions in the Agreste-Sertão region, requiring the identification of adapted and stable genotypes to minimize the effects of the interaction between genotypes per environments (GxE). The objective of this work was to analyze the adaptability and stability of carioca bean pre-cultivars in three municipalities in the Agreste-Sertão of Pernambuco using the AMMI model in its Bayesian version BAMMI and compare the results with the frequentist approach. According to the results, the BAMMI analysis showed better predictive capacity, as well as better performance in the study of adaptability and stability. The cultivar BRS Notável stood out in terms of main effect and stability. Adaptability of genotypes to specific locations was also observed, enabling the use of the positive effect of the GxE interaction, which was more evident with the BAMMI model. From this work, the flexibility of BAMMI model to deal with data resulting from multi-environmental experiments can be seen, overcoming limitations of the standard analysis of the AMMI model.
Similar content being viewed by others
Introduction
The common bean (Phaseolus vulgaris L.) is a legume highly valued and cultivated worldwide, mainly due to its high nutritional quality, being an important source of protein in human food1. In Brazil, this culture has great socioeconomic importance and stands out in subsistence agriculture, being cultivated in different environments and at different technological levels of production2.
Brazil occupies the first and third place in the world ranking of consumption and production of beans, respectively, with annual production of 2.89 million tons. In most states of the country, preference is observed for the carioca commercial group, which represents about 60% of national consumption3.
In the Agreste-Sertão of Pernambuco, bean cultivation is carried out in several municipalities, which cover different edaphoclimatic conditions and, consequently, influence directly on productivity4. In this sense, the interaction between genotypes x environments (GxE) is an important challenge for breeders in the evaluation of pre-cultivars since the environment can mask the true potential of the genotype5.
Minimization of the effects of the GxE interaction can be achieved by identifying the most stable genotypes (with wide recommendation) and/or by identifying the adaptability of genotypes to specific environments. The evaluation of genotypic stability and adaptability can be conducted using different statistical methodologies6,7,8.
The AMMI (Additive Main effects and Multiplicative Interaction analysis) model is one of the most popular frequentist methods for analyzing genotype responses in various environments8,9,10. This model offers several advantages, among which its good predictive capacity and the possibility of graphically describing the effect of the interaction in biplots stand out, making data interpretation simpler11,12,13.
Despite the advantages offered, the classic AMMI analysis has limitations that restrict or make its use unfeasible, such as the impossibility of working with unbalanced and/or heteroscedastic data sets, the requirement to work with these effects as fixed, e a dificuldade em construir estatísticas inferenciais para os parâmetros bilineares da interação GxE9.
It is worth mentioning that there are methods that circumvent the difficulties related to the data structure, using decomposition by singular value, such as imputation of missing values, corrections in the averages of the little houses and weighting in relation to the environments14,15. However, some of these procedures are susceptible to criticism and can lead to loss of information12,16.
To circumvent such problems, an alternative that has proved to be relevant is the application of the Bayesian approach to the AMMI model (Bayesian-AMMI or BAMMI). The BAMMI model offers flexibility in analyzing experimental data under different conditions, demonstrating viability for use in balanced data10,17, unbalanced9, heteroscedastic16, with a posteriori fixed effect for all parameters18 and with random effect for genotypes and/or singular values11,19.
The Bayesian approach allows the incorporation of prior information, when it is available, enabling greater efficiency in the analysis16,20,21,22. However, there are still few studies exploring the predictive power of BAMMI in relation to frequentist AMMI.
Romão et al.8 investigated the predictive power between AMMI via EM algorithm (EM-AMMI), Bayesian AMMI with homogeneity (BAMMI) and heterogeneity of variances (BAMMI-H) and the Analytical Factorial (FA) model, but using simulated data. Thus, this work is a pioneer in the investigation of the predictive capacity of BAMMI in relation to the frequentist AMMI using real data. Furthermore, it innovates by employing the ammiBayes statistical package, recently developed by Oliveira et al.23, for the inference process with the BAMMI model.
Therefore, this work aimed to analyze the adaptability and stability of carioca bean pre-cultivars in the Agreste-Sertão of Pernambuco using the AMMI model (Additive Main effects and Multiplicative Interaction analysis) in its Bayesian version BAMMI (Bayesian AMMI) and comparing the results with the frequentist approach.
Material and methods
Sample data and experimental conditions
Grain productivity data (kg ha-1) are from competition trials of common bean pre-cultivars of the carioca type, performed in three municipalities in the Agreste-Sertão of Pernambuco (Table 1), in the period from May to August of the years 2014 and 2015, conducted by the Pernambuco Agronomic Institute (IPA) and the Federal Rural University of Pernambuco (UFRPE). These municipalities have different climatic conditions, incurring a greater representation of the region (Fig. 1).
The genotypes used consisted of 10 lineages and four cultivars, from the National Rice and Beans Research Center (CNPAF) from the Brazilian Agricultural Research Corporation (Embrapa Rice and Beans) (Table 2). The cultivars were used as witnesses in the experimental tests since they are recommended for cultivation in the state of Pernambuco.
The experimental design was in randomized blocks, with three replications. The experimental plots consisted of four rows spaced 0.5 × 0.1 m between and inside, respectively, and the useful area consisted of the two central rows.
Soil preparation was done in a conventional way, with chemical fertilization with 40 kg ha-1 of N, 60 kg ha-1 of P2O5 and 30 kg ha-1 of K2O before the implementation of the tests. Irrigation was performed when necessary for cultivation in Belém de São Francisco, using a conventional sprinkler system, because in the others the rainfed system predominated. Weed plants control was done by hand weeding and for pest control, application of Metamidophos Fersol (600 at a dose of 0.5 L ha-1) was promoted. The measurement of the grain productivity variable grain productivity experiment, and the harvest was carried out 90 days after planting, in the R9 phase.
Statistical analysis
The AMMI model in matrix notation, as presented in Oliveira et al.10, is given by:
being \({\mathbf{y}}_{n \times 1}\) the vector composed of phenotypic response, where \(l\), r and c denote, respectively, the number of repetitions, the number of genotypes and the number of environments. The vectors \({\varvec{\beta}}_{cl\, \times 1}\) and \({\varvec{g}}_{r \times 1}\) contain the effects parameters of hierarchical blocks within environments and main effects of genotypes, respectively.
The terms \({\lambda }_{k}\), \({\boldsymbol{\alpha }}_{{\varvec{k}}}\) and \({{\varvec{\gamma}}}_{{\varvec{k}}}\) are the multiplicative or bilinear components of model (1) and denote, respectively, the singular value and the singular vectors associated with the k-th principal component, with \(k = \,1,\,...,\,t\) being \(t = \min \,(r,\,c)\) the rank of the "GxE" interaction matrix. Bilinear components are also subject to order restrictions (\(\lambda_{1} > \lambda_{2} > \cdots > \lambda_{t}\)) and orthonormalization \(\left({\boldsymbol{\alpha }}_{k}^{\top }{\boldsymbol{\alpha }}_{k}={{\varvec{\gamma}}}_{k}^{\top }{{\varvec{\gamma}}}_{k}=1 e{\boldsymbol{ }\boldsymbol{\alpha }}_{k}^{\top }{\boldsymbol{\alpha }}_{{k}^{^{\prime}}}={{\varvec{\gamma}}}_{k}^{\top }{{\varvec{\gamma}}}_{{k}^{^{\prime}}}=0;k\ne {k}^{^{\prime}}\right)\).
The matrices \({\mathbf{X}}_{1}\), \({\mathbf{X}}_{2}\) and \({\mathbf{Z}}\) are design matrices associated with \({\varvec{\beta}}\), \({{\varvec{\upgamma}}}_{k}\) and \({\varvec{g}}\), respectively. The term \({\varvec{\varepsilon}}\) is the error vector, with \(\user2{\varepsilon \sim }N_{n} ({\mathbf{0}},\,\sigma_{e}^{2} {\mathbf{I}}_{n} )\), where \({\mathbf{0}}\) represents the null vector and \({\mathbf{I}}_{n}\) the identity matrix of order n. The conditional distribution of \({\mathbf{y}}\) is multivariate normal, that is, \({\varvec{y}}|{\varvec{\alpha}},{\varvec{\gamma}},\lambda , {\varvec{g}},{\varvec{\beta}},\sigma_{e}^{2} \sim N_{n} \left( {{{\varvec{\upmu}}},{\mathbf{I}}_{n} \sigma_{e}^{2} } \right)\) with \({{\varvec{\upmu}}} = {\mathbf{X}}_{1} {\varvec{\beta}} + {\mathbf{Z}}{\varvec{g}} + \mathop \sum \limits_{k = 1}^{t} \lambda_{k} diag({\mathbf{Z}}{\varvec{\alpha}}_{k} ){\mathbf{X}}_{2} {\varvec{\gamma}}_{k}\), being \(\sigma_{e}^{2}\) the residual variance.
A priori distributions and posterior conditional distributions
The a priori distributions used for the parameters of model (1) are the same used by Oliveira et al.10:
\({{\varvec{\upbeta}}}{|}\,{{\varvec{\upmu}}}_{{{\varvec{\upbeta}}}} ,\,\sigma_{{{\varvec{\upbeta}}}}^{2} \sim \,N\left( {{{\varvec{\upmu}}}_{\beta } ,\,\sigma_{{{\varvec{\upbeta}}}}^{2} } \right)\,\);
\({\mathbf{g}}|{{\varvec{\upmu}}}_{{\mathbf{g}}} ,\sigma_{{\mathbf{g}}}^{2} \sim N({{\varvec{\upmu}}}_{{\mathbf{g}}} ,{\mathbf{I}}\sigma_{{\mathbf{g}}}^{2} )\);
\(\lambda_{k} |\mu_{{\lambda_{k} }} ,\sigma_{{\lambda_{k} }}^{2} \sim N^{ + } (\mu_{{\lambda_{k} }} ,\sigma_{{\lambda_{k} }}^{2} )\);
\({\varvec{\alpha}}_{k}\) ~ uniform spherical distribution;
\({\varvec{\gamma}}_{k}\) ~ uniform spherical distribution;
For Gaussian distributions, a priori information can be incorporated by assigning values for average and variance. In order to incorporate minimum information, it was considered \({{\varvec{\upmu}}}_{{{\varvec{\upbeta}}}} = {\mathbf{0}}\), \(\mu_{{\lambda_{k} }} = 0\), \(\sigma_{\beta }^{2} = 10^{8}\) and \(\sigma_{{\lambda_{k} }}^{2} = 10^{8}\). To the singular vectors \({{\varvec{\upalpha}}}_{k}\) and \({{\varvec{\upgamma}}}_{k}\) uniform spherical a priori densities were assigned in the corrected subspace, which are uninformative25 and for the experimental variance an a priori of Jeffrey was assigned \(\sigma_{e}^{2} = {1 \mathord{\left/ {\vphantom {1 {\sigma_{e}^{2} }}} \right. \kern-0pt} {\sigma_{e}^{2} }}\). For the genotype effect, \({{\varvec{\upmu}}}_{{\mathbf{g}}} = {\mathbf{0}}\) and \(\sigma_{g}^{2} \sim {1 \mathord{\left/ {\vphantom {1 {\sigma_{g}^{2} }}} \right. \kern-0pt} {\sigma_{g}^{2} }}\) were considered, obtaining a posteriori random effect for genotypes. This model is the same used by Oliveira et al.10, being referred to by BAMMI (Bayesian-AMMI).
Complete conditional distributions are described in detail in Oliveira et al.10 or Silva et al.19 and are the following:
\({{\varvec{\upbeta}}}| \ldots \sim N\left[ {\left( {{\mathbf{X}}_{1}^{{{\top}}} {\mathbf{X}}_{1} + {\mathbf{I}}\frac{1}{{\sigma_{{{\varvec{\upbeta}}}}^{2} }}} \right)^{ - 1} {\mathbf{X}}_{1}^{{{\top}}} {\mathbf{M}}_{{{\varvec{\upbeta}}}} ,\;\left( {{\mathbf{X}}_{1}^{{{\top}}} {\mathbf{X}}_{1} + {\mathbf{I}}\frac{1}{{\sigma_{{{\varvec{\upbeta}}}}^{2} }}} \right)^{ - 1} } \right]\), where \(\sigma_{{{\varvec{\upbeta}}}}^{2} = 10^{8}\) and.
\({\mathbf{M}}_{{{\varvec{\upbeta}}}} = {\mathbf{y}} - {\mathbf{Zg}} - \sum\nolimits_{k = 1}^{t} {\lambda_{k} diag({\mathbf{Z\alpha }}_{k} ){\mathbf{X}}_{2} {{\varvec{\upgamma}}}_{k} }\).
\({\mathbf{g}}|...\sim N\left[ {\left( {{\mathbf{Z}}^{{{\top}}} {\mathbf{Z}} + {\mathbf{I}}\frac{1}{{\sigma_{{\mathbf{g}}}^{2} }}} \right)^{ - 1} {\mathbf{Z}}^{{{\top}}} {\mathbf{M}}_{{\mathbf{g}}} \,,\,\,\,\,\left( {{\mathbf{Z}}^{{{\top}}} {\mathbf{Z}} + {\mathbf{I}}\frac{1}{{\sigma_{{\mathbf{g}}}^{2} }}} \right)^{ - 1} } \right]\)\({\mathbf{M}}_{{\mathbf{g}}} = {\mathbf{y}} - {\mathbf{X}}_{1} {{\varvec{\upbeta}}} - \sum\nolimits_{k = 1}^{t} {\lambda_{k} diag({\mathbf{Z\alpha }}_{k} ){\mathbf{X}}_{2} {{\varvec{\upgamma}}}_{k} }\) where:
\({ }\lambda_{k} | \ldots \sim N^{ + } \left[ {\left( {{{\varvec{\Lambda}}}_{k}^{{{\top}}} {{\varvec{\Lambda}}}_{k} + \frac{{\sigma_{e}^{2} }}{{\sigma_{{\lambda_{k} }}^{2} }}} \right)^{ - 1} {{\varvec{\Lambda}}}_{k}^{{{\top}}} {\mathbf{\rm M}}_{k^{\prime}}^{{{\top}}} , \,\,\left( {{{\varvec{\Lambda}}}_{k}^{{{\top}}} {{\varvec{\Lambda}}}_{k} } \right)^{ - 1} \sigma_{e}^{2} } \right]\) being \({{\varvec{\Lambda}}}_{k} = diag({\mathbf{Z\alpha }}_{k} ){\mathbf{X}}_{2} {{\varvec{\upgamma}}}_{k}\);\({\mathbf{\rm M}}_{k^{\prime}}^{{}} = {\mathbf{y}} - {\mathbf{Zg}} - {\mathbf{X}}_{1} {{\varvec{\upbeta}}} - \mathop \sum \limits_{k^{\prime} \ne k}^{t} \lambda_{k^{\prime}} diag({\mathbf{Z\alpha }}_{k^{\prime}} ){\mathbf{X}}_{2} {{\varvec{\upgamma}}}_{k^{\prime}}\) and \(\lambda_{1} \ge \ldots \ge \lambda_{t} \ge 0.\)
being \({{\varvec{\Lambda}}}_{{{{\varvec{\upalpha}}}_{k} }} = diag\left( {{\mathbf{X}}_{2} {{\varvec{\upgamma}}}_{k} } \right){\mathbf{Z}}\), and
being \({{\varvec{\Lambda}}}_{{{{\varvec{\upgamma}}}_{k} }} = diag\left( {{\mathbf{Z\alpha }}_{k} } \right){\mathbf{X}}_{2}\).
The a posteriori densities for the singular vectors are proportional to spherical distributions of the von Mises-Fisher type (vMF). Due to the orthogonality constraints of the singular vectors the a posteriori distribution for these parameters is non-trivial and sampling must be performed by auxiliary variables in the corrected subspace. Details on this process are presented in Viele and Srinivasan25, Liu26 and Oliveira et al.10.
Finally, the complete a posteriori conditional distribution for the residual variance is inverse scaling chi-square:
MCMC sampling, model selection and comparison
The Markov chains, as well as the entire inference process with the BAMMI model, were obtained using the ammiBayes package23. The sampling of parameters was conducted using a Gibbs sampler and a description of the iterative algorithm can be found in Oliveira et al.10. The convergence of the produced chains was monitored by the criteria of Raftery and Lewis27 and Heidelberger and Welch28, using the library Bayesian Output Analysis (BOA)29.
Model selection was performed using BIC30, AIC31 and AICM32 information criteria. The estimates for the univariate parameters of the model were obtained by a posteriori averages of the MCMC chains of regions of maximum a posteriori density (HPD), built using the BOA package29. Bivariate regions at 95% credibility, for genotypic and environmental scores in the biplot representation, were implemented using the method of Hu and Yang33.
The classic AMMI (frequentist approach) was adjusted for comparison purposes. minimum squares method for the estimation of the effects and the multiplicative terms of the interaction, estimated from the singular value decomposition (DVS)34. Model selection was performed using Cornelius' Fr test35.
Random imbalances were performed in the data, with levels of 10% and 20% loss of genotypes and environments, to verify the efficiency of the methods, and later the analyzes were performed. As the analyzes were conducted in two imbalance scenarios the EM (expectation–maximization) algorithm was used to impute the missing values in the AMMI-Classic approach36.
An assessment of predictive ability was performed for the BAMMI and the AMMI frequentist models using cross-validation procedures. To compare the two models, the correlation between predicted and observed values (Cor), Spearman's Correlation (CorS) and PRESS (Prediction Error Sum Square) was used. Statistical analyzes were performed considering each location-year combination as an environment and were performed using the R statistical software37.
Results
AMMI-classic analysis
The joint analysis showed a significant effect of the interaction, indicating non-additivity of the main effects, justifying the application of the AMMI analysis (Table 3).
According to Cornelius' Fr test, the model that best described the data set was AMMI-4, that is, the model that presents four main components retained to explain the effect of the GxE interaction. In this model, 98% of the variation of the GxE interaction is explained (Table 4).
Although the AMMI-4 model was the one that best fit the data, in the analysis of adaptability and stability, the most common is the use of the AMMI-2 biplot, since in the first axes are the highest percentages referring to the interaction pattern, with less influence of noise, even when they explain a lower percentage of the sum of squares38. The AMMI-2 biplot representation of the genotypic and environmental scores is shown in Fig. 2.
The lineages CNFC 15460 (G3), CNFC 15462 (G4), CNFC 15504 (G8), CNFC 15507 (G9) and the cultivar BRS Pérola (G14), were closer to the origin and, therefore, are the ones that contributed the least to the interaction effect, demonstrating wide adaptability, as well as the environments Arcoverde-2015 (E4) and São João-2015 (E6). On the other hand, IPR 39 (G1), CNFC 15513 (G10) and BRS Estilo (G12) were further away from the center of the biplot and that is why they are considered the most unstable (Fig. 2). The smallest contribution visualized in the interaction axis points to greater stability and indicates that the average productivity of these genotypes is little variable, depending on locations and years39.
The behavior of some genotypes within the same location was incongruous between the two years of evaluation. This can be visualized by taking as an example the Belém de São Francisco municipality, in which specific adaptability of the genotypes CNFC 15458 (G2) and BRS Notável (G13) was suggested in the year 2014 (E2), whereas in 2015 (E5) this was observed in relation to another material, the BRS Estilo (G12) (Fig. 2).
In relation to the Arcoverde municipality, the genotype CNFC 15497 (G7) showed specific adaptability, in the year 2014 (E1), while in 2015 (E4) there was no significant contribution from the environment to the interaction (Fig. 2).
The genotypes CNFC 15475 (G5) and CNFC 15513 (G10) showed specific adaptability to the municipality São João in 2014 (E3), which in turn did not contribute significantly to the interaction in 2015 (E6). In addition to the stable genotypes, CNFC15513 (G10) would be the most interesting for this specific location in terms of recommendation (Fig. 2).
AMMI-bayesian analysis
For all MCMC chains sampled, good convergence properties were observed from the criteria used, since all model parameters had a dependency factor lower than 527 and passed the stationarity test28.
In the graphs of traces and densities of the MCMC chains, it was observed that the distributions were stationary, corroborating the results obtained by the applied convergence tests. For examples, Fig. 3 shows the plot of traces for the genotypic and residual variance of the BAMMI-1 model, which was the model that best fitted the data in the selection stage.
In Table 5, point and interval estimates related to singular values are presented for BAMMI models as a function of the number of bilinear components retained. Based on the complete BAMMI-5 model, it can be stated that the first axis explains 42.8% of all the variation in the interaction.
Figure 4 shows the results of the information criteria applied in the model selection stage. For all the criteria used, the model that best described the dataset was BAMMI-1, that is, the one that presented the lowest value of AIC, BIC and AICM. The choice based on these criteria focuses on more parsimonious and robust models according to the different a priori distributions12.
Table 6 presents point and interval estimates for genotype effects referring to BAMMI, as well as the minimum squares estimates obtained by the frequentist AMMI adjustment for comparison purposes. As can be seen, in general, the predictions of the BAMMI model are slightly smaller than the respective minimum squares solutions. The genotypes that stood out the most regarding the main effect were CNFC 15480 (G6), BRS Estilo (G12) and BRS Notável (G13).
Overlaps between the HPD intervals suggest similar effects, however the lineage CNFC 15480 (G6) and the cultivars BRS Estilo (G12) and BRS Notável (G13) did not include negative values in their credibility regions and, therefore, are the most interesting in terms of the main effect (Table 6). Positive HPD values indicate that these genotypes contributed the most to the population average and consequently are the ones with the highest productivity.
The adaptability and stability analysis is performed by biplot interpretation for genotypic and environmental scores. Genotypes and environments whose credibility regions for the scores encompassed the origin are considered stable11. Thus, the lineages CNFC 15460 (G3), CNFC 15462 (G4), CNFC 15497 (7), CNFC 15504 (G8), CNFC 15507 (G9), and cultivars BRS Notável (G13) and BRS Pérola (G14) make up a homogeneous subgroup of genotypes that did not significantly contribute to the GxE interaction, indicating wide adaptation (Fig. 5a).
To simplify the interpretations, only the credibility regions that do not encompass the origin (0,0) were represented, and the respective genotypes (or environments) are considered not stable, in other words, they have a significant contribution to interaction (Fig. 5b).
The bivariate regions of credibility for the genotypic and environmental scores, implemented in the biplot, are used to analyze the effect of the GxE interaction, in which overlaps between them are used to interpret the adaptability and stability of the materials.
The visual analysis of the regions in the biplot allows us to suggest adaptability of genotypes to specific locations, as is the case of the cultivar IPR 139 (G1) to the Arcoverde municipality (E1,E4), from the genotypes CNFC 15480 (G6) and CNFC 15534 (G11), to the Belém de São Francisco municipality (E5 and E2), and from the genotype CNFC 15513 (G10) to the São João municipality (E3, E6) (Fig. 5b).
Predictive evaluation of models
The results of the prediction analysis for the BAMMI and EM-AMMI models are shown in Figs. 5 and 6, for unbalance of 10% and 20%, respectively. The best performance of the model is indicated by the lowest value in relation to the PRESS criterion, and by the highest value in relation to Pearson's correlation (Cor) and Spearman's correlation (CorS)12.
The graphical analyzes demonstrate that the predictive accuracy between the models differed between the imbalance scenarios, however, both showed an advantage for the BAMMI model in all cross-validation criteria (Figs. 6 and 7), being more accentuated in the scenario of lower imbalance (Fig. 6). Similar results were observed by Romão et al.8, who identified the advantage of Bayesian models in unbalanced scenarios.
Discussion
AMMI-classic analysis
The distinction of genotypes that showed specific interaction with the municipality of Belém de São Francisco it may have occurred due to operational problems during the the management of crops. It is worth noting that the average yields in Belém de São Francisco in both years were the highest among the other environments (Table 7), fact possibly linked to the number of genotypes with specific adaptability to this municipality and to having an irrigation system. A similar consideration can be made in relation to the Arcoverde municipality.
The São João municipality (E3 and E6) had the lowest average productivity among the environments (Table 7), however, it is possible to visualize in the biplot that it was the municipality that presented the greatest consistency in relation to the effect of the GxE interaction in the years considered (Fig. 2). The lower averages can be explained by the low rainfall (Table 1 and Fig. 1), since the experiments at this location were conducted under rainfed conditions, furthermore, the municipalities have different types of soil and the cultivation in São João depends, essentially, of the amount of organic matter put in each year.
Regarding the Arcoverde municipality (E1 and E4), although consistency has not been maintained over the years, it is possible to observe that the genotype G1 could be indicated and would allow the exploration of the positive effect of the GxE interaction. The Belém de São Francisco municipality (E2 and E5) was the one that showed the greatest inconsistency, in relation to GxE interaction, in the two years evaluated, evidencing the performance of the complex type of interaction. Even so, genotypes G6 and G11 would be those with greater adaptability to this location (Fig. 2).
It is common for edaphoclimatic variations between different years to cause fluctuations in genotypic and environmental scores, masking the graphic patterns identified in the biplot, and therefore it is difficult to observe genotypes that maintain their performance for the same location in different years. Thus, the variation between the interaction effects in different years for a specific location may be like the effects for different locations, due to changes between years9.
The recommendation of cultivars with specific adaptation characterizes an efficient way that breeders use to take advantage of the GxE interaction, benefiting environments with conditions normally unfavorable to cultivation, as well as promoting an increase in average productivity in environments with good cultivation conditions, when using materials that could make the most of the interaction effect40,41.
The heterogeneity of the municipalities of Agreste-Sertão of Pernambuco directly influences productivity, promoting and justifying the variation in average productivity observed between environments, as observed in the work of Souza et al.42,Santos et al.43,Santos et al.44 and Lima et al.1. This heterogeneity, in addition to being confirmed in the analysis of variance (Table 3), it is evident in the description of the edaphoclimatic characteristics of each municipality (Table 1). It is worth noting that the evaluated pre-cultivars showed average productivity above the state (464.12 kg ha-1), regional (397.29 kg ha-1) and national (990 kg ha-1) averages3.
AMMI-bayesian analysis
The predictions of the BAMMI model slightly smaller than the respective minimum squares solutions (Table 6) is due to the shrinkage effect when assuming a common population for genotypes (random effects), in addition to being a common feature of Bayesian methods, corroborating what was observed by Oliveira et al.11 and Oliveira et al.12.
When considering the effects of genotypes as random, it allows for variance components to be estimated and for kinship coefficients to be incorporated in the AMMI analysis. This leads to more accurate estimates of the genetic values of the genotypes and, consequently, it offers more realistic estimates of the true genotypic value.
The stage of evaluation and selection of pre-cultivars in competition trials requires both the skill of the breeder as well as statistical tools that allow the correct interpretation of phenotypic data. In this sense, the use of shrinkage effect predictors is an excellent choice for plant breeding programs, because they are more accurate and facilitate the classification of genetic materials according to their genotypic values, minimizing the residual effects of environments present in the data12,19.
The analysis of the main effect of the genotype is very important for the selection and recommendation of cultivars, but it is essential that the effects of the GxE interaction are jointly considered, to evaluate the adaptability and stability of the materials19.
The credibility regions obtained by the BAMMI model promote the elimination of subjective average scores close to the central point of the biplot, leading to greater precision to infer about genotypic and environmental stability45. This is evident in the interpretation of the biplots (fixed and Bayesian), in which, in the Bayesian analysis, the environments Arcoverde-2015 (E4) and São João-2015 (E6) do not significantly contribute to the interaction (Fig. 5b).
Among the genotypes that stand out in terms of the main effect only BRS Notável can be considered stable and, therefore, has a wide recommendation (Fig. 5b). This interpretation is not direct with respect to the biplot of the frequentist model and would be a risky assumption in the context of classical analysis. However, using the 95% credibility level of the Baysean model, it is possible to safely identify G13 as stable, that is, this genotype has no important contribution to the GxE interaction.
The cultivar BRS Notável was used as a witness in the experiments because it has a recommendation for cultivation in the state of Pernambuco, in addition to other Brazilian states, as well as the other three cultivars mentioned in this work. However, BRS Notável was the only one that combined positive genotypic effect and stability, reinforcing its viability and potential for the Agreste-Sertão region of Pernambuco.
Credible regions for genotypic and environmental scores ensure that discrimination of genotypes and environments is performed with greater precision, grouping genotypes and environments into homogeneous subgroups that show the same patterns of interaction. The selection of genotypes with an associated level of credibility helps plant breeders in decision making, leading to the recommendation of more consistent cultivars, and reducing the costs of breeding programs9,11.
Generally, genotypes that have less than zero main effects been not recommended, given that they have a response below the population average9. However, in relation to the pre-cultivars evaluated in this work, it can be seen that all had an average productivity significantly higher than the regional average (397.29 kg ha-1), as shown in Table 8, it is therefore pertinent to indicate these materials for the places that had good interaction, according to the adaptability analysis through the credibility regions.
Overlaps between regions indicate homogeneous subgroups in terms of the interaction effect, demonstrating that the genotypes present similar behavior in the studied environments. Thus, bivariate regions are an excellent tool for recommending genotypes to specific environments7.
The lineage CNFC 15480 (G6) deserves to be highlighted because it presented a positive genotypic effect (Table 6) and productivity slightly higher than that of the witnesses (Table 8), except for BRS Estilo, corroborating its indication for the municipality of Belém de São Francisco, as it demonstrated specific adaptability for that location (Fig. 5b).
The configurations of the midpoints of genotypic and environmental scores in the biplots of the AMMI frequentist and AMMI Bayesian models were similar. However, the analysis of the credibility regions by the biplot of the BAMMI model provided a clearer organization of the specific interactions of the genotypes with certain environments, allowing you to relate materials more firmly to specific locations, despite the differences perceived in the different years, which were probably caused by the edaphoclimatic variations of the municipality. Furthermore, the incorporation of inference to the biplot allowed us to identify environments and genotypes that do not have important contributions to the GxE interaction.
The Agreste-Sertão of Pernambuco presents great climatic variations between harvests; thus, the same locality can vary a lot in different years. However, the BAMMI method showed efficiency in grouping genotypes with specific adaptation to certain locations, even in different years, corroborating the results obtained by Bernardo Júnior et al.9 who verified the good ability of this method to discriminate the environments within the different seasons.
Predictive evaluation of models
In general, the EM-AMMI model showed good performance in slightly accentuated imbalance scenarios, such as those used in this work, showing good ability to predict the GxE interaction, without requiring sophisticated statistical methods, high computational demand, or estimation of complex parameters. According to Paderewski and Rodrigues36, the EM algorithm is great for imputing missing data, since its iterative process is done in the GxE matrix, converging the imputation of the interaction in a few steps, contributing to the quality of the fit, which could justify the good performance of the EM-AMMI models in scenarios with losses of up to 33%.
Both models showed that they are robust to data loss in the analyses, however, the BAMMI model was superior in predictive accuracy and shows great potential of the technique in the study of GxE interaction and in the prediction of missing genotypes. The need to deal with unbalanced data is a recurring inconvenience in adaptability and stability studies because environmental variations and other unpredictable factors in field experiments often lead to plot loss. BAMMI analysis naturally handles unbalanced datasets and still allows dealing with uncertainty in biplots, being highly indicated for genotype evaluation9.
Compared to conventional methods, the BAMMI methodology has the disadvantage of being more time consuming and requiring greater demand for computational resources, but the greater precision in the inference process, added to the other advantages mentioned throughout this work, offered by this method, justify its use. The Bayesian analysis of multiplicative models has gained more space in the analysis of multi-environmental data, due to its benefits and because of the computational progress that has mitigated the difficulty of performing the analyzes10.
Reinforcing the advances that facilitate Bayesian analysis, emphasizing the BAMMI model, it is worth noting that the ammiBayes package, recently developed by Oliveira et al.23 and pioneered in this work, allows the user of the statistical program to perform the entire inference process in a reasonably simple and practical way.
Conclusions
The cultivar BRS Notável (G13) is able to capitalize on positive interactions with the environments, being classified as stable and recommended with wide adaptation for the Agreste-Sertão of Pernambuco; Arcoverde and São João are the municipalities that present greater stability for the effects of the GxE interaction, evidenced by the BAMMI analysis through the credibility regions.
The recommendation of cultivar IPR 139 (G1) allows exploring the positive effect of GxE in Arcoverde, while in Belém de São Francisco the pre-cultivars CNFC 15480 (G6) and CNFC 15534 (G11) are used; in São João, the genotype CNFC 15513 (G10) shows greater adaptability.
The BAMMI analysis is more capable of identifying environments and genotypes that do not have important contributions to the GxE interaction; this model exhibits superior predictive ability compared to the EM-AMMI model, for unbalance scenarios up to 10% and 20%.
Data availability
All data generated or analyzed during this study are included in this published article.
References
Lima, T. V. et al. Adaptability and stability of early carioca beans by mixed models. Biosci. J. 36, 173–182. https://doi.org/10.14393/BJ-v36n0a2020-48288 (2020).
Euzebio, M. P. et al. Adaptability and stability assessment of bean cultivars of the carioca commercial group by a Bayesian approach. Acta Sci. Agron. 40, e35272. https://doi.org/10.4025/actasciagron.v40i1.35272 (2018).
CONAB – Companhia Nacional de Abastecimento. Acompanhamento da safra brasileira – grãos – safra 2021/2022 – 2º levantamento, nov. 2021, v. 9, n. 2. CONAB, Brasília, 93p. (2021).
Rocha, R. S. et al. Estimation of adaptability and stability of early strains of ‘Carioca’ and black beans (Phaseolus vulgaris) in Pernambuco State, Brazil. Aust. J. Crop Sci. 14, 263–270 (2020).
Costa Neto, G. M. F. et al. A novel GIS-based tool to reveal spatial trends in reaction norm: upland rice case study. Euphytica 216, 1–16. https://doi.org/10.1007/s10681-020-2573-4 (2020).
Freiria, G. H. et al. Statistical methods to study adaptability and stability in breeding lines of food-type soybeans. Bragantia 77, 253–264. https://doi.org/10.1590/1678-4499.2017076 (2018).
Nuvunga, J. et al. Stability and adaptability of elite upland rice lines using Bayesian-AMMI model. Aust. J. Crop Sci. 15, 244–250 (2021).
Romão, R. F. et al. Predictive ability of AMMI and factorial analytical models in the study of unbalanced multi-environment data. Genet. Mol. Res. 18, 1–20. https://doi.org/10.4238/gmr18176 (2019).
Bernardo Júnior, L. A. Y. B. et al. AMMI Bayesian models to study stability and adaptability in maize. Agron. J. 110, 1765–1776. https://doi.org/10.2134/agronj2017.11.0668 (2018).
Oliveira, L. A. et al. Credible intervals for scores in the AMMI with random effects for genotype. Crop Sci. 55, 465–476. https://doi.org/10.2135/cropsci2014.05.0369 (2015).
Oliveira, L. A. et al. Performance of cowpea genotypes in the Brazilian midwest using the bayesian additive main effects and multiplicative interaction model. Agron. J. 110, 147–154. https://doi.org/10.2134/agronj2017.03.0183 (2018).
Oliveira, L. A. D. et al. Shrinkage in the Bayesian analysis of the GGE model: A case study with simulation. PloS one https://doi.org/10.1371/journal.pone.0256882 (2021).
Silva, A. et al. Aplicação do modelo AMMI-bayesiano no estudo de estabilidade e adaptabilidade genotípica em dados de mostarda. Res. Soc. Dev. 9, e166997023. https://doi.org/10.33448/rsd-v9i9.7023 (2020).
Rodrigues, P. C. et al. A weighted AMMI algorithm to study genotype-by-environment interaction and QTL-by-environment interaction. Crop Sci. 54, 1555–1570. https://doi.org/10.2135/cropsci2013.07.0462 (2014).
Yan, W. Crop variety trials: Data management and analysis (John Wiley & Sons, 2014).
Silva, C. P. et al. Heterogeneity of variances in the bayesian AMMI model for multienvironment trial studies. Crop Sci. 59, 2455–2472. https://doi.org/10.2135/cropsci2018.10.0641 (2019).
Crossa, J. et al. Bayesian estimation of the additive main effects and multiplicative interaction model. Crop Sci. 51, 1458–1469 (2011).
Josse, J. et al. Another look at Bayesian analysis of AMMI models for genotype-environment data. J. Agric. Biol. Environ. Stat. 19, 240–257. https://doi.org/10.1007/s13253-014-0168-z (2014).
Silva, C. P. et al. A bayesian shrinkage approach for AMMI models. PLoS ONE 10, 1–27. https://doi.org/10.1371/journal.pone.0131414 (2015).
Aczel, B. et al. Discussion points for Bayesian inference. Nat. Hum. Behav. 4, 561–563 (2020).
Nascimento, M. et al. Bayesian segmented regression model for adaptability and stability evaluation of cotton genotypes. Euphytica 216, 1–10. https://doi.org/10.1007/s10681-020-2564-5 (2020).
Perez-Elizalde, S. et al. A general Bayesian estimation method of linear–bilinear models applied to plant breeding trials with genotype x environment interaction. J. Agric. Biol. Environ. Stat. 17, 15–37. https://doi.org/10.1007/s13253-011-0063-9 (2012).
Oliveira, L. A. et al. Bayesian AMMI for continuous data (Eduardo Mondlane University, and Rhodes University, 2022).
AGRITEMPO. Sistema de Monitoramento Agrometeorológico. Embrapa – Empresa de Pesquisa Agropecuária (2021).
Viele, K. & Srinivasan, C. Parsimonious estimation of multiplicative interaction in analysis of variance using Kullback–Leibler information. J. Stat. Plan. Inference 84, 201–219. https://doi.org/10.1016/S0378-3758(99)00151-2 (2000).
Liu, G. Bayesian computations for general linear-bilinear models. Thesis (Doctor of Philosophy), Lexington, 150 p. (2001).
Raftery, A. E. & Lewis, S. How many iterations in the Gibbs sampler? In Bayesian statistics (eds Bernardo, J. M. et al.) 763–773 (Oxford University, 1992).
Heidelberger, P. & Welch, P. Simulation run length control in the presence of an initial transient. Oper. Res. 31, 1109–1144. https://doi.org/10.1287/opre.31.6.1109 (1983).
Smith, B. J. Boa: An R package for MCMC output convergence assessment and posterior inference. J. Stat. Softw. 21, 1–37 (2007).
Schwarz, G. Estimating the dimension of a model. Ann. Stat. 6, 461–464 (1978).
Akaike, H. A new look at the statistical model identification. IEEE Trans Automat. Contr. 19, 716–723 (1974).
Raftery, A. E. et al. Estimating the integrated likelihood via posterior simulation using the harmonic mean identity. Bayesian statistics. Oxf. Univ. Press 8, 1–45 (2007).
Hu, Z. & Yang, R. Improved statistical inference for graphical description and interpretation of genotype times environment interaction. Crop Sci. 53, 2400–2410. https://doi.org/10.2135/cropsci2013.04.0218 (2013).
Zobel, R. W. et al. Statistical analysis of a yield trial. Agron. J. 80, 388–393. https://doi.org/10.2134/agronj1988.00021962008000030002x (1988).
Cornelius, P. L. et al. Using the shifted multiplicative model to search for “separability” in crop cultivar trials. Theor. Appl. Genet. 84, 161–172 (1992).
Paderewski, J. & Rodrigues, P. C. The usefulness of EM-AMMI to study the influence of missing data pattern and application to Polish post-registration winter wheat data. Astr. J. Crop Sci. 8, 640–645 (2014).
R Develoment Core Team. R: A language and environment for statistical computing. R Foundation for Statistical Computing, Vienna, Austria (2018).
Lavoranti, O. J. et al. Modelagem AMMI para estudos de interação em modelos estatísticos de efeitos fixos. Comun. Téc. 124 (2004).
Ferreira, A. A. & Hongyu, K. Avaliação de rendimento de genótipo de milho em multiambientes. Biodiversidade 17, 16–26 (2018).
Hongyu, K. et al. Statistical analysis of yeld trials by AMMI analysis of genotype x environment interaction. Biometric Lett. 51, 89–102 (2014).
Silveira, L. C. I. et al. AMMI analysis to evaluate the adaptability and phenotypic stability of sugarcane genotypes. Sci. Agric. 70, 27–32. https://doi.org/10.1590/S0103-90162013000100005 (2013).
Souza, Y. P. et al. Assessing the genotypic performance of carioca beans through mixed models. Ciência Rural 48, 1–7. https://doi.org/10.1590/0103-8478cr20170761 (2018).
Santos, P. R. et al. Simultaneous selection for yield, stability, and adaptability of carioca and black beans. Pesq. Agrop. Bras. 53, 736–745. https://doi.org/10.1590/S0100-204X2018000600010 (2018).
Santos, P. R. et al. GGE Biplot and REML/BLUP based-analysis of yield stability and adaptability for common beans in multi-environment trials. Rev. Bras. Ciênc. Agrár. 14, 1–6. https://doi.org/10.5039/agraria.v14i2a5657 (2019).
Zeffa, D. M. et al. Genetic progress of seed yield and nitrogen use efficiency of Brazilian carioca common bean cultivars using bayesian approaches. Front. Plant Sci. 11, 1168. https://doi.org/10.3389/fpls.2020.01168 (2020).
Acknowledgements
The authors are grateful to the Postgraduate Program in Agronomy—Plant Genetic Improvement (PPGAMGP), to the Pernambuco Agronomic Institute (IPA) for the experimental data and to the Coordination for the Improvement of Higher Education Personnel (CAPES) for granting the scholarship.
Author information
Authors and Affiliations
Contributions
A.F.C.: designed the experiments; L.A.O, C.P.S. and A.Q.S.: analyzed the data from the experiments; G.G.M.: wrote the draft manuscript; D.R.Q., J.W.S., M.R.N., P.R.S. and R.J.S.G.: revised the draft manuscript. All authors made contributions and approved the final manuscript.
Corresponding author
Ethics declarations
Competing interests
The authors declare no competing interests.
Additional information
Publisher's note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
de Melo, G.G., de Oliveira, L.A., da Silva, C.P. et al. AMMI-Bayesian perspective in the selection of pre-cultivars of carioca beans in Agreste-Sertão of Pernambuco, Brazil. Sci Rep 13, 4700 (2023). https://doi.org/10.1038/s41598-023-31768-5
Received:
Accepted:
Published:
DOI: https://doi.org/10.1038/s41598-023-31768-5
Comments
By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.