Impact of genetically engineered maize on agronomic, environmental and toxicological traits: a meta-analysis of 21 years of field data

Despite the extensive cultivation of genetically engineered (GE) maize and considerable number of scientific reports on its agro-environmental impact, the risks and benefits of GE maize are still being debated and concerns about safety remain. This meta-analysis aimed at increasing knowledge on agronomic, environmental and toxicological traits of GE maize by analyzing the peer-reviewed literature (from 1996 to 2016) on yield, grain quality, non-target organisms (NTOs), target organisms (TOs) and soil biomass decomposition. Results provided strong evidence that GE maize performed better than its near isogenic line: grain yield was 5.6 to 24.5% higher with lower concentrations of mycotoxins (−28.8%), fumonisin (−30.6%) and thricotecens (−36.5%). The NTOs analyzed were not affected by GE maize, except for Braconidae, represented by a parasitoid of European corn borer, the target of Lepidoptera active Bt maize. Biogeochemical cycle parameters such as lignin content in stalks and leaves did not vary, whereas biomass decomposition was higher in GE maize. The results support the cultivation of GE maize, mainly due to enhanced grain quality and reduction of human exposure to mycotoxins. Furthermore, the reduction of the parasitoid of the target and the lack of consistent effects on other NTOs are confirmed.

Numerous attempts have been carried out to synthesize the huge literature on agronomic and economic performance and environmental impact of GE maize (e.g., [6][7][8][9][10][11][12][13]. However, these studies, mostly literature reviews, do not allow us to draw univocal conclusions. To date, a few meta-analyses have been performed on GE maize at farm and field level addressing questions concerning yield, production cost and gross margin terms [14][15][16] , pesticide use 16 , and effects on non-target (NT) invertebrates [17][18][19][20] . However, there are still some unsettled key issues in GE maize cultivation which remain to be addressed, such as if GE technology improves the grain quality in terms of nutritional value and toxin content (including mycotoxins) 21,22 , and if it affects important agro-ecosystem services including soil organic matter decomposition.
Therefore, this study is aimed at increasing our knowledge about agronomic traits and safety for human health and environment of GE maize cultivation by performing a meta-analysis of the peer-reviewed literature (from 1996 to 2016) on yield and by extending the analysis on new parameters, such as grain quality, non target organisms (NTOs) at family level, target organisms (TOs) and soil biomass decomposition, allowing more robust evaluation of the field performance of GE maize. This study, embracing the period 1996-2016, applies rigorous criteria for study selection, such as the inclusion in the dataset of field observations comparing GE maize with its true non-GE isoline or near isoline, throughout its overall cultivation period.

Results
Composition of the database. The first step of the selection procedure yielded 6,006 publications. The subsequent refinement, by adopting the stringent criteria above described, gave 32, 5, 32 and 10 eligible publications, covering, respectively, the following categories: grain yield and quality, TOs, NTOs (non-target organisms), and biogeochemical cycles (e.g. lignin content in stalks and leaves, stalk mass loss and biomass loss, CO 2 emission) (Supplementary 1 . The comparison of our dataset with the available NTO dataset of Wolfenbarger et al. 18 allowed the inclusion of 40 observations (Supplementary 4). The main reasons for paper exclusion were that the experiments were not performed under field conditions, they did not have a near isogenic hybrid as comparator, hybrids were not grown under identical conditions, or the data lacked of a measure of variance, statistical significance, or sample size. For the traits within each category, the maximum number of observations has been taken into account. No papers were selected for the biodiversity category and for CO 2 emission trait. The number of papers and observations utilised for the meta-analysis of each trait is reported in Table 1. Overall, grain yield and quality, TOs, NTOs and biogeochemical cycle databases were composed of 542, 99, 813, and 29 observations, respectively (see databases provided as Supplementary 2, 3, 4, and 5). Regarding the geographic distribution of the field studies, the majority of them were performed in North America (202), followed by Europe (52), and South America (17) (Supplementary 1 Tables 2-5). Asia, Africa and Australia were represented with eleven, one, and no studies, respectively (Fig. 1). Ninety-nine per cent of the studies in North America were done in the United States, of which 49% in Iowa, Illinois, and Nebraska. In Europe the field studies were performed in nine countries (Germany, Spain, France, the Czech Republic, Denmark, Hungary, Italy, Slovakia, and the United Kingdom), while in South America they were performed in three countries (Brazil, Argentina, and Chile).
Grain yield response was based on 46% of observations on single event hybrid maize, and on 33, 13, and 8% of double, triple and quadruple stacks (GE events combined by hybridisation) in hybrid maize, respectively ( Table 2).
The TO dataset was composed of data on the abundance of Diabrotica spp.; western corn rootworm, Diabrotica virgifera virgifera (LeConte); northern corn rootworm, Diabrotica barberi (Smith and Lawrence), and southern corn rootworm, Diabrotica undecimpuctata howardi (Mannerheim) (Coleoptera: Chrysomelidae). For species other than Diabrotica spp. there were not enough observations to perform meta-analysis. The TO dataset was analyzed at genus level to achieve an adequate sample size for analyses (Supplementary 3). The response of Diabrotica spp. was based on 65% of observations on single hybrids, and 35% on double hybrids ( Table 2).
The NTO datasets were composed of data on the abundances of NTOs classified as phylum, class, order, and family levels, but they were analyzed at family level (Supplementary 1 Table 4; Supplementary 4) that was the finest possible taxonomic resolution allowing reliable analyses. The analyzed NTO families were: Anthocoridae, Aphididae, Braconidae, Carabidae, Chrysopidae, Cicadellidae, Coccinellidae, Nabidae, Nitidulidae, and Staphylinidae (Supplementary 1 Table 4). Spiders were analyzed at the order level Araneae. For the response of all families the abundance of adults was analyzed, whereas for Chrysopidae and Coccinellidae the abundance of larvae was also analyzed, Details about number of observations for the traits grain yield, damaged ears and TOs in single, double, triple and quadruple stacked hybrids are given in Table 2.
Effects on grain yield and quality. Almost all studies on grain yield and damaged ears were done with plants expressing cry genes against Lepidoptera, some also stacked with genes against Coleoptera and herbicide-tolerance (Table 2). Genetically engineered maize cultivation led to a significant increase of yield compared to the non-GE isolines or near isolines. The overall mean effect size (g + ) for grain yield was 0.526 ( Fig. 2; Supplementary 1 Table 6) and the percentage of change was 10.1% (Fig. 2). The mean effect sizes calculated for the grain yield of single, double, triple and quadruple stacked hybrids were positive (g + ranging from 0.38 to 0.629) and significant ( Fig. 2; Supplementary 1 Table 6). The percentage of change ranged from 5.6 to 11.7% for single, double and triple stacked hybrids, while it was 24.5% for quadruple staked hybrids (Fig. 3). Mean Scientific REPoRtS | (2018) 8:3113 | DOI:10.1038/s41598-018-21284-2 effect size responses did not significantly change when we accounted for another sensitivity test based on the removal of a set of observations (reduced dataset) and the results were also robust to publication biases (Table 3; Supplementary 1 Table 6). The publication biases were assessed by the sensitivity analysis that compares the fail-safe number (N) with a threshold calculated as 5n + 10, where n is the original number of studies. A fail-safe number is considered robust if it is greater than the threshold value.
The mean effect size (g + ) of the damaged ears calculated for all hybrids was −0.061, and the 95% CI did not overlap zero ( Fig. 2; Supplementary 1 Table 6). The percentage of reduction was 59.6% (Fig. 3). When differentiated into hybrid type, the mean effect size was positive for the damaged ears of double, triple and quadruple stacked hybrids, whereas the response was not significant when restricted to single hybrids (95% CI, −0.012-0.011) ( Fig. 2; Supplementary 1 Table 6). The percentage of change of damaged ears was 73.4, 31.1, and 84.0%, for double, triple and quadruple stacked hybrids, respectively. These responses did not change following the sensitivity analysis based on one set of observations removal and were also supported by the fail-safe number test (Table 3; Supplementary 1 Table 6).
The concentration of proteins, lipids, Acid Detergent Fiber (ADF), Neutral Detergent Fiber (NDF) and Total Dietary Fiber (TDF) in grain did not vary between GE hybrids and the isolines or near isolines (Supplementary 1 Table 6). These responses did not change after the sensitivity analysis based on one set of observations removal (Table 3; Supplementary 1 Table 6). Only the results on lipids were supported by fail-safe number test.
Observations on mycotoxins were mostly done on plants expressing resistance to Lepidoptera (84%), while the observations on fumonisin and thricotecens were done only on plants expressing resistance to Lepidoptera.  Table 6). The reductions ranged from 28.8 to 36.5% for mycotoxins and thricotecens, respectively (Fig. 3). These responses did not change according to the sensitivity analysis based on the removal of one set of observations and to the fail-safe number test (Table 3; Supplementary 1 Table 6).

Effects on target and non-target organisms.
Almost all observations on the abundance of TOs were done with plants expressing resistance to Coleoptera ( Table 2). The abundance of Diabrotica spp. was highly sensitive to GE cultivation with −5.007 mean effect size and 89.7% reduction (Figs 2 and 3; Supplementary 1 Table 6). This result was supported by both sensitivity analyses (Table 3; Supplementary 1 Table 6). The observations on the abundance of NTOs were obtained from plants expressing resistance to Coleoptera (35%) and Lepidoptera (65%) ( Table 4). Among taxonomic groups of NTOs, Anthocoridae, Aphididae, Araneae, Carabidae, Chrysopidae (adults and larvae), Coccinellidae (adults and larvae), Nabidae, Nitidulidae and Staphylinidae were not affected by GE cultivation. These results were supported by the sensitivity analysis based on the removal of one set of observations (Table 3; Supplementary 1 Table 6). Only the results on Coccinellidae (adults) and Staphylinidae were supported by the fail-safe number test. By contrast, Braconidae were significantly decreased (g + = −0.457), and Cicadellidae were significantly increased (g + = 0.030) ( Fig. 2; Supplementary 1 Table 6). The results on Braconidae were robust to both sensitivity tests (Supplementary 1 Tables 7 and 8). The results on Cicadellidae were not supported by sensitivity analysis (Table 3; Supplementary 1 Table 6). The abundance of Braconidae was reduced in GE maize by 31.5%, (Fig. 3).

Biomass decomposition.
All observations were done with hybrids expressing insect resistance, either single or stacked with herbicide tolerance (Supplementary 5). Lignin concentration in leaves and stems did not change between GE hybrids and their isolines or near isolines ( Fig. 2; Supplementary 1 Table 6). These results were supported by the fail-safe number test, but not by the sensitivity analysis based on the removal of one set of observations, probably due to the small number of pairwise comparisons (n = 4) ( Table 3; Supplementary  1 Table 6). Similarly, the stalk mass loss was not significantly different in GE hybrids compared to their isolines or near isolines, whereas the biomass loss, including all crop residues, was significantly increased in GE hybrids ( Fig. 2; Supplementary 1 Table 6). The biomass loss corresponded to an increase of the biomass decomposition rate of 5.9% (Fig. 3). The results on biomass loss were supported by both sensitivity analyses (Table 3;  Supplementary 1 Table 6).

Discussion
Composition of the database. To date, a considerable number of scientific articles on GE maize is present in the literature (6,006 publications examined). However, on the basis of the criteria adopted for data selection, only 76 publications were eligible for the meta-analyses. This selection suggests there is a need for more field research with a wider geographic coverage and having appropriate comparators and field design allowing robust statistical analyses. It is interesting to note that in Europe there is a relatively large number of field studies carried out in several European Union member states despite GM maize is extensively cultivated only in Spain, due to the national legislative constraints in the other countries. Moreover, there is a need to publish research data in a more standardized way, e.g. providing raw data with at least three replicates, allowing the calculation of variance. As regards the partitioning of GE hybrids by trait in the grain yield dataset, we noted that single event HT hybrids were missing and this did not allow the evaluation of such a major category of maize GE hybrids on grain yield and the other agro-environmental traits linked to the development of weed resistance to herbicides. Finally, we noted that some categories were not adequately covered in our database, such as biodiversity and soil biogeochemical cycles that are the processes that modulate the provision of agro-ecosystem services 23 .
Effects on grain yield. Our study indicated that GE maize hybrids increased yield by 10.1%, corresponding to 0.7 t ha −1 , calculated on the average grain yield of the GE isolines or near isolines in the dataset. These results, based on a high number of observations (n = 276), essentially confirmed previous results 15 , showing a GE maize yield increase of 0.6 t ha −1 . In a meta-analysis of the yield responses of GE maize hybrids in Spain 14 , similar yield increases were recorded (5.6% corresponding to 0.7 t ha −1 ), and higher yields were reported in Germany and South Africa (12.2 and 24.6%, corresponding to 1.1 and 1.8 t ha −1 , respectively). The yield increase for GE maize, calculated by disaggregating data reported by Klümper and Qaim 16 , was 18.1%. This higher yield compared to our results (18.1% vs 10.1%) might be caused by the fact that Klümper and Qaim included book chapters, grey literature and other datasets that were excluded for our meta-analysis. Indeed, this is supported by the observation that the type of publication (i.e. studies published in peer-reviewed or non-peer reviewed journals) affected the outcome of the analysis 16 . In our study we found that yield increase of GE maize varied in relation to the type of hybrid, ranging from 5.6 to 24.5% in double and quadruple stacked hybrids, respectively. Quadruple stacked hybrids provided higher grain yields. This could be related to a greater overall pest protection due to the insertion of multi-events providing resistance to Coleoptera and Lepidoptera 10,24 , confirming the positive outcome of the new genetic-engineering technologies 13 . Global losses of maize production due to pests and weeds are estimated at 31.2% and 10.5%, respectively 25 , while the yield gain provided by insect pest management by chemical insecticides is estimated about 18% 26 .  Effects on crop protection chemicals. Due to the selection criteria adopted, in our study we did not find a sufficient number of data for analyzing the quantity of insecticide and herbicide utilized in GE maize compared to the isolines or near isolines and for performing an economic analysis. Other authors have estimated that in the period from 1996 to 2011 the adoption of GE HT and IR maize caused a reduction in the volume of the active ingredient of herbicides and insecticides of 10.1% and 45.2%, respectively 27 . According to this study, the adoption of GE HT crops resulted mainly in a shift of the profile of used herbicides, and the GE IR technology has effectively reduced insecticides used to control important crop pests. Previous meta-analyses compared Bt crops and non-Bt crops that had been treated with insecticides 17,18,20 . These studies indicated that the systems using GE technology have benefited also from better biological control of all the pests the technology does not affect. This could be considered an indirect benefit of the technology.
Effects on quality traits. The results clearly indicate that GE maize grain contains lower amounts of mycotoxins (29%), fumonisin (31%) and thricotecens (37%) than its non-GE counterpart. The lower mycotoxin content seems to be related to the lower incidence of insect attack, since GE maize resulted in 59.6% less damaged ears compared to the corresponding isolines or near isolines. Insects promote fungal colonization by acting as vectors of fungal spores and by creating wounds in kernels on which the germination of fungal spores is favoured during cultivation and storage with resultant mycotoxin accumulation in grain [28][29][30] . Mycotoxins are toxic and carcinogenic for humans and animals, and the high mycotoxin content in grain, beside the health risk, causes market rejection of grain or reduction in the market price. By contrast, the lower mycotoxin content in GE maize grain can help to minimize the exposure of humans to health hazardous toxins through the diet. The risk of exposure to  31 . In a climate change scenario with rainfall reduction and increase of temperature, maize will be increasingly subjected to drought stress 32 and more susceptible to fungal attack 33,34 .
The authorization procedure prior to GE crop cultivation requires the substantial equivalence of composition with non-GE crops as an end point 35 . Apart from mycotoxin levels, our results indicated that the composition of GE maize grain did not differ from that of the isolines for protein, lipid, ADF, NDF and TDF content, and confirm what was found on compositional equivalence between GE crops and non-GE comparators over the last two decades 36 .

Impact on TOs and NTOs. The European corn borer Ostrinia nubilalis (Hubner) (Lepidoptera: Crambidae)
and the Mediterranean corn stalk borer Sesamia nonagrioides Lefebvre (Lepidoptera: Noctuidae), along with the western corn rootworm (Diabrotica virgifera virgifera Le Conte) (Coleoptera: Chrysomelidae) are common pests affecting maize 34 . In our study, only the data on Diabrotica spp. abundance were sufficient to perform a reliable meta-analysis. Our results clearly indicated that GE maize was highly effective against Diabrotica spp. infestation with 89.7% of pest decrease compared to the non-GE isolines. All data utilised were collected in field experiments where no insecticide was applied. The effectiveness of IR crops against insect pests is the main objective of crop genetic engineering and our data confirm that this target has been achieved, although the use of Diabrotica adult number could be regarded as a not entirely reliable indicator, since the damage is mostly caused by larvae. Moreover, the resistance to Diabrotica in the last generation maize hybrids is indicated by GE seed producers as a partial one and attempts are ongoing to further improve the resistance trait by using the RNA interference (RNAi) as a novel strategy 13 .
Despite the high effectiveness of IR crops, the evolution of resistance in pests and a consequent reduction of the GE crop effectiveness can not be excluded. Actually, resistance and cross-resistance to Bt maize were recently detected in Spodoptera frugiperda (J.E. Smith) (Lepidoptera: Noctuidae) in Puerto Rico 37 , Busseola fusca (Fuller) (Lepidoptera: Noctuidae) in South Africa 38 and in the Coleoptera D. virgifera in Iowa 39 even though the implementation of refuges has been mandated in USA, EU, Australia and elsewhere 40 . The refuge strategy, implemented with distinct management practices 41,42 , is based on the idea that refuges, which consist of non-Bt host plants near or in fields of Bt crops, produce susceptible pests that mate with the rare resistant individuals surviving on Bt crops. Another recent approach for delaying the evolution of pest resistance consists in the development of Bt crops expressing more than one Cry toxin, such as the multiple stacked/pyramided Bt crops 43 .
Our study showed that GE maize did not significantly affect the majority of the NTOs families, notably Anthocoridae, Aphididae, Araneae, Carabidae, Chrysopidae, Coccinellidae, Nabidae, Nitidulidae and Staphylinidae. On the contrary, we detected a considerable decrease in Braconidae 44 .
Overall, the results of NTOs are consistent with previous results 17 showing no effects of IR GE maize on different NT insect taxa, except for the presence of Hymenoptera that was lower in GE maize. Similarly, no effect of Bt maize on 26 arthropod taxa, including herbivores, predators, omnivores, parasitoids and composers, was detected in a meta-analysis of the results from 13 field trials in Spain 19 .

Impact type Parameter
Fail-safe number n 5n + 10 Grain yield and quality  Table 3. Sensitivity analysis based on the fail-safe number (i.e., the additional number of observations necessary to change results of the meta-analysis from significant to non-significant) and the number of studies (n). The observed decrease of Braconidae, mostly represented by M. cingulum (98% of observations), in GE crops is in line with other findings 18 that showed a decrease of populations belonging to the functional guild of parasitoids. Since the abundance of parasitoids depends largely on the abundance of the target pest host, the observed decrease of M. cingulum in GE maize is very likely an indirect effect of the decrease in O. nubilalis caused by the GE maize.
Differently from other results 19 , covering a limited area the NE Iberian Peninsula, we found, on the basis of observations obtained in three continents, an increase in Cicadellidae, although not supported by the sensitivity analysis revealing that this result can not be considered robust.
From a methodological point of view, all the above-cited meta-analyses have taken into consideration a larger number of observations, including experiments not having the appropriate comparators and statistics and embracing the grey literature. Impact on biomass decomposition. Plant nutrition and soil quality are directly affected by the decomposition of organic matter, which in turn depends on plant tissue composition, environmental conditions, and soil biota. Our analyses indicated that lignin concentration in leaves and stalks did not change between GE maize and their isolines. Quantity and quality of lignin are considered the main traits affecting the rate of plant biomass decomposition because lignin is the most recalcitrant component of plant tissues and offers protection to associated polysaccharides, proteins, and other plant components more susceptible to biodegradation (e.g., [45][46][47]. The rates of litter mass loss correlate with the initial lignin and N contents 48,49 . Consistently, we observed no difference in stalk mass loss between GE crops and their isolines. By contrast, we found significant differences in the loss of total biomass that includes all crop residues (leaves, stalks and tassels). This disagreement might be due to differences between GE and their isolines in the proportion and composition of the plant organs in the residue, i.e. stalks and leaves which have a distinct rate of degradation 50,51 . Unfortunately, it was not possible to compare the results of biomass loss with those of CO 2 soil fluxes and C storage in soil due to an insufficient number of data to be analysed.
Laboratory and greenhouse studies on IR and HT maize have drawn attention to GE proteins in soil and their potential effects on soil biota (e.g., 52,53 ), but few studies have evaluated the effects of GE maize on soil biota in field conditions and we could not perform a meta-analysis due to scarcity of data for single taxa or because the data did not fulfil the criteria of meta-analysis. Specifically, field comparisons of GE and non-GE maize revealed sporadic decreases in the biomass of amoebae, earthworms, flagellates, ciliates, as well as of nematodes with no difference or small difference in nematode community composition [54][55][56] . Therefore, in the case of nematodes that utilise as a food resource bacteria, fungi or plants, GE maize seems to have a direct effect on specific food resources rather than to have an indirect effect 54 . In addition soil microbial biomass and activity did not change between GE and non-GE maize 54,55,57 . Bacterial community profiles in the rhizosphere were not modified or only slightly modified by HT-maize hybrids 55,58 and IR-maize hybrids 54,55,57 . However, if some slight bacterial community changes occurred, these were shown not to be persistent 55 , probably due to the rapid degradation or inactivation of toxins in soil in field conditions 53 . Finally, the arbuscular mycorrhizal fungal (AMF) community, spore abundance and root colonization did not change in Bt versus non-Bt maize, suggesting that the cultivation of Bt maize may not have an impact on AMF in soil under field conditions 59 .
In conclusion, our meta-analysis of 21 years of field data on the agro-environmental impact of GE maize clearly shows the benefits in terms of increases in grain yield and quality, and in decreases of the target insect Diabrotica spp. Our analysis highlights modest or no effect on the abundance of non-target insects, suggesting no substantial effect on insect community diversity. This confirms previous results on NTOs and extends our knowledge to new taxa. We provide also strong evidence that GE maize cultivation reduces mycotoxin content in grain. Since mycotoxin contamination in maize grain annually leads to high economic losses in all regions of the world, the protection of maize plants through the use of GE technology against the damage of insects, favouring the development of toxinogenic  fungi, can be seen as an effective tool to reduce the contamination of grain. This can lead to increases in economic income and quality of the production and to reductions in the human exposure to mycotoxins, thus reducing health risks. Languages were: English, Spanish, French, and German. Experiments performed in controlled environments (greenhouse or climatic chamber) were excluded. We selected experiments performed in field conditions whose controls were represented by near isogenic lines and that were managed in the same way as the corresponding GE maize. We accepted only pairwise comparisons in which controls entailed non-GE varieties grown under identical conditions, because our aim was to elucidate the effect of genetic engineering of maize on agro-environmental traits. When studies include measures of both peak abundance (highest density on any given sample dates) and seasonal abundance (averaged over multiple sample dates) the peak data should be retained. To be included in the database, articles needed to report data on the agro-environmental impact of GE maize with a measure of the variance (standard deviation, standard error, coefficient of variation, least significant difference) or the statistical significance of the analysis (t-or P-value), and the sample size. Data were classified into five categories: (1) grain yield and quality including economic parameters; (2) NTOs, (3) TOs; (4) biogeochemical cycles; (5) biodiversity. For all categories, from each article data were extracted as shown in Supplementary 1 Table 9. Data from figures were extracted by the GraphGrabber software package 60 .
The following information was also extracted: (i) geographic location (latitude and longitude); (ii) transformation event(s), expressed toxin(s), hybrid name, commercial name, seed company in GE hybrid, (iii) hybrid name, commercial name, seed company in non-GE isoline; (iv) treatment (e.g., location, year of cultivation, irrigation, biocontrol); (v) NTO taxon. For each NTO taxon, phylum, class, order, and family were determined and included in the database. Non-target organisms were classified according to the Fauna Europaea (http://www.fauna-eu.org) and the Integrated Taxonomic Information System (http://www.itis.gov). In addition, for the categories grain yield and quality and NTOs we implemented our datasets following the comparison with available datasets published by Klümper and Qaim 16 and Wolfenbarger et al. 18 , respectively. Klümper and Qaim 16 reported aggregated data for soybean, maize, and cotton and we disaggregated these data to obtain the ones referred only to maize. Wolfenbarger et al. 18 included data requested from Authors in their dataset, and this allowed us to include 40 observations in this study.
For articles that reported observations from more than one treatment, all the observations were included in the database. To perform reliable analyses, agro-environmental traits were included in the analysis if they were reported in at least three articles. For each trait included in the four categories, a dataset was built to perform the calculation of the number of studies and observations and the statistical analysis described as follows. Whenever possible, we analyzed the trait for the classes single, double and triple stacks, and expressed toxin.
The methods of analysis accepted for grain quality, residues mass loss and CO 2 emission from residues and soil are given in Supplementary 1 Table 8.
Response ratios and statistical analysis. Effect size was estimated by Hedges' g 61 . For each observation included in the database, Hedges' g was calculated for all traits as the difference between the mean of GE and non GE maize divided by the pooled standard deviation and adjusted by a weighting factor based on the number of replicates per treatment. The Hedges' g was calculated using the Comprehensive Meta-Analysis software, version 3 62,63 , starting from means, SD or P values and number of replicates. Hedge' g is a unitless index that ranges from −∞ to +∞ and estimates the size and direction of impact. Hedges' g values of zero signify no difference in the measured trait due to genetic engineering, while positive and negative values imply an increase and a decrease in the trait following genetic engineering, respectively. For each trait we calculated the mean effect size (g + ) across the observations from all articles, weighted by sample size. Data were analyzed using a random-effect model, assuming that the true effect sizes could vary from study to study 62 . Under this model, two sources of variance are taken into account and within-study error and across-study error are calculated. The variability within-and across-studies is assumed to have a normal distribution. The model is hierarchical because the within-study variance is nested within the among-study variability and is mixed because it has more than one variance. To test whether g + differed significantly from zero (i.e., no change due to genetic engineering), we assessed whether the 95% bootstrap-confidence interval (CI) of bias-corrected g + did not overlap zero based on 999 iterations 62 . We also tested whether effect sizes across the observations from all articles were homogeneous, using the Q total statistic (Q t ) based on a chi-squared test 62 . A significant Q t indicates that the variance among effect sizes is greater than that expected from sampling error alone. The calculations were made using the Comprehensive Meta-Analysis software, version 3. The mean percentage of change for the g + values different from zero was calculated as [exp(R + ) −1] × 100 where R + is the weighted mean response ratio (R) across the observations from all articles 62 . The response ratio was calculated as ln (X GE /X non-GE ) where X GE is the trait in the GE maize and X non-GE is the trait in the isoline.
To verify whether the results would change if a set of observations was removed from the analysis, we applied the sensitivity analysis described by Borenstein et al. 63 . This analysis was performed for each trait using a reduced dataset obtained by randomly removing one observation per article, when the number of observations per article was higher than one. Thus, for each trait, the mean effect size obtained from the reduced dataset was compared with the one obtained from whole dataset. This procedure is available in the software Comprehensive Meta-Analysis software, version 3. Comparisons were made using 95% bootstrap CIs based on 999 iterations and using P-values. The data analyses were performed using the Comprehensive Meta-Analysis software, version 3.
In order to identify the publication bias due to the tendency for journals to only publish studies with statistically significant results, sensitivity analysis was performed comparing the fail-safe number with a threshold calculated as 5n + 10, where n is the original number of studies 64 . A fail-safe number is considered robust if it is greater than the threshold value. The calculation of fail-safe number was done applying a random-effect model in the Comprehensive Meta-Analysis version 3.
Data availability. All data generated or analyzed during this study are included in this published article (and its Supplementary files).