Abstract
Genetically modified (GM) crops currently constitute a significant and growing part of agriculture. An important aspect of GM crop adoption is to demonstrate safety and equivalence with respect to conventional crops. Untargeted metabolomics has the ability to profile diverse classes of metabolites and thus could be an adjunct for GM crop substantial equivalence assessment. To account for environmental effects and introgression of GM traits into diverse genetic backgrounds, we propose that the assessment for GM crop metabolic composition should be understood within the context of the natural variation for the crop. Using a non-targeted metabolomics platform, we profiled 169 metabolites and established their dynamic ranges from the seeds of 49 conventional soybean lines representing the current commercial genetic diversity. We further demonstrated that the metabolome of a GM line had no significant deviation from natural variation within the soybean metabolome, with the exception of changes in the targeted engineered pathway.
Similar content being viewed by others
Introduction
GM crops were first introduced to the market in 1994. Their adoption, especially in North and South America, has been steadily accelerating, now accounting for the majority of acreage in several major crops, including corn, soybean, cotton, and canola1,2. Transgenic herbicide tolerance and insect resistance have become the most widely applied trait types, providing the benefits of higher yield, lower input costs, and improved environmental profiles2,3,4. From the outset the most important aspect of GM crop adoption, and often the most controversial, has been whether they are as safe as conventional crops with respect to human and animal food consumption, as well as in their environmental impact. Extensive compositional and performance testing has always been required in order to gain regulatory approval for commercial release, and to date this concept has proven successful; despite more than a decade of widespread use of GM crops, and their extensive penetration into the food and feed chains, no case of deleterious effect on humans or animals has been found2. Of course, continued success requires case-by-case evidence that any new GM product meets similar or more refined standards.
The most common testing approach has been to compare the GM line to its nearest isogenic version, that is, the same line lacking only the transgenic insertion. Experience has shown that for traits in widespread use today effects of the transgene on the plant tend to be small, with the exception of parameters relating to the intended engineered trait. Indeed, testing over multiple years at multiple sites has shown that environmental effects on general metabolic variation within a particular line can be much greater than the variation due to the transgene itself 5,6,7,8. In addition, genetic variability within the commercial germplasm bearing an engineered trait can be quite diverse. In practice, conventional commercial crop lines have been individually optimized for performance in a wide variety of locations and environments, and a single GM line bearing a herbicide- or insect-resistance trait would not be economically viable outside the optimal agronomic region for that genetic background. As a result, the GM trait must be introgressed into a wide range of conventional backgrounds, and thus it is important to consider the variation of transgenic crops within the context of the entire range of germplasm expression.
A broad variety of methods have been used in the assessment of safety and equivalence for GM crops, and regulatory agencies rightly demand that the most advanced and accurate available technologies be applied. The most common approaches in the first decade of GM regulation involved extensive compositional and performance analysis for such characteristics as (in the case of animal feed) digestibility, gross levels of total protein, starch, fiber, fat, etc., as well as targeted analysis of specific amino acids, fatty acids, secondary metabolites, and known toxins and anti-nutritive compounds9,10,11. As analytical technologies advanced it was suggested to expand the repertoire of analysis to include non-targeted “profiling” techniques12, with the reasoning that such analysis might give insight into unpredictable pleiotropic effects and potentially deleterious consequences of transgene expression7. In one example of the later case, negative alterations in expression might be related to the site of gene insertion, which in most cases is at a random site in the genome. Soon these profiling methods were being applied, including proteomics13, NMR-based metabolite fingerprinting5,14, HPLC or GC/MS metabolite analysis5,8, and global gene expression profiling7.
Because the trend for GM plant analysis has been toward more advanced and informative technologies, we have applied non-targeted global metabolomic analysis to the assessment of GM crops. Metabolomics, which is the global analysis of small molecule metabolites, is proving a powerful and sensitive technology for revealing perturbations in plant metabolic composition. We analyzed seeds from 49 conventional soybean lines and profiled 169 metabolites covering diverse biochemical pathways and classes. The relative levels of the metabolites across these lines provided a representation of the dynamic ranges of the natural soybean seed metabolome. As an example, we further analyzed a soybean GM line and found its metabolome resided well within the natural variation with the exception of the engineered pathway.
Results
The metabolomics platform used in this analysis consists of three independent methods: ultrahigh performance liquid chromatography/tandem mass spectrometry (UHLC/MS/MS2) optimized for basic species, UHLC/MS/MS2 optimized for acidic species, and gas chromatography/mass spectrometry (GC/MS). The metabolites are identified by comparison of the ion features in the experimental samples to a reference library of chemical standard entries that include retention time, molecular weight (m/z), preferred adducts, and in-source fragments, as well as their associated MS/MS2 spectra. We analyzed seeds in eight biological replications from 49 conventional soybean accessions (Supplemental Table 1), which well represented the genetic diversity found in current commercial lines, and identified a total of 169 compounds of known structures, covering 51 biochemical pathways and compound types (Supplemental Figure 1). Interestingly, it was found that individual metabolites exhibited a considerable range of variation among the 49 conventional soybean lines (Figure 1). For each metabolite, we defined the natural dynamic range as the lines with the highest and lowest mean values. For individual metabolites the ratios of mean values for the line exhibiting the highest level to that having the lowest level ranged from 1.4-fold (stachyose) to >200-fold (allantoin) (Supplemental Table 2). This suggested that a portion of the soybean metabolome was tightly regulated across these lines, while the levels of many other metabolites could be dramatically altered by the combination of genetic and environmental impacts. Collectively, these data gave a representation of the natural variation range of the commercial soybean seed metabolome.
To test the effects of a genetically engineered trait on the soybean metabolome, we analyzed a soybean GM line resistant to the herbicide Mesotrione, and assessed the results in the context of metabolomic variation within a wide range of soybean germplasm. Mesotrione is a triketone herbicide which was developed based on the structure of a natural phytotoxin from Callistemon citrinus, the California bottlebrush plant, and acts as a competitive inhibitor of hydroxyphenylpyruvate dioxygenase (HPPD)15. Inhibition of HPPD leads to disruption of the carotenoid pathway, which is important in the production of photosynthetic electron transport compounds and anti-oxidant molecules in the tocopherol family. Blockage of this pathway allows runaway production of reactive oxygen species (ROS), which rapidly leads to light-induced chlorophyll damage and plant death. The GM trait was constructed in the soybean cultivar “Jack” by over-expressing a mutated oat HPPD gene, which conferred an ability to tolerate levels of mesotrione which are normally efficacious for control of dicot weeds.
We are not aware of established standards to judge the importance of potential metabolic perturbations which might result from transgene introduction, especially in the case of a large number of outcome variables. We thus propose different statistical methods to show what types of boundaries or comparisons might be considered, the goal being to develop hypotheses about the biological importance of any observed differences in transgenic lines relative to conventional germplasm. While it is not possible to establish “equivalence” by such methods, perturbations outside the normal ranges of conventional germplasm would certainly be a cause for concern and signal a need for further scrutiny.
In one approach we sought to establish if compound expression in the GM line differed from the various conventional lines, also termed “wild type” (WT) lines. The analysis used all of the conventional lines except for “Jack” as the baseline. Significance testing and prediction intervals were computed (Supplemental Table 3) comparing “Jack” vs. the other conventional WT lines, and then comparing the transgenic HPPD line (GM) vs. the WT lines (excluding “Jack”). A random effects model was fitted to the conventional lines with “LINE” treated as a random effect. The method of moments was used to estimate each variance component. Then prediction intervals for the average of a new line (with 8 seeds) were computed (first to compare the GM value, the second for “Jack”). The prediction intervals can be inverted to compute a p-value, which is equivalent to using the t-test with the pooled estimates of the variances (estimated from the 48 conventional lines). The histogram of the p-values for “Jack” vs. WT (Figure 2A) showed a uniform distribution, which was consistent with the case when there are no differences. By comparison, the GM p-value distribution (Figure 2B) shows a few potential metabolites that are different, in particular, the lowest two p-values were achieved by delta-tocopherol (Figure 3), and gamma-tocopherol, both products of the engineered pathway. A few other compounds showed higher but still relatively low p-values, e.g. AMP and phytate (myo-inositol-hexakisphosphate), which might flag them as compounds of interest. However, it is not possible to know if these result from some effect of the engineered gene, or if they represent false discoveries, several of which might be expected when performing 169 tests. We note that only delta-tocopherol meets the stricter multiple testing criterion of q < 0.1 for false discovery rate.
A minor drawback of the random effects model is that it can yield inaccurate estimates of confidence intervals and p-values for compounds which have significant missing data points, e.g. >50% nulls (see Supplemental Table 4). These compounds typically are represented by very small peaks, and can fall below the detectable limit in many of the lines tested. For instance, compounds such as N-carbamoylaspartate, in which all values were null for the GM line (as well as for 14 other lines), the low variance in the imputed data for the GM group would tend to underestimate p-values. However, because none of these approached real significance in the overall analysis, we feel this methodological shortcoming is not problematic for the question posed here.
To avoid the issues related to multiple testing, we fitted a Principal Component Analysis (PCA) to the log-transformed data for the 48 lines, then the values for GM and “Jack” were predicted from this PCA. For simplicity, only the first component was used, but the results could be generalized to higher dimensions. The distributions of the predicted values for the first component are shown in the box plot diagrams in Figure 4. Next, the 95% prediction interval was computed for the mean of a new line (8 seeds), the result being (−9.17, 9.17). Both “Jack” (1.44) and the GM line (0.35) fell well within the prediction interval.
These analyses suggest that with the exception of the intended engineered trait, the overall metabolomic expression of the GM herbicide tolerant line could not readily be distinguished from conventional germplasm. Box plots for all biochemicals, comparing the GM line to the conventional line pool and to the parental line “Jack” and arranged by biochemical pathway group, are shown in Supplemental Figure 1.
Discussion
Genes which encode genetically modified (GM) traits in crop plants for improved agronomic properties, such as tolerance to herbicides or resistance to insect attack, must be shown not to adversely affect the performance of the crop, nor to introduce unsafe alterations in food or feed products produced from these plants. Methods used to demonstrate the safety of GM crops have improved and expanded over the approximately 18 years since their introduction, with a trend toward inclusion of more global “-omics” technologies to gain a broad picture of plant biochemistry and physiology. Furthermore, the utility of any new method requires that it bring additional understanding of any potentially negative impacts that the engineered trait may have on the overall biochemical makeup or physiological performance of the target plant. Metabolomics, which is the global analysis of small molecule metabolites, is proving a powerful and sensitive technology for revealing perturbations, whether environmental or genetic, in plant metabolic compositions.
We propose that comparing a GM line to the range of performance in a crop's native germplasm pool by metabolomic analysis could be a useful and proper standard as part of a safety and equivalence assessment program. It should be emphasized that the goal would be to uncover metabolic perturbations that should be further investigated in the context of performance and safety; of course, the absence of perturbations would not be sufficient to guarantee safety, but would only be part of the wider evaluation, including safety and environmental impacts of agronomic practices associated with the trait's application, such as herbicide safety, resistance, etc. We have now applied metabolomics to the question of equivalence assessment in the context of a genetically engineered crop plant, specifically to the effects of a herbicide tolerance gene in soybean. The metabolomic dataset here encompassed all the major pathways, including 44 compounds of the amino acid class, 42 carbohydrates, 24 lipids, 13 compounds in the cofactor/electron carrier class, 18 nucleotide derivatives, 14 peptides, and 14 compounds from secondary metabolism. The wide distribution of the metabolites measured, and the monitoring of multiple metabolites in key pathways such as glycolysis, the TCA cycle, nitrogen utilization, amino acid synthesis and catabolism, lipid oxidation, anti-oxidant utilization, and secondary metabolite production allowed many points of observation for any potential negative effects on metabolism caused by a transgene. It is interesting to consider the overall line-to-line metabolomic variation in commercial germplasm, and how a significant pleiotropic perturbation might express itself. More than half of the compounds showed <10-fold variation between the highest and lowest expressing lines (Supplemental Table 2), while some compounds showed a wide range of genotypic variation. As shown in Figure 5, energy metabolites, free fatty acids, and most amino acid metabolites were least variable, as might be expected in a metabolically quiescent tissue like seed, and negative effects on these pathways should be relatively easy to detect in a metabolomic screen. Compounds reflecting nutrient storage, environmental stress, and secondary metabolism were the most variable. For instance, citrulline, asparagine, and allantoin are all involved in nitrogen utilization and storage, while phytate is the storage sink for phosphate. These may vary strongly either from genetic factors, or because of nutrient availability during development. The compatible solutes ribitol, galactinol, and ectoine, as well as gulano-1,4-lactone (a precursor of ascorbate), are generated during seed desiccation, and may reflect environmental variation during seed maturation and desiccation. Accumulation of isoflavonoids in seeds (e.g. genistin, diadzin, glycitin) are also known to be subject to both genetic and environmental control16,17,18. Good pathway coverage, and a knowledge of expected natural metabolomic diversity demonstrates the usefulness of the method for monitoring metabolic consequences of an engineered trait, and the present case provides strong evidence that the seed's physiology was not disturbed in a significant manner by the presence of the transgene. As more complex traits are developed to impact more general crop performance, such as yield, drought tolerance, nitrogen utilization, etc., and which will likely involve engineered regulatory genes, it will be increasingly important to understand a trait's impacts on a broad range of biochemical pathways. Metabolomic analysis is ideally suited for such a task.
Methods
Materials
Seeds from 49 conventional soybean lines (Supplemental Table 1) were obtained from the USDA National Soybean Research Center at University of Illinois and Syngenta Biotechnology, Inc. Seed for the herbicide-tolerant GM line (event SYHT06W) was obtained from Syngenta. Eight individual mature seeds for each soybean line were subjected to metabolomic analysis.
Metabolomic profiling
The metabolomic platforms consisted of three independent platforms: ultrahigh performance liquid chromatography/tandem mass spectrometry (UHLC/MS/MS2) optimized for basic species, UHLC/MS/MS2 optimized for acidic species, and gas chromatography/mass spectrometry (GC/MS). The detailed descriptions of these platforms, including instrument, data acquisition and processing, and compound identification and quantitation, were published previously19,20. Essentially, the samples were extracted and split into three equal aliquots for analysis on the three platforms. For the two LC platforms, chromatographic separation followed by full scan mass spectra was carried out to record retention time, molecular weight (m/z) and MS/MS2 of all detectable ions presented in the samples. For the GC platform, the samples were derivatized using bistrimethyl-silyl-triflouroacetamide (BSTFA). The retention time and molecular weight (m/z) for all detectable ions were measured. The metabolites were identified by comparison of the ion features in the experimental samples to a reference library of chemical standard entries that included retention time, molecular weight (m/z), preferred adducts, and in-source fragments as well as their associated MS/MS2 spectra.
Data imputation and statistical analysis
Integrated peak ion counts were used to compare relative levels of a compound in each sample. For statistical analysis, the missing values for a given metabolite were imputed with the observed minimum detected value based on the assumption that they were below the limits of instrument detection sensitivity. Supplemental Table 4 shows the frequency of present/missing data; values represent the percentage of the eight samples for each line in which the compound was detected. Statistical analysis of the data was performed using “R” (http://cran.r-project.org/). For graphical display in heat maps data for each biochemical were scaled to the median observed value for that compound, then missing values were imputed as described above. For box plot displays the data were further transformed to log2 values. Multiple comparisons were accounted for using the q-value method of Storey and Tibshirani21 (Supplemental Table 3).
Change history
27 January 2014
A correction has been published and is appended to both the HTML and PDF versions of this paper. The error has not been fixed in the paper.
References
Duke, S. O. & Cerdeira, A. L. Transgenic crops for herbicide resistance., Vol. 2. (Springer: Heidelberg, Germany, 2010).
Ronald, P. Plant Genetics, Sustainable Agriculture and Global Food Security. Genetics 188, 11–20 (2011).
Carpenter, J. E. Peer-reviewed surveys indicate positive impact of commercialized GM crops. Nat Biotech 28, 319–321 (2010).
Cerdeira, A. L., Gazziero, D. L. P., Duke, S. O. & Matallo, M. B. Agricultural Impacts of Glyphosate-Resistant Soybean Cultivation in South America†. Journal of Agricultural and Food Chemistry (2011).
Baker, J. M. et al. A metabolomic study of substantial equivalence of field-grown genetically modified wheat. Plant Biotechnol J 4, 381–392 (2006).
Harrigan, G. G. et al. Natural variation in crop composition and the impact of transgenesis. Nature Biotechnology 28, 402–404 (2010).
Cheng, K. et al. Effect of Transgenes on Global Gene Expression in Soybean Is within the Natural Range of Variation of Conventional Cultivars. Journal of agricultural and food chemistry. 56, 3057–3067 (2008).
McCann, M. C., Liu, K., Trujillo, W. A. & Dobert, R. C. Glyphosate-tolerant soybeans remain compositionally equivalent to conventional soybeans (Glycine max L.) during three years of field testing. J Agric Food Chem 53, 5331–5335 (2005).
Novak, W. K. & Haslberger, A. G. Substantial equivalence of antinutrients and inherent plant toxins in genetically modified novel foods. Food Chem Toxicol 38, 473–483 (2000).
Aumaitre, A., Aulrich, K., Chesson, A., Flachowsky, G. & Piva, G. New feeds from genetically modified plants: substantial equivalence, nutritional equivalence, digestibility, and safety for animals and the food chain. Livestock production science 74, 223–238 (2002).
Aulrich, K., Böhme, H., Daenicke, R., Halle, I. & Flachowsky, G. Genetically modified feeds in animal nutrition 1st communication: Bacillus thuringiensis (Bt) corn in Poultry, pig and ruminant nutrition. Archives of Animal Nutrition 54, 183–195 (2001).
Kuiper, H. A., Kleter, G. A., Noteborn, H. P. J. M. & Kok, E. J. Substantial equivalence--an appropriate paradigm for the safety assessment of genetically modified foods? Toxicology 181, 427–431 (2002).
Corpillo, D. et al. Proteomics as a tool to improve investigation of substantial equivalence in genetically modified organisms: The case of a virus-resistant tomato. Proteomics 4, 193–200 (2004).
Manetti, C. et al. NMR-based metabonomic study of transgenic maize. Phytochemistry 65, 3187–3198 (2004).
Mitchell, G. et al. Mesotrione: a new selective herbicide for use in maize. Pest Management Science 57, 120–128 (2001).
Graham, T. L. Flavonoid and isoflavonoid distribution in developing soybean seedling tissues and in seed and root exudates. Plant physiology 95, 594 (1991).
Lozovaya, V. V. et al. Effect of temperature and soil moisture status during seed development on soybean seed isoflavone concentration and composition. Crop science 45, 1934–1940 (2005).
Tsukamoto, C. et al. Factors affecting isoflavone content in soybean seeds: changes in isoflavones, saponins, and composition of fatty acids at different temperatures during seed development. Journal of Agricultural and Food Chemistry 43, 1184–1192 (1995).
Ohta, T. et al. Untargeted metabolomic profiling as an evaluative tool of fenofibrate-induced toxicology in Fischer 344 male rats. Toxicologic Pathology 37, 521–535 (2009).
Evans, A. M., DeHaven, C. D., Barrett, T., Mitchell, M. & Milgram, E. Integrated, nontargeted ultrahigh performance liquid chromatography/electrospray ionization tandem mass spectrometry platform for the identification and relative quantification of the small-molecule complement of biological systems. Anal Chem 81, 6656–6667 (2009).
Storey, J. D. & Tibshirani, R. Statistical significance for genomewide studies. Proceedings of the National Academy of Sciences of the United States of America 100, 9440–9445 (2003).
Author information
Authors and Affiliations
Contributions
D.W., J.C. and L.G. proposed the original concept and study design. D.W., D.A., J.R. and L.G. performed data analysis and interpretation. M.M. and J.W. did the statistical analysis. All authors contributed to the writing of the manuscript.
Corresponding author
Ethics declarations
Competing interests
The authors declare no competing financial interests.
Supplementary information
Supplementary Information
supplemental material (PDF 1985 kb)
Rights and permissions
This work is licensed under a Creative Commons Attribution 3.0 Unported License. To view a copy of this license, visit http://creativecommons.org/licenses/by/3.0/
About this article
Cite this article
Clarke, J., Alexander, D., Ward, D. et al. Assessment of Genetically Modified Soybean in Relation to Natural Variation in the Soybean Seed Metabolome. Sci Rep 3, 3082 (2013). https://doi.org/10.1038/srep03082
Received:
Accepted:
Published:
DOI: https://doi.org/10.1038/srep03082
This article is cited by
-
Metabolome of flue-cured tobacco is significantly affected by the presence of leaf stem
BMC Plant Biology (2023)
-
Utilization of GC–MS untargeted metabolomics to assess the delayed response of glufosinate treatment of transgenic herbicide resistant (HR) buffalo grasses (Stenotaphrum secundatum L.)
Metabolomics (2020)
-
Recent developments in metabolomics-based research in understanding transgenic grass metabolism
Metabolomics (2019)
-
Mechanism study of cyfluthrin biodegradation by Photobacterium ganghwense with comparative metabolomics
Applied Microbiology and Biotechnology (2019)
-
Metabolic dynamics and physiological adaptation of Panax ginseng during development
Plant Cell Reports (2018)
Comments
By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.