Feature-specific nutrient management of onion (Allium cepa) using machine learning and compositional methods

Hahn, Leandro; Kurtz, Claudinei; de Paula, Betania Vahl; Feltrim, Anderson Luiz; Higashikawa, Fábio Satoshi; Moreira, Camila; Rozane, Danilo Eduardo; Brunetto, Gustavo; Parent, Léon-Étienne

doi:10.1038/s41598-024-55647-9

Download PDF

Article
Open access
Published: 12 March 2024

Feature-specific nutrient management of onion (Allium cepa) using machine learning and compositional methods

Leandro Hahn¹,
Claudinei Kurtz²,
Betania Vahl de Paula³,
Anderson Luiz Feltrim¹,
Fábio Satoshi Higashikawa²,
Camila Moreira⁴,
Danilo Eduardo Rozane⁵,
Gustavo Brunetto³ &
…
Léon-Étienne Parent^3,6

Scientific Reports volume 14, Article number: 6034 (2024) Cite this article

388 Accesses
Metrics details

Subjects

Abstract

While onion cultivars, irrigation and soil and crop management have been given much attention in Brazil to boost onion yields, nutrient management at field scale is still challenging due to large dosage uncertainty. Our objective was to develop an accurate feature-based fertilization model for onion crops. We assembled climatic, edaphic, and managerial features as well as tissue tests into a database of 1182 observations from multi-environment fertilizer trials conducted during 13 years in southern Brazil. The complexity of onion cropping systems was captured by machine learning (ML) methods. The RReliefF ranking algorithm showed that the split-N dosage and soil tests for micronutrients and S were the most relevant features to predict bulb yield. The decision-tree random forest and extreme gradient boosting models were accurate to predict bulb yield from the relevant predictors (R² > 90%). As shown by the gain ratio, foliar nutrient standards for nutritionally balanced and high-yielding specimens producing > 50 Mg bulb ha⁻¹ set apart by the ML classification models differed among cultivars. Cultivar × environment interactions support documenting local nutrient diagnosis. The split-N dosage was the most relevant controllable feature to run future universality tests set to assess models’ ability to generalize to growers’ fields.

Plant responses to changing rainfall frequency and intensity

Article 09 April 2024

Hotspots of biogeochemical activity linked to aridity and plant traits across global drylands

Article 12 April 2024

Closing the gap between climate regulation and food security with nano iron oxides

Article 16 April 2024

Introduction

Onion (Allium cepa L.) is the 4th economically most important vegetable crop grown worldwide¹. In Brazil, onion ranks 3rd behind potato and tomato in production volume and economic value. Because onions require long days to initiate and speed up bulb swelling and reduce the maturation period^1,2, cultivars were adapted to site conditions to reach high bulb yield and quality³. Rainfall also impacts onion yields⁴. High temperatures accelerate, while low temperatures delay, bulb formation². The Brazilian national yield average of 26 Mg bulb ha⁻¹⁴ remains far below expectations.

The law of the optimum states that production factors are used most efficiently if all combined at their optimum levels⁵, a challenging goal for growers. Alternatively, growers attempt to adjust their practices through comparisons with successful cases and relying on state recommendations from soil and tissue test results⁶. However, climatic conditions, fertilization, soil quality, irrigation, soil management and crop rotation systems are key factors of success that vary widely among agroecosystems. Uncertainty in optimum nutrient dosage often leads growers to apply ‘insurance’ fertilization against the risk of yield loss⁷. Excessive fertilization leads not only to economic loss but also to increased incidence of diseases^8,9,10,11, product loss during storage, and environmental damage such as nitrate leaching and N₂O emissions¹² and surface water eutrophication by phosphates¹³. Field trials conducted under the ceteris paribus assumption form the backbone of sound fertilizer dosage. Such assumption no more holds at the step of assembling multi-environmental field trials due to highly variable site-specific features. Nevertheless, well-documented trials can be assembled into large databases and decrypted using powerful tools of artificial intelligence to support wise decisions on site-specific fertilization.

The traditional objective of conducting fertilizer trials is to define critical and maintenance soil test levels to “feed the plant” (sufficiency levels of available nutrients) or to “feed the soil” (basic cation sufficiency ratios; nutrient buildup and maintenance)^14,15,16. In Brazil, the concept to “feed the plant” for N fertilization involved the contribution of soil organic matter content to the nitrogen budget of the agroecosystem¹⁷. The concept to “feed the soil” for P and K fertilization relies, respectively, on clay content and soil test P, and on cation exchange capacity (CEC) and soil test K. The clay content is assumed to be related to the soil P fixing capacity controlling P-use efficiency¹⁸. The CEC implied that soil test K should be maintained at a ‘High’ soil test level despite high risks of K leaching. The CEC can be computed from exchangeable cations and exchangeable acidity.

While yield-impacting features interact in agroecosystems, testing myriads of interactions between fertilizer management and environmental and managerial features would be a gigantic task. Machine learning (ML) decision trees such as random forest and extreme gradient boosting are commonly used non-parametric data-processing methods that can address multivariate interacting effects in high-dimensional databases^19,20,21. On the other hand, the classical tissue test interpretation has long been criticized for not considering nutrient interactions²². This is especially important for onions, a high S-demanding crop, where cross-talks between sulfur and cationic micronutrients as modulated by mycorrhizae²³ are common²⁴. Nutrient interactions and cross-talks are generally represented by pairwise ratios^25,26. The centered log ratio ($clr$) transformation is a multi-ratio that expands pairwise ratio by adjusting any nutrient level to the geometric mean across nutrients. The log-ratio transformation can control numerical biases caused by spurious correlations in the statistical analysis of compositional data²⁷. The clr transformation thus allowed to compute means and variances unbiasedly. Nonetheless, decision-tree machine learning methods could handle nutrient interactions in onion tissue with no need for data transformation.

Fertilizer recommendations, above all nitrogen, have been puzzling for decades without agreement on which methodology is the best to balance environmental and economic outcomes²⁴. We hypothesized that (1) a minimum dataset of features easy to document by stakeholders suffice to predict onion yield accurately using machine learning methods, and (2) tissue nutrient standards depend on cultivar × environment interactions. Our objective was to evaluate the capacity of machine learning models to predict onion yields and to derive tissue nutrient standards for onions under Brazilian conditions.

Results

Model performance to predict bulb yields

Features to model bulb yields were cultivar, soil management, cropping system, previous crop, fertilization (N-P-K), soil test results (clay content, CEC, organic matter, pH, nutrients) and climatic variables (length of the growing season, rainfall, SDI, cumulated degree-days, date of crop establishment), as described in Table 6. As shown by the RReliefF scores (Fig. 1), the split-N dosage as well as soil micronutrient and S tests were the most relevant features in relation to bulb yield. Other features were weaker predictors. Soil test Fe reflects the presence of weatherable minerals (Fe) in Cambisols. Fertilizer S, B, Zn and Mn are applied at planting or sowing or as foliar sprays, hence accumulating in the soil. The Cu, Zn, and Mn applied as fungicides contribute to their accumulation in the soil.

Learners were similarly accurate (R² > 0.90) to predict marketable bulb yields using either all features or a minimum data set of the most relevant features (Table 1 and Fig. 2). Non-climatic features readily available to stakeholders at the beginning of the growing season apparently sufficed to make accurate predictions of marketable yields and draw nutrient response models. The P and K dosage showed little contribution to bulb yield prediction. The N dosage was the most relevant controllable feature.

Table 1 Accuracy of machine learning models to predict bulb yields.

Full size table

Tissue nutrient standards

Features used to run ML classification models and compute tissue nutrient standards comprised cultivars and tissue tests. Random forest, and extreme gradient boosting returned values for area under curve (AUC) and classification accuracy (CA) at 50 Mg ha⁻¹ yield cutoff (Table 2). The AUC and CA (> 90%) were high whether raw concentrations or centered log ratios were used as features, indicating that the ML models handled nutrient interactions efficiently.

Table 2 Area under curve (AUC) and classification accuracy (CA) for machine learning models at yield cutoff of 50 Mg ha⁻¹ using raw concentrations or clr values as features.

Full size table

The gain ratio showed that cultivars impacted yield more than nutrient compositions (Fig. 3). As shown by gain ratios, sulfur, phosphorus, and micronutrients impacted the ionomes of cultivars, indicating genetics × environment interactions. Indeed, several ranges of centered log ratios ($clr$) used to compute nutrient standards did not overlap among cultivars (Table 3). The back-transformed $clr$ means at high yield levels indicated differences among cultivars, especially for P, S and micronutrients. Lower and upper quartiles of nutrient concentrations among true negative specimens are presented by cultivar in Table 4. The corresponding median values of soil properties is presented in Table 5. Obviously, tissue compositions can be impacted not only by the genetic background of cultivars but also by differential soil properties.

Table 3 Comparison between tissue compositions of nutritionally balanced cultivars at high yield level (> 50 ton ha⁻¹).

Full size table

Table 4 Lower quartile (LQ) and higher quartile (HQ) of nutrient concentrations for nutritionally balanced onion cultivars producing more than 50 ton ha⁻¹.

Full size table

Table 5 Soil test median values at experimental sites for four cultivars at high yield level (> 50 Mg ha⁻¹).

Full size table

Discussion

Much efforts have been deployed by research groups in southern Brazil to reach growers’ application scale by accounting for soil test, organic matter content, clay content and cation exchange capacity¹⁷. In the present research, we also considered cultivar, soil and crop management, climatic indices, and tissue tests. Machine learning models using features readily available to the stakeholders were found to be accurate.

Nitrogen recommendations

The nitrogen demand by onions was found to depend on bulb yield, cultivar, tissue nutrient levels, soil properties and fertilizer timing and placement, and thus needed to be calibrated locally¹⁰. Although OMC did not appear as relevant feature in relation to bulb yield as shown by its low RRelieffF score, OMC may impact N fertilizer recommendations. The N fertilization of onions in southern Brazil was adjusted to local conditions by accounting for organic matter content (OMC) (120, 100 and $\le$ 80 kg N ha⁻¹ for OMC of 2.5%, 2.5–5% and > 5%, respectively) and at a rate of 4 kg N ton⁻¹ for yield expectations exceeding 30 Mg ha⁻¹¹⁷. Because OMC was included as feature in the ML model, OMC may impact the response models in future universality tests. While optimum N fertilization may vary locally from 157 to more than 200 kg N ha⁻¹^28,29,30, the N dosage must minimize yield loss²⁸. In Cambisols of Santa Catarina, the best economic yield was reached applying 249 kg N ha⁻¹ in a sandy soil of low organic matter content, and 116–142 kg N ha⁻¹ in clayey soils of medium organic matter content³¹.

Boyhan et al.³² reported that N recommendations for onions at maximum yield in Georgia, USA, were 95–123 kg N ha⁻¹ higher than the recommended N rates of 140–168 kg N ha⁻¹. In contrast, maximum bulb yield of 52 Mg ha⁻¹ on a Thermic Plinthic Paleudult was reached applying 263 kg N ha⁻¹, as suggested by a quadratic model. However, yield differences were not significant applying 263 kg N ha⁻¹ or 140 to 168 kg N ha⁻¹, indicating random variation of onion yields on the plateau and high risk of overfertilization using the quadratic model. Initiating the model close to the observed optimum rate near the yield plateau can avoid that problem of overestimation. Quadratic response models initiated at zero-N depends on the flatness of the slope and may lead to over-fertilization supporting speculative ‘insurance’ decisions³³. Controlling the trajectory of the quadratic model using an economic constraint alone, the recommended N rate for ‘Optima F1’ in Minas Gerais state, Brazil, was found to be 148 kg N ha⁻¹³⁴.

Although the N dosage can vary widely under different growing conditions the number of N trials was limited (25) in the present study compared the 93 and 461 multi-environmental N fertilizer trials to run ML models on potato (Solanum tuberosum)³⁵ and maize (Zea mays)³⁶, respectively. More trials and universality tests should be conducted to validate model outcomes in growers’ fields.

Phosphorus and potassium recommendations

The P and K features did not appear to relevant enough to run the ML models. Irrigation and features that improves P and K diffusion in the soil increase nutrient use efficiency in tropical soils³⁷. Nevertheless, the number of trials was small for P (5) and K (3) compared to N (25). As a result, more P and K trials should be conducted to support any change in state recommendations¹⁷. State-based recommendations integrate information from available field trials, local knowledge, and agronomic expertise.

The P dosage is generally high in tropical soils due to high soil P-fixing capacity and the limited root system of onions³⁸. The clay content is representative of P fixing capacity and is integrated into the Brazilian P recommendation scheme¹⁷. The P_Mehlich-1/clay ratio (Mehlich-1 extraction method) could also be used as soil test similar to the [P/(Al + Fe ratio)]_Mehlich-3 (Mehlich-3 extraction method) currently used in North America^{13,39,40,41,42}. In a low-P Humic Dystrophic Cambisol (6.9 mg P-Mehlich1 dm⁻³ and 24% clay), onion responded linearly to P fertilization in the range of 0 to 210 kg P ha⁻¹ at yield levels up to 45 ton ha⁻¹^28,38. In a medium-P dystrophic red-yellow Latosol (9.1 mg P-Mehlich1 dm⁻³ and 26% clay), onion responded non-linearly to added P up to ≈131 kg P ha⁻¹ at yield levels of 36–40 ton ha⁻¹⁴³. In a low-P dystrophic red-yellow Latosol (23.8 mg P-Mehlich1 dm⁻³), onion responded non-linearly to added P in the range of 27 to 80 kg P ha⁻¹ at yield levels of 75–76 Mg ha⁻¹⁴⁴. Those results may fit state recommendations²¹ if the yield level is considered. The split of P fertilization may improve P-use efficiency, especially in high P-fixing soils⁴⁵. On the other hand, onion P uptake is facilitated by the positive effect of irrigation on the P diffusion process⁴⁶. The P dosage using the efficiency coefficient of fertilizer P alone¹⁸ and disregarding water supply that facilitates P diffusion in the soil could thus lead to overfertilization³⁷. Moreover, colonization of onion roots by arbuscular mycorrhiza fungi (AMF) can regulate the P uptake by exploring a larger volume of soil⁴⁷.

The K dosage is most often prescribed to ‘feed the soil’ depending on the selected maintenance soil test K level and the CEC. In a soil containing 77 mg K-Mehlich1 dm⁻³ and showing CEC of 7 cmol_c dm⁻³, onion crops responded non-linearly to added K up to 75 kg K ha⁻¹ at yield level of 66 Mg ha⁻¹⁴³. In a high-K Red-Yellow Argisol showing 97–109 mg K-Mehlich1 dm⁻³ and CEC of 7 cmol_c dm⁻³, onion responded non-linearly to added K up to 150 kg K ha⁻¹ to reach yield levels of 46–54 Mg ha⁻¹⁴⁸. Those results may fit state recommendations²¹ if the yield level is considered. In a work carried out in Santa Catarina state with cultivar Empasc 352 Bola Precoce, 86.5 kg K ha⁻¹ was taken up by the onion crop at yield level of 37 Mg ha⁻¹, accumulating 2.3 kg of K per Mg⁴⁷. While soil K supply capacity also depends on soil mineralogy⁴⁹, the K release from minerals that contributes to plant K uptake requires conducting fertilizer trials⁵⁰. Large discrepancies may thus occur among K recommendation systems.

Tissue diagnosis

In the present study, we suggested ranges of tissue nutrient levels as nutrient standards to conduct nutrient-by-nutrient diagnosis. S-É Parent⁵¹ suggested using a concept of reachable hyper-islands or ‘hyper-blobs’ each representing multivariate combinations of successful conditions compared to those of defective specimens. Using KNN as machine learning model, compositional proximity was shown as an Euclidean distance between the composition of the diagnosed specimen and that of its successful neighbors⁵². Benchmark blobs were also called ‘Enchanting Islands’⁵³, ‘Humboldtian loci’⁵⁴, and ‘Ilhas Encantadas’ in Portuguese⁵⁵. This emphasizes the need to diagnose tissue nutrient compositions holistically rather than separately^56,57.

Need for large and diversified databases

Large and diversified experimental and observational data sets must be acquired by stakeholders to cross-over the numerous combinations of crop-impacting features in onion agroecosystems^57,58,59,60. Kyveryga et al.³³ stated that the development of new nutrient calibration procedures has been limited by the inability in the past to collect a sufficient number of yield responses to enable calculating reliable economic optimum rates. To follow-up on model predictions, universality tests are needed to verify the reliability of model outcomes in growers’ fields^36,61. The prediction of N dosage can be conducted as shown in S4 by providing the site-specific feature and drawing a response curve predicted from those features. Such tests require close collaboration with growers to facilitate the acceptance of a site-specific fertilizer program and update the database.

Precision farming technologies could allow collecting trustful data at low cost in growers’ fields. Efforts to develop technological tools of precision agriculture for site-specific fertilization have been limited by non-specific state-based fertilizer recommendations. For some high-valued crops like maize, the nitrogen dosage can be adjusted to local factors using ML methods³⁶. Observational and experimental data sets could be further combined and processed by machine learning to customize nutrient management for a given set of controllable and uncontrollable features⁶². In this paper, accurate ML learners processed a minimum data set to support wise decisions for the feature-specific N fertilization in onion agroecosystems of southern Brazil.

Conclusions

This paper addressed onion nutrient management at local scale. We assembled the results of fertilizer experiments conducted between 2007 and 2020 in Santa Catarina state, the major onion production region in Brazil. We showed that decision-tree machine learning models can return accurate yield predictions under a set of easy-to-collect features. Key features available to growers before planting or seeding included cultivar, soil management, cropping system, previous crop, fertilization (N-P-K), soil test results (clay content, CEC, organic matter, pH, nutrients) and date of crop establishment. The RReliefF scores revealed that split-N dosage as well as soil test S and micronutrients were the most relevant features to predict onion yield. The accuracy of the regression models reached R² > 90% using random forest and extreme gradient boosting. The N dosage was the most relevant controllable feature to run universality tests in growers’ fields to assess the ability of ML model to generalize.

The accuracy of the classification models also reached R² > 90% using random forest and extreme gradient boosting. The cultivar and tissue nutrients impacted bulb yield, allowing to develop cultivar-specific nutrient standards. Sulfur and micronutrients were the most relevant features to differentiate onion cultivars, indicating cultivar × environment interactions. It is thus advisable to conduct tissue diagnosis considering agroecosystem-specific nutrient standards to reflect cultivar × environment interactions. To set apart genetics and environment, feature-specific cultivar ionomes should be determined in comparable agroecosystems. However, such agroecosystem nutrient standards would require larger and more diversified databases than the one used in this study.

Material and methods

Experimental setup

Fertilizer trials were conducted from 2007 to 2020 in the municipalities of Ituporanga, Atalanta, Lebon Régis and Caçador, Santa Catarina state, Brazil (Fig. 4). The soils of the region are Cambisols, also classified as Nitossolo Bruno Distrophic⁶³, and Typic Hapludox⁶⁴. The subtropical climate is mesothermic and humid with mild summers.According to Köppen’s classification, the climate is classified as Cfa in Ituporanga and Atalanta, and as Cfb in Lebon Régis and Caçador.

Climatic data

Daily precipitations as well as minimum and maximum daily temperatures were obtained from the EPAGRI⁶⁶ meteorological station closest to the trial. Temperature indices were the minimum and maximum seasonal temperatures and the cumulated degree-days with base temperature of 5 °C for cold crops⁶⁴. Rainfall distribution was estimated by the standardized Shannon diversity index (SDI) as follows⁶⁵:

$$SDI=\frac{-{\sum }_{i=1}^{n}{p}_{i}\times ln\left({p}_{i}\right)}{ln\left(n\right)}$$

where ${p}_{i}$ is the fraction of daily rainfall (RAIN) to the rainfall cumulated during the growing period (PPT), i.e. the daily RAIN/PPT ratio, and $n$ is the length of the growing season; SDI = 1 implied that rainfall was uniformly distributed during the indicated period (equal daily amount of rainfall over the selected period); SDI = 0 implied that rainfall was unevenly distributed (total rainfall concentrated in 1 d). Where ${p}_{i}=0$, ${p}_{i}\times ln\left({p}_{i}\right)=0$. Crops were sprinkler irrigated.

Experimental setup

There were 26 N trials, five K trials and three P trials, totaling 1182 observations (Supplementary Material S4). Treatments were arranged as randomized block designs with four replications. In Ituporanga and Atalanta, plots were 4 m long and 3 m wide, and comprised eight rows spaced 35 cm apart. Transplants were spaced 8 cm apart on the row. The population of transplants was approximately 375,000 plants ha⁻¹. Bulbs were harvested in five internal rows 4-m long. In Caçador and Lebon Régis, plots were 5 m long and 2.7 m wide, and comprised nine rows spaced 30 cm apart. Plants were spaced 5.5 cm on the row. The population of seeded onions was 600 000 plants ha⁻¹. Bulbs were harvested at leaf sagging in three double line, 5-m long rows, per plot. The bulbs were left on the field for a pre-curing period of one week, then bagged and stored for weighing and sizing. Bulbs were classified as commercial, non-commercial and harvest loss. Marketable bulbs included #2 (< 50 mm), #3 (50–70 mm), #4 (70–90 mm), and #5 (> 90 mm) bulb categories⁶⁷. Bulbs showing secondary growth or damage were classified as non-marketable.

Fertilizer treatments

The N, P and K treatments were applied separately at increasing rates at each experimental site. The N rates varied from 0 to 370 kg N ha⁻¹ split-applied 45, 80, 110, and 130 days after seeding, 20, 30, 30 and 20% of N broadcast-applied, respectively, or 35, 60, and 85 days after transplanting, 30, 40 and 30% of N broadcast-applied, respectively. The P rates ranged from 0 to 349 kg P ha⁻¹. The K rates varied between 0 and 667 kg K ha⁻¹, split-applied together with the N. Where the rates of N, P and K were varied, the rates of the other nutrients were fixed following state recommendations¹⁷. Fertilizers were in granular form.

The sources of N were ammonium nitrate, urea, ammonium sulfate, algae-coated ammonium sulfate (29% N, 5% Ca, 2% Mg, 9% S, and 0.3% B), azoslow (organo-mineral fertilizer containing 20% C and 29% N as urea and hydrolyzed proteins) or poultry manure (pH of 7.8, 15.9% moisture, 3.5% N, 3.1% P, 2.7% K, 37 mg Cu kg⁻¹, 43 mg Zn kg⁻¹, 73 mg Mn kg⁻¹, and 1160 mg Fe kg⁻¹). The source of N fertilizer may differ among trials. However, we assumed that differences among mineral N sources were negligible due to the rapid conversion of ammonium to nitrate in agricultural soils⁶⁸. The P and K treatments were applied as triple superphosphate and potassium chloride¹⁷. The N and K were split at up to four occasions during the season, i.e., at planting and 35, 60, 85 or 90 d later for transplants, or at planting and 45, 80, 110 or 130 d later for seeded onions¹⁷. The P was applied entirely at planting.

Soil analysis

Soils were sampled in the 0–20 cm layer 45–60 days before planting across the experimental area, then composited. Soils were dried in a forced-air oven at 65 °C then ground to less than 2 mm. Chemical analyses were conducted as follows¹⁷: pH in 1:2.5 soil-to-water volumetric ratio, clay by sedimentation, Mehlich-1 extraction for P and K, and EDTA-extraction for cationic micronutrients. Elements were quantified by colorimetry for P and B, flame photometry for K, turbidimetry for S, and atomic absorption spectrophotometry for Ca, Mg, Cu, Fe, Mn, and Zn. Total carbon was quantified by dichromate oxidation (Walkley–Black procedure) then multiplied by 1.724 to derive organic matter content. Base saturation was computed as the sum of cationic species (cmol_c kg⁻¹) divided by CEC computed as the sum of exchangeable cations and acidity. Exchangeable acidity was assessed as follows⁶⁹:

$$\left(Exchangeable\; acidity\right)=10exp\left(7.76+1.053{\times pH}_{SMP}\right),\;\; {\text{R}}^2 = 0.98$$

Tissue analysis

After planting, leaf analysis, based on appropriate sampling methods and correct interpretation of analytical data, is a reliable tool for assessing the nutritional status of perennial plants and their response to fertilizers⁶⁹. Ten young fully expanded leaves were collected in each plot at the beginning of plant differentiation into bulb¹⁷, i.e. 70 to 75 d after transplanting and 115 to 128 d after sowing, depending on year and cultivar. Tissue samples were composited per plot for chemical analysis. The leaves were cleaned gently under distilled water then dried at 65 ± 5 °C and ground to less than 1 mm. Total N was quantified by micro-Kjeldahl. Tissue samples were digested in a mixture of nitric and perchloric acids then analyzed by colorimetry for P and B, flame photometry for K, turbidimetry for S, and atomic absorption spectrophotometry for Ca, Mg, Cu, Fe, Mn, and Zn^70,71.

Statistical analysis

Log ratio transformation

Concentrations are parts of a compositional vector constrained to the compositional space⁶⁸ such as 1000 g kg⁻¹ for tissue tests. The compositional space for cationic species could also be defined as cmol_c kg⁻¹ and constrained to CEC. Conducting parametric statistical analyses using raw concentrations produces numerical biases that may lead to sums of components in statistical results that differ from measurement unit (e.g., sums of sand + silt + clay different than 100% after conducting ANOVA). Moreover, ignoring nutrient interactions may decrease the accuracy of nutrient diagnosis using parametric methods^37,55.

In contrast, $clr$ values are relative expressions allowing compositions to move from the constrained compositional space to the unconstrained real space ($\pm \infty$) that is required to run statistical analyses. Nutrient concentrations are constrained to the measurement unit using a filling value ${F}_{v}$ computed by difference as follows using a measurement unit in g kg⁻¹:

$${F}_{v}=1000-\sum \limits_{i=1}^{D}{c}_{i}$$

where D is the number of parts including the filling value, and ${c}_{i}$ is the concentration of each nutrient and the filling value. The centered log ratio centers any concentration against the geometric mean across parts [$clr=ln\left({x}_{i}/G\right)$], hence accounting for all pairwise ratios that reflect nutrient interactions and cross-talks^24,69, as follows for nitrogen (N):

$$\begin{aligned} {clr}_{N}&=ln\left(\frac{N}{G}\right)=ln\left(\frac{N}{{\left(N\times P\times K\times Ca\times Mg\times S\times B\times Cu\times Zn\times Mn\times Fe\times {F}_{v}\right)}^{1/D}}\right) \\ & =\frac{1}{D}\left[ln\left(\frac{N}{N}\right)+ln\left(\frac{N}{P}\right)+ln\left(\frac{N}{K}\right)+ln\left(\frac{N}{Ca}\right)+ln\left(\frac{N}{Mg}\right)+ln\left(\frac{N}{S}\right)+ln\left(\frac{N}{B}\right)+ln\left(\frac{N}{Cu}\right)+ln\left(\frac{N}{Zn}\right)+ln\left(\frac{N}{Mn}\right)+ln\left(\frac{N}{Fe}\right)+ln\left(\frac{N}{{F}_{v}}\right)\right]\end{aligned}$$

Because the $clr$ values are computed about the geometric mean, the sum of $clr$ values is zero. The mean $clr$ value for component i can be back transformed into its concentration value ${x}_{i}$ as follows:

1.
${exp}_{{x}_{i}}=exp\left({clr}_{{x}_{i}}\right)$
2.
${x}_{i}=\frac{{\kappa \times exp}_{{x}_{i}}}{{\sum }_{i=1}^{D}{exp}_{{x}_{i}}}$

Where exp is the exponential transformation of the centered log ratio and $\kappa$ is the unit of measurement (e.g., 1000 g kg⁻¹) to force closure to the measurement unit (here, g kg⁻¹).

The clr variables have Euclidean geometry. The diagnosed composition can thus be compared to the composition of the closest successful neighbors (high-yielding and nutritionally balanced specimens) as the ones showing the shortest Euclidean distance $\varepsilon$ from the diagnosed composition computed as follows:

$$\varepsilon =\sqrt{\sum \limits_{k=1}^{D}{\left({clr}_{i}-{clr}_{i}^{*}\right)}^{2}}$$

where ${clr}_{i}$ is the $clr$ value of component $i$ of the diagnosed composition, and ${clr}_{i}^{*}$ is the clr value of component $i$ of a close successful compositional neighbor. In Brazil, clr indices are widely used to diagnose the plant nutrient status⁷² using $clr$ reference values⁷³. Tissue nutrient indices (${I}_{{x}_{i}}$) are differences between diagnosed $clr$ value (${clr}_{{x}_{i}}$) and the $clr$ mean (${clr}_{{x}_{i}}^{*}$) for true negative specimens (TN) weighted by the standard deviation (${SD}_{{x}_{i}}^{*}$), computed as follows⁷⁴:

$${I}_{{x}_{i}}=\frac{{clr}_{{x}_{i}}-{clr}_{{x}_{i}}^{*}}{{SD}_{{x}_{i}}^{*}}$$

Nutrient indices can be displayed in a histogram to indicate relative excess or shortage of nutrients, respectively. The nutrient standards for high-yielding and nutritionally balanced specimens can be computed regionally (e.g., across the surveyed area), or from a selection of close compositional neighbors.

Machine learning models

Several machine learning (ML) models can be tested using the Orange data mining freeware vs. 3.29. In the ML models, the target variable was marketable bulb yield. Features were climatic indices, nutrient dosage, soil and tissue analyses, cultivar, crop establishment (direct seeding or manual transplanting), soil management, municipality, climatic indices, date of stand establishment, and harvest date (source), as described in Table 6.

Table 6 List of candidate features in the onion data set of Santa Catarina state, Brazil.

Full size table

Summaries of tissue and soil test results used as features are presented in Table 3 and Supplementary Material S3, respectively. Other features were managerial or climatic. ‘Empasc 352 Bola Precoce’ and ‘SCS373 Valessul’ are short-day cultivars requiring 11–13 h to initiate bulbification. Median-day cultivars requiring 13–15 h to initiate bulbation were ‘’Epagri 362 Crioula Alto Vale, ‘Mulata’, ‘Omega’ and ‘Caeté’. We discarded ‘Bola Precoce’ specimens because tissue analysis for sulfur was absent. Onions were seeded or transplanted. Crops were established by direct seeding or were transplanted manually. Stand establishment, soil management and previous crops are reported in Supplementary Material S4. Previous crops were black oat (Avena sativa), millet (Pennisetum glaucum), sweet potato (Ipomoea batatas), tobacco (Nicotiana tabacum), corn (Zea mays), cowpea (Vigna unguiculata (L.) Walp.), velvet bean (Mucuna aterrima) and millet (P. glaucum). Preceding crops varied among years and locations. Climatic conditions varied widely at experimental sites as shown in Supplementary Material S5. The importance of features in relation to bulb yield was measured as RReliefF ranking scores⁷⁵. The RReliefF algorithm computes a difference between actual and predicted values in regression problems based on the nearest neighbor paradigm after considering feature interactions.

Two decision-tree ML regression models were tested among more than 100 variants commonly used in soil science^40,66, i.e., random forest and extreme gradient boosting, both available in the Orange Data Mining freeware v. 3.39.0 programmed in the Python language (University of Ljubljana, Ljubljana, Slovenia). The Python algorithms are encoded into icons and arrows. The scheme of icons and arrows is presented in Supplementary Materials S1 and S2. There were several missing data in the dataset (13%). The dataset was thus rebalanced by model-based imputation using the random forest imputation method^76,77.

Decision-tree models separate two subsets recursively about cutoff points that minimize the variance of the target variable until a minimum number of instances is reached. Random forest and extreme gradient boosting are structurally different. Random forest is a bagging model that averages predictions made by sampling with replacement. We selected 10 trees per bag at each run. Extreme gradient boosting is a variant of the tree-based ensemble gradient boosting method that combines weak predictive models to minimize prediction error. The extreme gradient boosting creates and adds trees of learners sequentially to correct the weakness of the preceding estimators. We selected 100 trees as basic property.

The partition between the training and testing datasets was conducted by stratified random sampling. The population of data comprised subgroups of categorial variables or strata. Data were randomly sampled within each strata. This avoids sampling data from the same strata during the partition between the training set and testing sets. Otherwise, complete random sampling leads to model overfitting. The train/test partitions were repeated 100 times, and model accuracy was averaged. The accuracy of the partition between the training and the testing sets reached a plateau at 70:30. Such partition was thus selected to process the data.

The regression ML model returns a relationship between the actual and the predicted starget variable. Model accuracy is reported as root mean squared error (RMSE), median absolute error (MAE), and coefficient of determination or R². Model strength is substantial if R² is > 75%⁷⁸. The classification mode returns a confusion matrix where specimens are classified into four quadrants: true negative (yield above cutoff, nutritionally balanced composition), false negative (yield below cutoff, nutritionally balanced composition), false positive (yield above cutoff, nutritionally imbalanced composition) and true positive (yield below cutoff, nutritionally imbalanced composition). True negative specimens provided a set of successful features to compute tissue nutrient standards amongst others. The accuracy of the classification model is measured by the area under curve and the classification accuracy.

Data availability

All data and the model used for analysis are available at Zenodo DOI https://doi.org/10.5281/zenodo.10615658. The experimental research and field studies on plants in this work strictly comply with relevant institutional, national and international guidelines and legislation.

References

Torquato-Tavares, A., Pascual-Reyes, I. D., Barros-Milhomens, K. K., Alves-Ferreira, T. & Rodrigues-do-Nascimento, I. Planting dates of Allium cepa L. hybrids in Gurupi, Tocantins, Brazil. Rev. Chapingo Ser. Hortic. 43, 123–133 (2017).
Article Google Scholar
Bachie, O. G., Santiago, L. S. & McGiffen, M. E. Physiological responses of onion varieties to varying photoperiod and temperature regimes. Agriculture 9, 214 (2019).
Article CAS Google Scholar
Cardoso, A. I. I. & da Costa, C. P. Selection for bulb maturity in onion. Sci. Agric. 60, 59–63 (2003).
Article Google Scholar
Souza, M. et al. Soil chemical properties and yield of onion crops grown for eight years under no-tillage system with cover crops. Soil Till. Res. 208, 104897 (2021).
Article Google Scholar
de Wit, C. T. Resource use efficiency in agriculture. Agric. Syst. 40, 125–151 (1992).
Article Google Scholar
Amare, G. Review on mineral nutrition of onion (Allium cepa L). Open Biotechnol. J. 14, 134–144 (2020).
Article CAS Google Scholar
Kyveryga, P. M., Blackmer, T. M. & Caragea, P. C. Categorical analysis of spatial variability in economic yield response of corn to nitrogen fertilization. Agron. J. 103, 796–804 (2011).
Article Google Scholar
Martinez, D. A., Loening, U. E., Graham, M. C. & Gathorne-Hardy, A. When the medicine feeds the problem; do nitrogen fertilisers and pesticides enhance the nutritional quality of crops for their pests and pathogens?. Front. Sustain. Food Syst. 5, 234 (2021).
Article Google Scholar
Díaz-Pérez, J. C., Bautista, J., Gunawan, G., Bateman, A. & Riner, C. M. Sweet onion (Allium cepa L.) as influenced by organic fertilization rate: 2. Bulb yield and quality before and after storage. HortScience 53, 459–464 (2018).
Article Google Scholar
Geisseler, D., Ortiz, R. S. & Diaz, J. Nitrogen nutrition and fertilization of onions (Allium cepa L.)—A literature review. Sci. Hortic. 291, 110591 (2022).
Article CAS Google Scholar
Kurtz, C., Ernani, P. R., Pauletti, V., de Menezes Junior, F. O. G. & Vieira Neto, J. Produtividade e conservação de cebola afetadas pela adubação nitrogenada no sistema de plantio direto. Hortic. Bras. 31, 559–567 (2013).
Article Google Scholar
Stewart, B. A. & Lal, R. The nitrogen dilemma: Food or the environment. J. Soil Water Conserv. 72, 124A-128A (2017).
Article Google Scholar
Pellerin, A. et al. Environmental Mehlich-III soil phosphorus saturation indices for Quebec acid to near neutral mineral soils varying in texture and genesis. Can. J. Soil Sci. 86, 711–723 (2006).
Article CAS Google Scholar
Nelson, L. A. & Anderson, R. L. Partitioning of soil test-crop response probability. In Soil Testing: Correlating and Interpreting the Analytical Results Vol. 1 19–38 (Wiley, 1984).
Google Scholar
McLean, E. O. Contrasting concepts in soil test interpretation: Sufficiency levels of available nutrients versus basic cation saturation ratios. In Soil Testing: Correlating and Interpreting the Analytical Results Vol. 1 39–54 (Wiley, 1984).
Google Scholar
Culman, S., Fulford, A., Camberato, J. & Steinke, K. Tri-State Fertilizer Recommendations. Bulletin 974 (College of Food, Agricultural, and Environmental Sciences, 2020).
CQFS-RS/SC. Manual de calagem e adubação para os Estados de Rio Grande do Sul e de Santa Catarina. (Sociedade Brasileira de Ciência do Solo, 2016).
dos Santos, F. C., Neves, J. C. L., Novais, R. F., Alvarez, V. V. H. & Sediyama, C. S. Modeling lime and fertilizer recommendations for soybean. Rev. Bras. Ciência do Solo 32, 1661–1674 (2008).
Article Google Scholar
Chlingaryan, A., Sukkarieh, S. & Whelan, B. Machine learning approaches for crop yield prediction and nitrogen status estimation in precision agriculture: A review. Comput. Electron. Agric. 151, 61–69 (2018).
Article Google Scholar
Huynh-Thu, V. A. & Geurts, P. Unsupervised gene network inference with decision trees and random forests. In Gene Regulatory Networks; Methods in Molecular Biology (eds Sanguinetti, G. & Huynh-Thu, V.) 195–215 (Humana Press, 2019). https://doi.org/10.1007/978-1-4939-8882-2_8.
Chapter Google Scholar
Padarian, J., Minasny, B. & McBratney, A. B. Machine learning and soil sciences: A review aided by machine learning tools. Soil 6, 35–52 (2020).
Article ADS CAS Google Scholar
Bates, T. E. Factors affecting critical nutrient concentrations in plants and their evaluation: A review. Soil Sci. 112, 116–130 (1971).
Article ADS CAS Google Scholar
de Oliveira, R. A. et al. Release of phosphorus forms from cover crop residues in agroecological no-till onion production. Rev. Bras. Ciência do Solo 41, 160272 (2017).
Google Scholar
Mandrini, G., Archontoulis, S. V., Pittelkow, C. M., Mieno, T. & Martin, N. F. Simulated dataset of corn response to nitrogen over thousands of fields and multiple years in Illinois. Data Br. 40, 107753 (2022).
Article CAS Google Scholar
Kenworthy, A. L. Plant analysis and interpretation of analysis for horticultural crops. In Soil Testing and Plant Analysis (eds Hamilton, H. & Stelly, M.) 59–75 (Soil Science Society of America, 1967).
Google Scholar
Courbet, G. et al. Disentangling the complexity and diversity of crosstalk between sulfur and other mineral nutrients in cultivated plants. J. Exp. Bot. 70, 4183–4196 (2019).
Article CAS PubMed Google Scholar
Aitchison, J. The statistical analysis of compositional data. J. R. Stat. Soc. Ser. B 44, 139–160 (1982).
MathSciNet Google Scholar
de Resende, G. M. & Costa, N. D. Effects of levels of potassium and nitrogen on yields and post-harvest conservation of onions in winter. Rev. Ceres 61, 572–577 (2014).
Article Google Scholar
Kurtz, C., Pauletti, V., Fayad, J. A. & Neto, J. V. Crescimento e absorção de nutrientes pela cultivar de cebola Bola Precoce. Hortic. Bras. 34, 279–288 (2016).
Article CAS Google Scholar
Rodrigues, G. S. D. O. et al. Onion yield as a function of nitrogen dose. Rev. Ciências Agrárias 41, 46–51 (2018).
Article Google Scholar
Tremblay, N. et al. Corn response to nitrogen is influenced by soil texture and weather. Agron. J. 104, 1658–1671 (2012).
Article Google Scholar
Boyhan, G. E., Torrance, R. L. & Hill, C. R. Effects of nitrogen, phosphorus, and potassium rates and fertilizer sources on yield and leaf nutrient status of short-day onions. HortScience 42, 653–660 (2007).
Article CAS Google Scholar
Kyveryga, P. M., Blackmer, A. M. & Morris, T. F. Alternative benchmarks for economically optimal rates of nitrogen fertilization for corn. Agron. J. 99, 1057–1065 (2007).
Article Google Scholar
Vidigal, S. M., Pedrosa, M. W., Fonseca, M. S. & Santos, I. C. Adubação com nitrogênio em cobertura na produção de cebola. Hortic. Bras. 28, 3705–3711 (2010).
Google Scholar
Parent, S. -É., Leblanc, M. A., Parent, A.-C., Coulibali, Z. & Parent, L. E. Site-specific multilevel modeling of potato response to nitrogen fertilization. Front. Environ. Sci. https://doi.org/10.3389/fenvs.2017.00081 (2017).
Article Google Scholar
Parent, L. E. & Deslauriers, G. Simulating maize response to split-nitrogen fertilization using easy-to-collect local features. Nitrogen 4, 331–349 (2023).
Article CAS Google Scholar
Nowaki, R. H. D. et al. Phosphorus over-fertilization and nutrient misbalance of irrigated tomato crops in Brazil. Front. Plant Sci. 8, 825 (2017).
Article PubMed PubMed Central Google Scholar
Weingartner, S., Gatiboni, L. C., Dall’Orsoletta, D. J., Kurtz, C. & Mussi, M. Rates and localization of phosphorus fertilizer on onion yield. Rev. Ciências Agroveterinárias 17, 23–29 (2018).
Article Google Scholar
Khiari, L. et al. An agri-environmental phosphorus saturation index for acid coarse-textured soils. J. Environ. Qual. 29, 1561–1567 (2000).
Article CAS Google Scholar
Sims, J. T., Maguire, R. O., Leytem, A. B., Gartley, K. L. & Pautler, M. C. Evaluation of Mehlich 3 as an agri-environmental soil phosphorus test for the mid-Atlantic United States of America. Soil Sci. Soc. Am. J. 66, 2016–2032 (2002).
Article ADS CAS Google Scholar
Guérin, J., Parent, L. -É. & Abdelhafid, R. Agri-environmental thresholds using Mehlich III soil phosphorus saturation index for vegetables in histosols. J. Environ. Qual. 36, 975–982 (2007).
Article PubMed Google Scholar
Leblanc, M. A., Parent, L. E. & Gagné, G. Phosphate and nitrate release from mucky mineral soils. Open J. Soil Sci. 03, 107–114 (2013).
Article Google Scholar
da Silva, L. L., Tavares, A. T., Nascimento, I. R., Milhomem, K. K. B. & dos Santos, J. L. Crescimento vegetativo e teor de fósforo em cultivares de cebola. Rev. Bras. Tecnol. Apl. Nas Ciências Agrárias 10, 7–14 (2017).
Google Scholar
de Resende, G. M., Costa, N. D. & Yuri, J. E. Efeito de doses de fósforo na produtividade e armazenamento pós-colheita de dois cultivares de cebola. Rev. Ceres 63, 249–255 (2016).
Article Google Scholar
de Aquino, R. F. B. A. et al. Split fertilization of phosphate in onion as strategy to improve the phopsphorus use efficiency. Sci. Hortic. 290, 110494 (2021).
Article Google Scholar
Barber, S. A. Soil Nutrient Bioavailability: A Mechanistic Approach. (1995).
Golubkina, N. et al. Prospects of arbuscular mycorrhizal fungi utilization in production of allium plants. Plants 9, 279 (2020).
Article CAS PubMed PubMed Central Google Scholar
Marrocos, S. D. T., Grangeiro, L. C., de Sousa, V. D. F. L., Ribeiro, R. M. P. & Cordeiro, C. J. Potassium fertilization for optimization of onion production. Rev. Caatinga 31, 379–384 (2018).
Article Google Scholar
Goli-Kalanpa, E., Roozitalab, M. H. & Malakouti, M. J. Potassium availability as related to clay mineralogy and rates of potassium application. Commun. Soil Sci. Plant Anal. 39, 2721–2733 (2008).
Article CAS Google Scholar
Breker, J. S. et al. Potassium requirements for corn in North Dakota: Influence of clay mineralogy. Soil Sci. Soc. Am. J. 83, 429–436 (2019).
Article ADS CAS Google Scholar
Parent, S.-É. Why we should use balances and machine learning to diagnose ionomes. Authorea 1, (2020).
Yamane, D. R. et al. Site-specific nutrient diagnosis of orange groves. Horticulturae 8, 1126 (2022).
Article Google Scholar
Coulibali, Z., Cambouris, A. N. & Parent, S. -É. Cultivar-specific nutritional status of potato (Solanum tuberosum L.) crops. PLoS ONE 15, e0230458 (2020).
Article CAS PubMed PubMed Central Google Scholar
Betemps, D. L. et al. humboldtian diagnosis of peach tree (Prunus persica) nutrition using machine-learning and compositional methods. Agronomy 10, 900 (2020).
Article CAS Google Scholar
Paula, B. V., Squizani Arruda, W., Etienne Parent, L., Frank de Araujo, E. & Brunetto, G. Nutrient diagnosis of eucalyptus at the factor-specific level using machine learning and compositional methods. Plants 9, 1049 (2020).
Article Google Scholar
Parent, S. -É., Parent, L. E., Rozane, D.-E. & Natale, W. Plant ionome diagnosis using sound balances: Case study with mango (Mangifera indica). Front. Plant Sci. 4, 449 (2013).
Article PubMed PubMed Central Google Scholar
Morris, T. F. et al. Strengths and limitations of nitrogen rate recommendations for corn and opportunities for improvement. Agron. J. 110, 1–37 (2018).
Article Google Scholar
Kyveryga, P. M., Caragea, P. C., Kaiser, M. S. & Blackmer, T. M. Predicting risk from reducing nitrogen fertilization using hierarchical models and on-farm data. Agron. J. 105, 85–94 (2013).
Article CAS Google Scholar
Anderson, C. J. & Kyveryga, P. M. Combining on-farm and climate data for risk management of nitrogen decisions. Clim. Risk Manag. 13, 10–18 (2016).
Article Google Scholar
Liu, S., Yang, X., Guan, Q., Lu, Z. & Lu, J. An ensemble modeling framework for distinguishing nitrogen, phosphorous and potassium deficiencies in winter oilseed rape (Brassica napus L.) using hyperspectral data. Remote Sens. 12, 4060 (2020).
Article ADS Google Scholar
Sinclair, T. R. & Seligman, N. Criteria for publishing papers on crop modeling. F. Crop. Res. 68, 165–172 (2000).
Article Google Scholar
Parent, S. -É., Lafond, J., Paré, M. C., Parent, L. E. & Ziadi, N. Conditioning machine learning models to adjust lowbush blueberry crop management to the local agroecosystem. Plants 9, 1401 (2020).
Article CAS PubMed PubMed Central Google Scholar
Santos, H. G. Sistema Brasileiro de Classificação de Solos. (2018).
Soil Survey Staff. Soil Survey Staff - Keys to Soil Taxonomy. (United States Department of Agriculture Handbook, 2017).
QGIS development team. QGIS. Open source (2024).
EPAGRI. EPAGRI/CIRAM-Agroconnect. Centro de informações ambientais e hidro meteorológicas de Santa Catarina https://ciram.epagri.sc.gov.br/agroconnect/ (2021).
MAPA. Portaria 529 - Norma de identidade, qualidade, acondicionamento, embalagens e apresentação da cebola. (Ministério da Agricultura, Pecuária e Abastecimento, 1995).
Norton, J. & Ouyang, Y. Controls and adaptive management of nitrification in agricultural soils. Front. Microbiol. https://doi.org/10.3389/fmicb.2019.01931 (2019).
Article PubMed PubMed Central Google Scholar
Bould, C., Bradfield, E. G. & Clarke, G. M. Leaf analysis as a guide to the nutrition of fruit crops. I.—general principles, sampling techniques and analytical methods. J. Sci. Food Agric. 11, 229–242 (1960).
Article CAS Google Scholar
Government of Canada. Cool wave days for cool season/overwintering crops (< 5 °C). https://open.canada.ca/data/en/dataset/1687cac6-ee13-4866-ab8a-114c2ede7b13 (2021).
Tedesco, M. J., Gianello, C., Bissani, C. A. & Bohnen, H. Análises de solo, plantas e outros materiais. (1995).
Rozane, D. E. et al. Compositional nutrient diagnosis (CND) applied to grapevines grown in subtropical climate region. Horticulturae 6, 56 (2020).
Article Google Scholar
Beaufils, E. Diagnosis and recommendation integrated system (DRIS). (1973).
Wilkinson, S. R., Grunes, D. L. & Sumner, M. E. Nutrient interactions in soil and plant nutrition. In Handbook of Soil Fertility and Plant Nutrition (ed. Sumner, M. E.) 91 (CRC Press, 2000).
Google Scholar
Robnik-Šikonja, M. & Kononenko, I. Theoretical and empirical analysis of ReliefF and RReliefF. Mach. Learn. 53, 23–69 (2003).
Article Google Scholar
Hu, S., Wang, Y.-G., Drovandi, C. & Cao, T. Predictions of machine learning with mixed-effects in analyzing longitudinal data under model misspecification. Stat. Methods Appt. 32, 681–711 (2023).
Article MathSciNet Google Scholar
Petrazzini, B. O., Naya, H., Lopez-Bello, F., Vazquez, G. & Spangenberg, L. Evaluation of different approaches for missing data imputation on features associated to genomic data. BioData Min. 14, 44 (2021).
Article PubMed PubMed Central Google Scholar
Ravelojaona, N. et al. STICS soil-crop model performance for predicting biomass and nitrogen status of spring barley cropped for 31 years in a gleysolic soil from Northeastern Quebec (Canada). Agronomy 13, 2540 (2023).
Article CAS Google Scholar

Download references

Acknowledgements

This article was supported financially by Santa Catarina State Agricultural Research and Rural Extension Agency, Epagri (Seplan N. 6312106) and the Natural Sciences and Engineering Research Council of Canada (NSERC-2254).

Author information

Authors and Affiliations

Caçador Experimental Station, Research and Rural Extension of Santa Catarina (Epagri), Epagri, Abílio Franco Street, 1500, Caçador, Santa Catarina, 89501-032, Brazil
Leandro Hahn & Anderson Luiz Feltrim
Ituporanga Experimental Station, Research and Rural Extension of Santa Catarina (Epagri), Epagri, Lageado Águas Negras General Road, Ituporanga, Santa Catarina, 88400-000, Brazil
Claudinei Kurtz & Fábio Satoshi Higashikawa
Department of Soil, Federal University of Santa Maria, Ave. Roraima, 1000, Building 42, Santa Maria, RS, 97105-900, Brazil
Betania Vahl de Paula, Gustavo Brunetto & Léon-Étienne Parent
University Alto Vale do Rio do Peixe, Uniarp, Victor Baptista Adami Street, 800, Caçador, Santa Catarina, 89500-000, Brazil
Camila Moreira
State University Paulista “Julio Mesquita Filho”, Campus Registro. Registro, Av. Nelson Brihi Badur, 430, São Paulo, 11900-000, Brazil
Danilo Eduardo Rozane
Department of Soils and Agrifood Engineering, Laval University, Quebec, QC, G1V 0A6, Canada
Léon-Étienne Parent

Authors

Leandro Hahn
View author publications
You can also search for this author in PubMed Google Scholar
Claudinei Kurtz
View author publications
You can also search for this author in PubMed Google Scholar
Betania Vahl de Paula
View author publications
You can also search for this author in PubMed Google Scholar
Anderson Luiz Feltrim
View author publications
You can also search for this author in PubMed Google Scholar
Fábio Satoshi Higashikawa
View author publications
You can also search for this author in PubMed Google Scholar
Camila Moreira
View author publications
You can also search for this author in PubMed Google Scholar
Danilo Eduardo Rozane
View author publications
You can also search for this author in PubMed Google Scholar
Gustavo Brunetto
View author publications
You can also search for this author in PubMed Google Scholar
Léon-Étienne Parent
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

Conceptualization: L.-É.P. and L.H. Field sample collection: L.H., C.K., A.L.F., F.S.H. and C.M. Laboratory analysis: A.L.F., F.S.H. and C.M. Date curation: L.H., D.E.R., G.B., and L.-É.P. Writing of the original draft: L.H., C.K. and B.V.d.P. Resources: L.H. and L.-É.P. Revision: B.V.d.P., G.B. and L.-É.P.

Corresponding author

Correspondence to Betania Vahl de Paula.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Supplementary Information.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Hahn, L., Kurtz, C., de Paula, B.V. et al. Feature-specific nutrient management of onion (Allium cepa) using machine learning and compositional methods. Sci Rep 14, 6034 (2024). https://doi.org/10.1038/s41598-024-55647-9

Download citation

Received: 06 February 2023
Accepted: 26 February 2024
Published: 12 March 2024
DOI: https://doi.org/10.1038/s41598-024-55647-9

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.

Subjects

Abstract

Similar content being viewed by others

Plant responses to changing rainfall frequency and intensity

Hotspots of biogeochemical activity linked to aridity and plant traits across global drylands

Closing the gap between climate regulation and food security with nano iron oxides

Introduction

Results

Model performance to predict bulb yields

Tissue nutrient standards

Discussion

Nitrogen recommendations

Phosphorus and potassium recommendations

Tissue diagnosis

Need for large and diversified databases

Conclusions

Material and methods

Experimental setup

Climatic data

Experimental setup

Fertilizer treatments

Soil analysis

Tissue analysis

Statistical analysis

Log ratio transformation

Machine learning models

Data availability

References

Acknowledgements

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Competing interests

Additional information

Publisher's note

Supplementary Information

Supplementary Information.

Rights and permissions

About this article

Cite this article

Share this article

Comments

Search

Quick links