Modelling daily plant growth response to environmental conditions in Chinese solar greenhouse using Bayesian neural network

Mohmed, Gadelhag; Heynes, Xanthea; Naser, Abdallah; Sun, Weituo; Hardy, Katherine; Grundy, Steven; Lu, Chungui

doi:10.1038/s41598-023-30846-y

Download PDF

Article
Open access
Published: 16 March 2023

Modelling daily plant growth response to environmental conditions in Chinese solar greenhouse using Bayesian neural network

Gadelhag Mohmed^1,2,
Xanthea Heynes¹,
Abdallah Naser²,
Weituo Sun^1,3,
Katherine Hardy¹,
Steven Grundy¹ &
…
Chungui Lu¹

Scientific Reports volume 13, Article number: 4379 (2023) Cite this article

2844 Accesses
4 Citations
Metrics details

Subjects

Abstract

Understanding how plants respond to environmental conditions such as temperature, CO₂, humidity, and light radiation is essential for plant growth. This paper proposes an Artificial Neural Network (ANN) model to predict plant response to environmental conditions to enhance crop production systems that improve plant performance and resource use efficiency (e.g. light, fertiliser and water) in a Chinese Solar Greenhouse. Comprehensive data collection has been conducted in a greenhouse environment to validate the proposed prediction model. Specifically, the data has been collected from the CSG in warm and cold weather. This paper confirms that CSG’s passive insulation and heating system was effective in providing adequate protection during the winter. In particular, the CSG average indoor temperature was 18 \(^{\circ }\)C higher than the outdoor temperature. The difference in environmental conditions led to a yield of 320.8g per head in the winter after 60 growing days compared to 258.9g in the spring experiment after just 35 days. Three different architectures of Bayesian Neural Networks (BNN) models have been evaluated to predict plant response to environmental conditions. The results show that the BNN network is accurate in modelling and predicting crop performance.

Future groundwater potential mapping using machine learning algorithms and climate change scenarios in Bangladesh

Article Open access 06 May 2024

Environmental drivers of increased ecosystem respiration in a warming tundra

Article Open access 17 April 2024

Climate change impacts and adaptations of wine production

Article 26 March 2024

Introduction

Climate change is the biggest challenge to global food security. Protected cultivation can protect crops from extreme weather conditions, reduce the incidence of pests and diseases, and ensure that food is provided all year round. Globally, using a greenhouse environment is the most popular way to produce horticultural crops, with an estimated 496,800 hectares (ha) in 2019, with total production worth 20 billion US dollars in 2020¹. China has become the world’s largest economy in protected horticulture, with 3.3 million ha (polytunnels included) and a total output of 1.43 trillion Chinese Yuan in 2019, and after polytunnels (45%), Chinese solar greenhouses (CSG) (30.5%) are the second most popular choice of greenhouse structure across China (Institute of Protected Agriculture AoAPaE, 2020). However, in cooler climates, the heating energy demand in a commercial greenhouse is responsible for 65-85% of total the greenhouse energy demand².

CSGs employ a passive thermal recycling system to reduce the energy consumption needed through active heating, and external meteorological factors and control mechanisms determine the internal microclimate of a greenhouse (e.g., ventilation openings, exhaust fans, heaters, and evaporative cooling systems)³. However, they have significant structural differences compared to greenhouses in the Netherlands, Israel, and Spain^4,5 regarding the cover, envelope and structure. A CSG has three thermal storage walls along the structure’s north, east, and west sides. The north wall, a core feature of the CSG, plays an essential role in thermal storage, heat preservation, and heat insulation⁶. The cambered south roof, north wall, and thermal blanket enable the CSG to perform well on daylight access, heat storage and insulation, as shown in Fig. 1. During the day, the greenhouse captures heat from the sun, storing it within the thermal mass of the walls, which is then released, at night, as a passive heating source. During the night, an insulating sheet closes over the transparent plastic sheet to reduce heat loss from the greenhouse. This passive solar heating strategy employed by CSGs enables huge energy savings compared to the heating required to heat a glass greenhouse. A study in Manitoba, Canada, showed that the supplemental energy required to maintain temperatures above 10 \(^{\circ }\)C at all times was 43 times less for the CSG compared to a glass greenhouse⁷.

Until recently, many approaches have been taken to optimise microclimate control performance and have considered the following environmental factors individually; light, temperature, humidity, ambient CO₂ concentration, soil type, water, and nutrient availability. However advancements in environmental sensor technology which can record real-time fluctuations in microclimate, combined with data-driven machine learning approaches, offer the potential to resolve the highly complex relationship between these numerous factors which influence plant growth and development. For example, cultivation season can significantly impact plant growth and development, even when using an indoor greenhouse environment. Reduced light intensity and photoperiod experienced during winter reduces the physiological responses of plants (i.e., rate of photosynthesis and stomatal conductance), which negatively affects overall biomass yield, nutritional value, and is also attributed to increased nitrate content^8,9. In this scenario, sensor technology could effectively record these real-time fluctuations, highlighting the requirement for an appropriate adaptation to improve the growing conditions of the CSG. Temperature, relative air humidity and CO₂ concentration regulation in greenhouse environments must also be carefully considered. Whilst evaporative systems can be controlled by opening/closing the roof windows, this could adversely cause fluctuations in vapour pressure deficit (VPD)¹⁰, reducing a plant’s net CO₂ assimilation rate¹¹. A recent study analysed the plant growth characteristics of greenhouse lettuce grown under drastically fluctuating VPD conditions (1.63 kPa for 6 min and 0.63 for 3 min) and moderately fluctuating VPD conditions (1.32 kPa for 7 min and 0.86 kPa for 3 min), concluding dry shoot weight and leaf area was 15 and 29% lower in lettuce grown under drastically fluctuating conditions¹⁰. Temperature is also critically important environmental variable for maximising crop yield and productivity. The relationship of temperature and crop development often shows a sigmoidal relationship, where growth and development cease below a critical temperature threshold at both extremes, but a linear positive correlation exists between the two extreme thresholds. Regression analysis of time to harvest of field grown Romain Lettuce in South Carolina grown over multiple years identified that for every 1 \(^\circ\)C decrease in growing season mean (GSM) minimum or maximum temperatures from the optimum values, days to harvest increased by 5 days with a 5 \(^\circ\)C increase in GSM min or GSM max temps, total days to harvest increased 50%¹². By dividing complex environmental data into elements, their effects on crop growth could be quantified, enabling an accurate prediction of the impact of fluctuating environmental conditions on plant growth.

The development of predictive models for plant growth in protected horticulture will allow for the optimisation of microclimate control strategies to maximise crop yield while reducing energy consumption. Currently, most CSG climate solutions are controlled manually based on experience, resulting in poor performance on greenhouse production. Many research works have been conducted to analyse and predict plant growth performance using many different Artificial Intelligence (AI) approaches to predict the environmental conditions, mainly ambient temperature¹³. Hence, there is currently no data model for lettuce growth in CSGs which incorporates climate management control including the effects of air temperature, humidity, CO₂ concentration, and radiation under CSG scenarios, where environmental conditions are usually not within the optimal range for crop growth and development due to the limited climate control ability. Currently, AI is mainly employed for indoor and outdoor agriculture to enhance plant productivity by finding the most suited conditions for plant growth in terms of soil management, crop management, weed management and disease management. For indoor agriculture, the main concept of using AI is its flexibility, high performance, accuracy, and cost-effectiveness. Regulation of environmental components in greenhouses is crucial for better plant growth and many studies support that this can be achieved through employing Artificial Intelligence (AI) systems over manual control methods^14,15. DL has also been applied to greenhouse yield prediction, although this has predominantly focused on tomato crops. A Dynamic Artificial Neural Network (DANN) is implemented in¹⁶, to predict tomato yield using phenotypic parameters including CO₂ fixation, transpiration, as well as environmental parameters such as solar radiation and past yield. While CO₂ fixation was found to be the most important variable, a high degree of predictive accuracy (R=0.917) was found with external parameters alone. Alhnaity et al (2019) evaluated several Machine Learning (ML) and Deep Learning (DL) techniques to achieve high predication accuracy in plant yield and growth within greenhouse environments using two different plants; ficus and tomato. The study specifically focused on ficus growth and the variation in stem diameter throughout their development, and tomato yield measurements, in combination with environmental measurements¹⁷. Recent research work by Gong et al. (2021) exhibited that DL based on a LSTM model achieved high prediction accuracy for both problems, outperforming classical ML approaches. This was achieved through applying artificial neural networks to predict tomato yield in a greenhouse environment based on historical yield and environmental data. Based on statistical analysis of the RSME, deep learning approaches outperformed classical models, with a combined tempo- ral convolutional network (TCN) with recurrent neural network (RNN) model providing the most accurate tomato yield predictions¹⁸.

These studies provide examples of how the development of predictive models for plant growth in protected horticulture will allow for the optimisation of microclimate control strategies to maximise crop yield while reducing energy consumption. Model predictive control has a large potential to provide higher control efficiency. As the basis, a crop model responding to greenhouse climate is needed. DL has demonstrated how it can be used as a powerful tool for yield prediction, however its application in a greenhouse environment has focused primarily on tomatoes grown in European style greenhouses. These approaches are utilised for dealing with the randomness and complexity of agricultural data including data that obtained from CSG. To give a better understanding of the data-driven approaches proposed for agricultural data processing, this paper grouped the data-driven methods into three main groups: Deterministic Methods, Stochastic Methods, and Machine Learning (ML) methods¹⁹. In this paper, an ML method, called BNN, has been utilised to enhance the crop productivity through predicting plant response to environmental conditions in CSG. The rationale behind using an ML approach, in particular, BNN is to overcome the limitations of stochastic and deterministic methods, e.g., deterministic methods can not deal with high random distribution data, which likely to accrue in agricultural environment²⁰. While stochastic method, e.g., Markov Chain Model, Hidden Markov Model (HMM), and entropy, is a promising approach in agricultural applications due to its computational and time efficiency²¹. The potential randomness measure of agricultural data can be analogous plant growth response.

The main aim of this study is to generate a predictive model to understand the effect of environmental conditions on lettuce yield in CSG’s to enhance plant performance and resource-use efficiency. This will be achieved through the following objectives: (1) collect temperature, light, CO₂ and humidity data across a warm and cold season to train the ANN structures, (2) determine the most effective structure based on the data generated from these environmental parameters.

This section has introduced the study and provided an overview of the related works. The rest of this research paper is organised as follows; the experimental setup is presented in section Experimental setup, including data collection, data pre-processing and the proposed models. The results from the proposed Bayesian neural network-based model are presented and discussed in section Results and discussion including an evaluation of the following environmental benefits: temperature, light, CO₂, and humidity. This section also includes a discussion on the evaluation of seasonal difference in plant performance and modelling daily plant growth response to environmental conditions using BNN. Finally, a conclusion is drawn in section Conclusion with suggestions for future work provided.

Experimental setup

To evaluate the performance of the proposed approach, two different datasets were collected from a CSG during warm and cold seasons. In particular, the BNN, explained previously, is employed with the collected datasets to observe the effect of different environmental conditions on lettuce growth. Specifically, the lettuce cultivar (41–27) Tiberius RZ (produced by RIJK ZWAAN, the Netherlands) was selected for experimental trials. In the following sections, the collected datasets are explained in detail, as well as the BNN based model for modelling lettuce growth, based on the datasets collected from a Chinese Solar Greenhouse.

Data collection

The data used in this research was collected from a CSG located in Beijing, China. The CSG is oriented east-west, consisting of the north wall, side walls, south roof, and back roof, as well as the two controllable structural components of thermal blankets and vents. No climate conditioning equipment was used during the experiments. Water and fertiliser management, as well as pest control, were assumed to be ideal.

Environmental conditions including temperature, CO₂, relative humidity and light radiation were measured inside and outside of the CSG using a complex sensor module. Five temperature, humidity, and CO₂ sensors and three radiation sensors were placed at different locations throughout the CSG. 6–18 lettuce plants were randomly harvested for each sample. When conducting the lettuce experiments during warm and cold seasons, two different datasets were collected to comparatively measure the indoor and outdoor environmental conditions of the CSG whilst simultaneously collecting plant response data for the lettuce plants.

The first dataset was collected during a warm season (April 9 to May 14, 2020). The CSG used for the warm weather experiment has a floor area of approximately 577 m\(^{2}\), with a width of 7.4m and a length of 78m. The lettuce seedlings were transplanted to soil inside the experimental CSG when they had 11 fully developed leaves. The plant density is 5.30 plants/m\(^{2}\) (floor area).

Following this, the second dataset was collected during a cold season, from November 24, 2020 to January 23, 2021. The CSG used for the cold weather experiment has a floor area of approximately 686 m\(^{2}\), with a width of 7.95m and a length of 86.3m. The lettuce seedlings were transplanted to soil inside the experimental CSG when they had 5 fully developed leaves. The plant density 4.91 plants/m\(^{2}\) (floor area).

The sensors measured environmental parameters every 5 minutes. The real-time of measuring the environmental conditions were also used for training the model. The main focus of this paper was to collect indoor data from a Chinese Solar Greenhouse and lettuce plant response data through measuring Shoot fresh weight, Root fresh weight, Shoot dry weight, Root dry weight and leaf area, in accordance with the measured environmental conditions. Table 1, shows the average of each environmental parameter measured indoor and outdoor of the CSG. To improve the training efficiency of the BNN, data normalisation and augmentation techniques were used with the collected environmental conditions data. 10082 and 17282 samples were measured for each indoor and outdoor parameter during warm and cold seasons, respectively. A total of 27,364 data points for each environmental parameter was collected from CSG during both the seasons.

Table 1 Information about the collected datasets including measured parameters, measuring unit and average for each measured parameter during the experiment period. EX1 data is the dataset collected during the warm season for 35 days. EX2 data is the dataset collected during the cold season for 60 days.

Full size table

Data pre-processing

BNN has been trained using both of the collected datasets illustrated in Fig. 2 and Table 1. The datasets represent both indoor and outdoor environmental conditions (temperature, CO₂, relative humidity and light radiation), in addition to the daily yield measurements and growth rate (Shoot fresh weight (g), Shoot dry weight (g), Root fresh weight (g), Root dry weight (g) and leaf area cm\({^2}\). The recorded data representing the environmental conditions were measured every 5 minutes, followed by being averaged on an hourly/daily basis. Simultaneously, the yield measurement growth data was recorded every 5 days. To deal with these data characteristics, the data augmentation technique was performed, through interpolation of days’ data, resulting in daily data measurements. An hourly average for the environmental parameters was also performed to achieve a similar daily representation that matched the yield measuring observations. Moreover, to deal with the missing information in the obtained dataset, the missing observations data were replaced by the moving average interpolation of the latest neighbouring time series data.

The response of lettuce growth to changes in environmental conditions can be categorised as a long response that can be identified on a daily basis. Therefore, resampling or aggregation of time series data is applied. This includes data being resampled on a daily basis using the averaging procedure to identify the growth response and the daily growth rate.

Bayesian neural network-based model

In this study, Bayesian Neural Networks (BNNs)²² have been used for modelling and predicting lettuce plant growth in CSG based on the back-propagation algorithm^23,24. The primary aim of the utilised network is to find the relationship between the network inputs (temperature, CO₂, relative humidity, day and light radiation) and the network outputs (the biomass measurements (dry and fresh weight) for the plant shoot, root matter, and leaf area). Various neuron numbers in the hidden layer have been examined, including, 10, 20 and 25 neurons.

Figure 3 shows the architecture of the proposed approach using Bayesian neural network for modelling and predicting the lettuce growth in CSG. The BNN structure, used in this study, consists of three layers: input, hidden, and output layers. To obtain a high performance of modelling and lettuce growth , a feedback loop procedure was applied to produce time-series historical data for the input and output datasets. This was achieved by using a time delay operator with the inputs and outputs during the training mode.

To increase the ability of the designed BNN for modelling lettuce growth in the CSG with sufficient accuracy, the two collected datasets representing the warm and cold seasons were combined together and randomly split into three independent subsets: training, validation, and testing. The training dataset is the sample of data used for learning the BNN algorithm to fit the model. The validation dataset is used during the training mode consisting of the trained model and to regularise the early stopping training iterations to prevent the issue of model overfitting when generalisation was not improving²⁵. The testing dataset is used to test the model after the training mode is done to evaluate the model performance. This means that the training and validation dataset D is (70%), and the testing set W is (30%). The training and validation datasets include \(D_x\) and \(D_y\) for the training input parameters and training labels, respectively.

As the structure of the used BNN model is significant in the performance of the model, the model parameters, consisting of the prior distribution, the likelihood function and the number of neurons in the hidden layer, needed to be determined. Therefore, the model parameters and the achieved results are determined using a Mean Absolute Error (MAE) for comparing the different used networks to determine the optimal structure.

Results and discussion

Real-time environmental data was collected from indoor and outdoor of CSGs during two experiments performed in a warm season and cold season, and plant performance was measured throughout. This data provides insight into the performance of the CSG and has also been used to develop a predictive crop growth model using BNN from the environmental and plant performance data. It is becoming increasingly prominent that AI and human–machine learning can improve our understanding of how input features can influence behaviours in volatile environments; simultaneously improving prediction accuracy and reliability, which are important components of Agriculture²⁶. Based on observations from the collected datasets, and the responses of lettuce growth, the daily growth rate of the lettuce represents the accumulation of plant weight over time. The growth rate observed after analysing the fresh and dry weights of the lettuce plants, measured the sensitivity of the change in biomass to the change in environmental conditions, over time. This means, the dynamic effect of the environmental conditions on the lettuce growth weight can be identified more clearly by using a growth rate variable, rather than using the normal change in the growth. Thus, the daily growth rate of lettuce weight was chosen as the main output for the developed BNN modelling and prediction model.

Evaluation of the environment benefits of CSG

Temperature

Temperature has a profound effect on crop growth, with every 1 \(^{\circ }\)C divergence from the optimum growing season mean daily temperature was found to greatly influence both head yield and cultivation time of lettuce grown in a moderate climate¹². As would be expected due to the geographical location of the trial and time of year, the average temperature varied greatly between experiment 1 in the spring and experiment 2 in the early winter, with average outside temperatures of 17.2 \(^{\circ }\)C and \(-7.11\,^{\circ }\)C, respectively. The average temperature within the CSG was 22.96 \(^{\circ }\)C during the “warm” experiment and 10.9 \(^{\circ }\)C in the “cold” experiment, with the temperature difference inside the CSG relative to the outside environment in the warm season being 5.7 \(^{\circ }\)C and 18 \(^{\circ }\)C in the winter. The 18 \(^{\circ }\)C difference in average temperature inside the CSG compared to the out outside temperature show that the CSG is capable of providing an effective passive insulating and heating system during harsh winter conditions. Furthermore, comparison of the average daily minimum temperature in the winter experiment show the coldest temperature reached was \(-0.98\,^{\circ }\)C in the CSG compared to \(-23\,^{\circ }\)C outside, a difference of 22.02 \(^{\circ }\)C Critically, these results show that the thermal insulation and passive heating of the CSG is reducing the crop exposure to cold stress which would drastically increase productivity and profitability.

Light

Light is a critical environmental variable as photosynthetically active radiation is absorbed by plants to drive photosynthetic carbon fixation needed to fuel plant growth. A 58.8% seasonal difference in average light radiation was recorded in the outside environment in the winter relative to the late spring experiment. In the winter conditions, the average light radiation inside the CSG were 179.5 W/m\({^2}\) in experiment 1 and 63.88 W/m\({^2}\) in experiment 2. According to²⁷, light was a limiting factor below 130 W/m\({^2}\) on the yield of lettuce grown in a controlled environment under optimal temperatures, indicative that light could be a limiting factor between the two experiments but especially in the warm experiment which had more favourable growing conditions. The percentage light lost between inside compared to outside the CSG was 26% in the late spring and 36% in the winter, likely explained by the seasonal difference solar elevation angle affecting transmission of light through the south roof.

CO₂

CO₂ concentration was also measured as it is an important environmental factor elevated CO₂ causes increased photosynthesis in plants through increasing the efficiency of carbon fixation, which leads to greater production of carbohydrates and biomass. The average CO₂ concentration inside the CSG was substantially elevated in the winter compared to the spring experiment by 144 ppm. In the winter, to retain heat windows and vents are generally closed which reduces the ventilation in the glasshouse and can lead to elevated CO₂ caused by respiration from workers and soil respiration. Previous research from has shown CO₂ concentration generally increases crop yield, however, this was dependent on light radiation, since both CO₂ and light be limiting factors inhibiting photosynthesis and therefore plant growth²⁸. Therefore, increased CO₂ during the winter may improve photosynthetic performance, plant growth and yield. Given the large variation between environmental factors in the two experiments, this should provide a robust dataset for accurately modelling lettuce growth in CSG.

Humidity

Humidity was also measured as stomata tend to close in dry air to reduce water loss which indirectly affects photosynthesis and therefore biomass accumulation due to reduced intracellular CO₂ concentrations lowering the efficiency of photosynthetic carbon fixation. Average humidity did not vary greatly seasonally as measured from outside the CSG, with 51.3% in the late spring compared to 58.7% in th winter. In the warm experiment as the CSG is well ventilated during this time, the inside RH was only marginally increased to 51.3%. However, in the winter, when windows are closed to reduced heat loss, the inside RH was elevated to 81.75%, a 39% increase compared to the outside average RH. Slightly increased RH in the winter will improve the water use efficiency by reducing the evaporative water demand on the stomate while also improving photosynthetic performance²⁹.

Evaluation of seasonal difference in plant performance

The plant phenotypic traits measured throughout both experiment 1 and 2 are presented in Fig. 4. Cultivation time between experiments differed greatly between experiment 1 and 2, with time to harvest 35 days in the warm season and 60 days during the winter. Shoot fresh weight, which is equivalent to yield in lettuce as the entire above-ground plant material is harvested, was 320.8g per head in the winter after 60 growing days compared to 258.9g in the spring experiment after 35 days. If the yield of the crop is considered relative to the harvest time then the warm season head a greater fresh weight per day compared to the winter season, with 7.4g per day compared to 5.3g per day in the winter season, a 38% increase. Shoot dry weight, is a biomass measurement that indicates the net primary production and growth rate of the plant excluding difference in water content. Dry shoot weight was also higher in the winter experiment, with 13.8g compared to 11.8g in the warm experiment. However, if compensating for cultivation time the rate of dry weight at harvest normalised for growing time was increased by 47% in the warm season relative to the winter season. The leaf area at harvest was 3085cm\({^2}\) in the warm experiment compared to 6545cm\({^2}\) in the winter experiment. While the CSG offers substantial passive thermal heating to improve the growing conditions relative to the outside environment, during the winter the mean daily temperature and light intensity fall well below ideal range as reported by other studies^12,27. The sub-optimal growing conditions in the winter necessitate an increased cultivation time which reduces the number of crop cycles which can be obtained in annually, decreasing the productivity of the CSG and leaving the potential for improvement.

Modelling daily plant growth response to environmental conditions using BNN

The lettuce growth performance in CSG, as determined by the environmental conditions, was then identified by using different structures of back-propagation neural network algorithm that employed Bayesian inference framework for modelling and predicting the lettuce fresh and dry weights growth rate was developed. The performance of the used BNN models was evaluated by comparing the predicted values (BNN models outputs) with the actual observed values (target). Figure 5, shows the comparison of the predicted estimated dynamic response of the fresh and dry weights and the leaf area that was calculated by using three different structures of BNN models and the actual observed response for the daily growth rate of the lettuce plant. The used three different structures for the number of neuron units in the hidden layer are 10, 20 and 25 neurons.

Figure 5 illustrates the difference between the predicted estimated dynamic response of the fresh and dry weights and the leaf area that was modelled by using three different structures of BNN models and the actual observed response for the daily growth rate of the lettuce plant to find the optimum required number of neurons to predict the daily plant grown in CSG. The three different structures for the number of neuron units used in the hidden layer are 10, 20 and 25 neurons. The diversity of modelled and predicted outputs of the monitored plant response parameters are different in each BNN structure once they were tested and evaluated using validation and test datasets. Lettuce shoot fresh and dry weights and leaf area are considered to observe the ability of the designed BNN models for modelling and predicting the growth response in this research study. There is a minor difference between the actual observed response with the predicted results when the BNN with 20 neuron units in the hidden layer was employed for predicting plant biomass (dry weight) and fresh weight, which indicates the high accuracy of this BNN structure is used to model and predict the plant biomass CSG. However, when the same designed BNN model was employed to model and predict the leaf area, the accuracy of the achieved result was a bit less compared with the results achieved for predicting the plant dry and fresh weights. These achieved results demonstrate the remarkable capability of the BNN to model and predict the diversity of plant growth responses. It is be seen that the BNN with 20 neuron units in the hidden layer design is followed the trend of the actual plant response parameters and showed its ability for modelling and predicting the temporal nature of the given data compared to the 10 and 25 neuron units in the hidden layer designs.

Mean absolute error rate is calculated for the obtained results from the three different used BNN structures to be compared with the observed response for the daily growth rate of plant response as it is shown in Fig. 6. In this evaluation technique, n is the number of errors in the observation samples, \(x_i\) and x are the targets and the BNN achieved results values. Evaluation of the best fitting BNN designed network to optimise the most accurate BNN model structure was done by calculating the MAE values. This evaluation aims to maximise the coefficient of determination and minimise the MAE values. To minimise the MAE and achieve accurate modelling and predicting results, three different structures of back-propagation neural network algorithms that employed a Bayesian inference framework were used during the training phase for training the models.

The optimal BNN model structure is determined by the model parameters, which are a combination of environmental condition inputs (temperature, CO₂, relative humidity and light radiation) and the number of neuron units in the hidden layer to find the best performance of the identified model. The effect of these combined input parameters was investigated. This effect is determined in Fig. 5 by calculating the mean absolute error (MAE). As it is shown in Table 2, it was found that the MAE is reached to its minimal value by 0.18, 0.02 and 1.32 when 20 neuron units are used in the hidden layer of the BNN for predicating the shoot fresh and dry weights and leaf area, respectively. Therefore, the foregoing suggests that the neural network structure with 20 neuron units in the hidden layer h is useful for modelling and predicting such a dataset.

Table 2 Mean absolute error values of the used BNN models with different number of hidden layers h for modelling the daily growth rate of fresh and dry weight, and the leaf area daily growth.

Full size table

Prediction of plant growth via BNN and other ML techniques could perform better due to its ability to handle unseen and high random data³⁰. Subsequently reducing farm costs, energy use, and potential environmental damage. Incorporating IoT big data with machine learning techniques can deliver profitable, accurate and reliable outcomes, such as plant recognition and crop type classification, detection of plant and leaf diseases, fruit counting and forecasting soil moisture content³¹. Saravi et al.³² employed a DL technique for modelling crop yield using different weather scenarios and varying environmental variables combined with random irrigation applications to create 10,000,000 possible scenarios. Results showed that a simpler Bayesian-based DNN model with a structure of 10 neurons in 5 layers performed just as well (78.6% accuracy) when examining crop productivity, comparable to a DNN crop model with 400 neurons in 10 layers, despite the size of the neural network reducing 80-fold. Whilst machine taught crop models are becoming more extensively used to predict crop productivity, input requirements and biomass yield, existing models are complex, requiring thousands of variables to produce accurate results. DL modelling techniques and Bayesian methods present an opportunity to remedy these limitations. Khan et al.²⁸ used three different DNN methods to analyse and predict the production output of major fruits based on data taken from the National Bureau of Statistics of Pakistan. The study found the Bayesian regularisation back propagation (BR) method (76.3% accuracy) to be most efficient—the Levenberg-Marquardt optimisation method (LM) and the scale conjugate gradient back propagation (SCG) method achieved 65.6% and 70.2% accuracy, respectively. Successfully adopting cross-disciplinary integration between the application of big data technology, IoT, DL techniques and our agricultural production systems is crucial for Agriculture 4.0 development. Widespread problems experienced in the agricultural field encompass crop diseases, poor pesticide control, inefficient irrigation, and ineffective weed management: all could be better controlled and remedied through automized farming practices. For food security, using such techniques for the prediction of crop yield and food availability on a national level would be influential for agricultural policy and assist in market forecasting. Additionally, AI investment can strongly influence the attainment of Sustainable Development Goals, particularly in emerging economies where a component of poverty reduction can be achieved through revolutionizing agriculture education³³.

In this research, three different structures, in terms of the number of neurons in the hidden layer, of the BNN approach for improving modelling and prediction daily crop yield growth performance using data gathered from CSG based on sensory devices. Considering the results obtained from the conducted experiments, it can be concluded that the 20 neurons in the hidden layer model exhibit a high score for accuracy and the minimum mean absolute error (MAE) when its performance is tested for predicting the daily growth performance of the shoot fresh and dry weights and leaf area separately. Also, the overall modelling growth performance, when it is over the whole system, demonstrates the effectiveness of the proposed approaches. The BNN model shows more robust and reliable performance once applied to a larger dataset that (e.g., dataset A and B) represent warm and cold seasons. In particular, when this dataset is mixed randomly together. Our findings show that DL approaches can accurately predict plant performance using environmental factors in CSGs. Modelling crop yield for CSGs offers the potential to develop better management strategies to maximise performance and profitability, allowing economic analysis of the benefit of supplemental heating, lighting, or CO₂ enrichment.

Conclusions and future work

This paper confirms that the Chinese Solar Greenhouse (CSG) design is an energy-saving and low-cost design technology that combines solar energy input and appropriate heat sinks. This design allows the CSG to provide a better crop growth environment, especially in the winter, which significantly influences the greenhouse microclimate and enhances crop productivity and sustainability. It can be concluded from this paper that the Bayesian Neural Networks (BNNs) are effective in modelling and predicting plant growth in response to the temperature, CO₂, humidity, and light radiation conditions in CSG across cold and warm seasons.

Future work can be undertaken to empirically compare this paper’s results with an array of scenarios across different growing environments and crop varieties to conclude the best intelligent crop simulation models and algorithms. This future research direction will potentially improve yield performance estimation and achieve energy-saving modelling strategies.

Data availibility

The datasets used and/or analysed during the current study available from the corresponding author on reasonable request.

References

Cuesta, R. Global Greenhouse Statistics, 2019 (2019).
Vadiee, A. & Martin, V. Energy management strategies for commercial greenhouses. Appl. Energy 114, 880–888 (2014).
Article Google Scholar
Fitz-Rodríguez, E. et al. Dynamic modeling and simulation of greenhouse environments under several scenarios: A web-based application. Comput. Electron. Agric. 70(1), 105–116 (2010).
Article Google Scholar
Pardossi, A., Tognoni, F. & Incrocci, L. Mediterranean greenhouse technology. Chron. Hortic. 44(2), 28–34 (2004).
Google Scholar
Vanthoor, B., Stanghellini, C., Van Henten, E. J. & De Visser, P. A methodology for model-based greenhouse design: Part 1, a greenhouse climate model for a broad range of designs and climates. Biosyst. Eng. 110(4), 363–377 (2011).
Article Google Scholar
Liu, C.-W., Sung, Y., Chen, B.-C. & Lai, H.-Y. Effects of nitrogen fertilizers on the growth and nitrate content of lettuce (Lactuca sativa L.). Int. J Environ. Res. Public Health 11(4), 4427–4440 (2014).
Article CAS PubMed PubMed Central Google Scholar
Beshada, E., Zhang, Q. & Boris, R. Winter performance of a solar energy greenhouse in southern Manitoba. Can. Biosyst. Eng. 48(5), 1–8 (2006).
Google Scholar
Voutsinos, O., Mastoraki, M., Ntatsi, G., Liakopoulos, G. & Savvas, D. Comparative assessment of hydroponic lettuce production either under artificial lighting, or in a Mediterranean greenhouse during wintertime. Agriculture 11(6), 503 (2021).
Article CAS Google Scholar
Kosma, C., Triantafyllidis, V., Papasavvas, A., Salahas, G. & Patakas, A. Yield and nutritional quality of greenhouse lettuce as affected by shading and cultivation season. Emir. J. Food Agric. 25, 974–979 (2013).
Article Google Scholar
Inoue, T. et al. Minimizing VPD fluctuations maintains higher stomatal conductance and photosynthesis, resulting in improvement of plant growth in lettuce. Front. Plant Sci. 12, 646144 (2021).
Article PubMed PubMed Central Google Scholar
Shipley, B. Net assimilation rate, specific leaf area and leaf mass ratio: Which is most closely correlated with relative growth rate? A meta-analysis. Funct. Ecol. 20(4), 565–574 (2006).
Article Google Scholar
Dufault, R. J., Ward, B. & Hassell, R. L. Dynamic relationships between field temperatures and romaine lettuce yield and head quality. Sci. Hortic. 120(4), 452–459 (2009).
Article Google Scholar
Mohmed, G., Grundy, S., Lotfi, A. & Lu, C. Using AI approaches for predicting tomato growth in hydroponic systems. in UK Workshop on Computational Intelligence 277–287 (Springer, 2021).
Mohmed, G., Grundy, S., Sun, W., Hardy, K., Heynes, X. & Lu, C. Modelling daily plant growth response to environmental conditions in Chinese solar greenhouse using Cayesian neural network. Available at SSRN 4082794.
Jospin, L. V., Laga, H., Boussaid, F., Buntine, W. & Bennamoun, M. Hands-on Bayesian neural networks-a tutorial for deep learning users. IEEE Comput. Intell. Mag. 17(2), 29–48 (2022).
Article Google Scholar
Salazar, R., López, I., Rojano, A., Schmidt, U. & Dannehl, D. Tomato yield prediction in a semi-closed greenhouse. in XXIX International Horticultural Congress on Horticulture: Sustaining Lives, Livelihoods and Landscapes (IHC2014): 1107 263–270 (2014).
Alhnaity, B., Pearson, S., Leontidis, G. & Kollias, S. Using deep learning to predict plant growth and yield in greenhouse environments. in International Symposium on Advanced Technologies and Management for Innovative Greenhouses: GreenSys2019 1296 425–432 (2019).
Gong, L., Yu, M., Jiang, S., Cutsuridis, V. & Pearson, S. Deep learning based prediction on greenhouse crop yield combined TCN and RNN. Sensors 21(13), 4537 (2021).
Article ADS PubMed PubMed Central Google Scholar
Ding, Y., Han, S., Tian, Z., Yao, J., Chen, W. & Zhang, Q. Review on occupancy detection and prediction in building simulation. in Building Simulation Vol. 15, pp. 333–356 (Springer, 2022).
Naser, A., Lotfi, A. & Zhong, J. A novel privacy-preserving approach for physical distancing measurement using thermal sensor array. in The 14th Pervasive Technologies Related to Assistive Environments Conference pp. 81–85 (2021).
Mohmed, G., Lotfi, A. & Pourabdollah, A. Enhanced fuzzy finite state machine for human activity modelling and recognition. J. Ambient Intell. Humaniz. Comput. 11(12), 6077–6091 (2020).
Article Google Scholar
Hosseini, S. & Ivanov, D. Bayesian networks for supply chain risk, resilience and ripple effect analysis: A literature review. Expert Syst. Appl. 161, 113649 (2020).
Article PubMed PubMed Central Google Scholar
MacKay, D. J. A practical Bayesian framework for backpropagation networks. Neural Comput. 4(3), 448–472 (1992).
Article Google Scholar
Yacef, R., Benghanem, M. & Mellit, A. Prediction of daily global solar irradiation data using Bayesian neural network: A comparative study. Renew. Energy 48, 146–154 (2012).
Article Google Scholar
Aji, G. K., Hatou, K. & Morimoto, T. Modeling the dynamic response of plant growth to root zone temperature in hydroponic chili pepper plant using neural networks. Agriculture 10(6), 234 (2020).
Article CAS Google Scholar
Hafezi, R. How artificial intelligence can improve understanding in challenging chaotic environments. World Futures Rev. 12(2), 219–228 (2020).
Article MathSciNet Google Scholar
Fu, W., Li, P. & Wu, Y. Effects of different light intensities on chlorophyll fluorescence characteristics and yield in lettuce. Sci. Horticu. 135, 45–51 (2012).
Article CAS Google Scholar
Khan, T. et al. Agricultural fruit prediction using deep neural networks. Procedia Comput. Sci. 174, 72–78 (2020).
Article Google Scholar
Stanghellini, C. et al. Greenhouse Horticulture: Technology for Optimal Crop Production (Wageningen Academic Publishers, 2019).
Book Google Scholar
Marcelis, L., Heuvelink, E. & Goudriaan, J. Modelling biomass production and yield of horticultural crops: A review. Sci. Hortic. 74(1–2), 83–111 (1998).
Article Google Scholar
Garg, D., & Alam, M. Deep learning and IoT for agricultural applications. in Internet of Things (IoT) 273–284. (Springer, 2020).
Saravi, B., Nejadhashemi, A. P., Jha, P. & Tang, B. Reducing deep learning network structure through variable reduction methods in crop modeling. Artif. Intell. Agric. 5, 196–207 (2021).
Google Scholar
Mhlanga, D. Artificial intelligence in the industry 4.0, and its impact on poverty, innovation, infrastructure development, and the sustainable development goals: Lessons from emerging economies?. Sustainability 13(11), 5788 (2021).
Article Google Scholar

Download references

Acknowledgements

This research was funded by National Key Research and Development Program of China, Project number: 2019YFE0125100, Project name: Development and demonstration of smart and high-efficient technologies for fruits and vegetables production in greenhouse and plant factory. The work is also supported by UKRI Innovate UK funding (grant references: 107459 and 51565).

Author information

Authors and Affiliations

School of Animal, Rural and Environmental Sciences, Nottingham Trent University, Brackenhurst Campus, Nottingham, NG25 0QF, UK
Gadelhag Mohmed, Xanthea Heynes, Weituo Sun, Katherine Hardy, Steven Grundy & Chungui Lu
Department of Computer Science, Nottingham Trent University, Clifton Campus, Nottingham, NG11 8NS, UK
Gadelhag Mohmed & Abdallah Naser
Intelligent Equipment Research Centre, Beijing Academy of Agriculture and Forestry Sciences, Beijing, 100097.3, China
Weituo Sun

Authors

Gadelhag Mohmed
View author publications
You can also search for this author in PubMed Google Scholar
Xanthea Heynes
View author publications
You can also search for this author in PubMed Google Scholar
Abdallah Naser
View author publications
You can also search for this author in PubMed Google Scholar
Weituo Sun
View author publications
You can also search for this author in PubMed Google Scholar
Katherine Hardy
View author publications
You can also search for this author in PubMed Google Scholar
Steven Grundy
View author publications
You can also search for this author in PubMed Google Scholar
Chungui Lu
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

The authors contributed equally to this work.

Corresponding authors

Correspondence to Gadelhag Mohmed or Chungui Lu.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Mohmed, G., Heynes, X., Naser, A. et al. Modelling daily plant growth response to environmental conditions in Chinese solar greenhouse using Bayesian neural network. Sci Rep 13, 4379 (2023). https://doi.org/10.1038/s41598-023-30846-y

Download citation

Received: 02 October 2022
Accepted: 02 March 2023
Published: 16 March 2023
DOI: https://doi.org/10.1038/s41598-023-30846-y

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.