Introduction

Photosynthesis is one of the most important biological processes involved in plant growth, and the rate of photosynthesis rate can be affected by a list of factors, such as temperature, CO2 concentration and light intensity1,2. In particular, many nonlinear models have been developed for describing the rate of photosynthesis in response to the change of irradiance3,4,5. There are three types of photosynthetic light response models currently available in literature: exponential, rectangular hyperbola and nonrectangular hyperbola.

Exponential Model6:

where is the net photosynthetic rate, I the irradiance, the initial quantum efficiency, Amax the net light saturated photosynthetic rate, and the dark respiration rate.

Rectangular Hyperbolic Model7,8:

Nonrectangular Hyperbola Model9:

where is a curvature parameter.

Modified Rectangular Hyperbola Model10,11:

where β and γ are constants.

Selecting appropriate initial values is crucial for fitting these nonlinear models and estimating their parameters. For biologically meaningful parameters, using the estimates directly from experiments would be a first try for assigning the initial values. However, for model specific parameters that cannot be measured in experiments, choosing appropriate initial values when fitting the model could become problematic. For example, two of the parameters in Eq. (4) are constants without direct biological meanings. Even for those parameters with direct biological meanings, assigning their initial values in the model fitting could still be time consuming. For instance, all the three parameters in the exponential model (Eq. 1) have sound biological meanings, but only Amax can be directly measured; the other two parameters, a and Rd, require additional procedures to derive.

There are many statistical software packages currently available for fitting nonlinear models; to list a few, SAS, R, GenStat, MatLab, Sigmaplot, and SPSS. The built-in functionality to fit nonlinear models in these software packages normally implement algorithms of local optimization for regression and parameter estimation, such as the Gauss-Newton, Levenberg-Marquardt and Nelder-Mead algorithms. As local optimization methods are highly sensitive to the setting of initial values, failed convergence often occurs due to wrongly assigned initial values. However, there are no reliable methods available for selecting initial values5.

To develop effective methods for resolving the problem of assigning appropriate initial values when using these nonlinear models, we need to go beyond local optimization. Genetic algorithms and simulated annealing algorithms have been widely applied to search global optimal solutions in multi-dimensional space12,13,14,15,16,17. In particular, recent comparisons have suggested that the Differential Evolution (DE) algorithm performs better than other global optimization methods such as the genetic algorithm and simulated annealing18,19. There are four main steps for implementing the DE algorithms: initialization, mutation, crossover and selection, which is also capable of dealing with non-differentiable problems18. Compared to local optimization methods, global optimization algorithms are less sensitive to initial values20,21. However, the application of global optimization algorithms to parameter estimation in photosynthetic light-response models is lacking.

The objectives of this study are to: (1) test the performance of the Levenberg-Marquardt (LM) algorithm, a local optimization method, and three global optimization algorithms: Genetic Algorithm (GA), Generalized Simulated Annealing (GSA) and the Differential Evolution (DE) algorithm; (2) develop a reliable method for fitting photosynthetic light-response models using the global optimization algorithm and simultaneously solve the issue of hyper initial-value sensitivity in local optimization algorithms.

Results and Discussion

Model comparison

According to calculated values of AIC (Tables 1 and 2), the modified rectangular and the nonrectangular hyperbolic models performed better than the exponential and rectangular hyperbolic models. The nonrectangular model is more flexible for its additional curvature parameter (; when θ = 0, it becomes the rectangular model5,22. For example, the estimates of for the photosynthetic datasets of Brassica rapa var. chinensis (L.) Kitam., Camellia sinensis (L.) Kuntze and Impatiens balsamina L. were all close to zero, the other three parameters of the nonrectangular model were also equal to those estimated from the rectangular model (Supplementary Tables S1 and S2).

Table 1 Akaike’s information criterion (AIC) and R2 values of each model using the Differential Evolution algorithm.
Table 2 Akaike’s information criterion (AIC) and R2 values of each model using the Differential Evolution algorithm.

Comparison of local and global optimization algorithms

Not surprisingly, failed convergence often occurred from wrongly assigned initial values when only using the LM algorithm, due to its hyper sensitivity to initial values (Table 3). In particular, the nonrectangular hyperbolic model failed to converge for 1260 times due to inappropriate initial values in the simulation (Table 3, Supplementary Table S3). There were also many compromised fits with R2 < 0.90 even when the LM algorithm converged successfully (Fig. 1). By contrast, the DE algorithm was less sensitive to initial values, which provided good fits with R2 approximately 0.98 for all the 42 datasets of 21 species (Tables 1 and 2) when using the same initial values for each photosynthetic model, as listed in Table 4. The DE also performed the best among the three global optimization methods, followed by the GSA (Fig. 2). In comparison, parameter estimates from the GA were not reliable with even negative R2(Fig. 3). The speed of convergence for the GA algorithm was also slower than that of the DE and GSA algorithms. Although the best optimum solutions of the 100 simulation tests from the LM were similar to those from the DE (Fig. 4), however, it was not recommended to only use the LM as the large amount of failures occurred due to wrongly sampled initial values (Table 3). After setting the initial values to the parameter estimates from the DE algorithm, the value of R2 estimated from the LM algorithm only slightly increased (<0.05) for the rectangular and modified rectangular models; however, the values of R2 almost remained the same for the exponential and nonrectangular models (Fig. 5). Although local optimization methods are sensitive to the input of initial values, local optimization algorithms are generally faster and more accurate than global optimization method in a local space20,23,24. In addition, there was no failures of convergence when the parameters of rectangular and modified rectangular models obtained from the DE algorithm were set as the initial values for running the LM algorithm. Therefore, we recommend to combine the DE and LM algorithms for estimating parameters of the rectangular and modified rectangular models if higher accuracy is needed, instead of increasing the number of iterations when running the DE algorithm.

Table 3 Performance of the Levenberg-Marquardt in the simulation test.
Figure 1
figure 1

Distribution of R2 when using the Levenberg–Marquardt algorithm with successful convergence of the modified rectangular (A), nonrectangular (B), rectangular (C) and exponential (D) models.

Table 4 Lower and upper bounds of parameters for each model optimized by the Genetic Algorithm, Generalized Simulated Annealing and the Differential Evolution algorithms.
Figure 2
figure 2

Difference of R2 when using the Differential Evolution and Generalized Simulated Annealing algorithm for the modified rectangular (A), nonrectangular (B), rectangular (C) and exponential (D) models.

Figure 3
figure 3

Values of R2 when using the Genetic Algorithm for the modified rectangular (A), nonrectangular (B), rectangular (C) and exponential (D) models.

Figure 4
figure 4

Difference of R2 when using the Differential Evolution and the best optimum from the Levenberg–Marquardt algorithm for the modified rectangular (A), nonrectangular (B), rectangular (C) and exponential (D) models.

Figure 5
figure 5

Difference of R2 when using the Differential Evolution, combined with the Levenberg–Marquardt algorithm for the modified rectangular (A), nonrectangular (B), rectangular (C) and exponential (D) models.

In conclusion, the application of the DE algorithm can not only effectively resolve the issue of hyper sensitivity of initial values in local optimization but also provide good fits for the photosynthetic models. The lower and upper bounds of initial values (Table 4) for each photosynthetic light-response model can be regarded as a standard setting when using the DE algorithm. In addition, the estimated parameters (Supplementary Tables S1 and S2) of the mainstream photosynthetic models for both herbaceous and woody plant species can be used as a reference for selecting initial values when using the traditional local optimization method.

Methods

A total of 42 datasets from 21 species (Supplementary Table S4), including both herbaceous and woody plants, were used for evaluating the performance of three global optimization algorithms: Genetic Algorithm (GA), Generalized Simulated Annealing (GSA) and the Differential Evolution (DE) algorithms, when fitting the photosynthetic light response models. For comparison, the Levenberg-Marquardt (LM) algorithm, a derivative based local optimization method, was also applied to estimate parameters of the photosynthetic models. The raw data of species 2, 5–9, 14–17, 20 and 22 (see Supplementary Table S4) were extracted from figures in publications25,26,27,28,29,30,31,32,33,34,35,36 using the GetData Graph Digitizer 2.26 (http://getdata-graph-digitizer.com). After 500 iterations, the estimates from the three optimization methods were used to evaluate their performance. To avoid biasing the results, we used the same lower and upper bounds for parameter estimation when using the three global optimization (Table 4). For each dataset, 100 random values were drew from a uniform distribution with the same lower and upper bounds for the global optimization algorithms as initial values for testing the LM algorithm. A total of 4200 simulations were repeated for each photosynthetic model. Parameters of all the photosynthetic light-response models were estimated by the LM, GA, GSA and DE algorithms, implemented in the minpack.lm, GA, GenSA, and DEoptim packages of R37. The coefficient of determination and Akaike’s information criterion (AIC)38 were used as the goodness-of-fit index and for model comparison, respectively.

Additional Information

How to cite this article: Chen, L. et al. A general method for parameter estimation in light-response models. Sci. Rep. 6, 27905; doi: 10.1038/srep27905 (2016).