Introduction

Moist Evergreen Afromontane Forest is one of the vegetation types in Ethiopia and it is located in the western highland, the patch of Wondo genet, and the southern slope of Bale Mountain1. The Moist Evergreen Afromontane Forest, as cited by Gole et al.2, serves as a habitat for wild coffee. Moreover, it plays a pivotal role in the conservation of highland forest birds and plant diversity. It provides various benefits for the local people like coffee production, honey, spices, fuel wood, construction wood, farm tools, and grazing land. Additionally, it has a role in climate change mitigation by sequestering and storing carbon. Despite this, it faces threats of deforestation and degradation3 from agricultural expansion, illegal cutting, and overgrazing.

In Ethiopia, there is a high rate of Forest degradation and deforestation. from 2000 to 2013 the country lost a mean of 92,000 ha forest annually4, and the majority of this share was taken by Moist Evergreen Forest next to Combretum-Terminalia woodlands. To address the problem, Ethiopia engaged in international initiatives like REDD + (reduction of deforestation and degradation; conservation and sustainable forest management; and enhancing forest carbon). Hence, measuring and determining the biomass and forest carbon stock is the center of attention. Besides, understanding the carbon balance of forest ecosystems is vital to minimize the impact of climate change, and exploring mitigation options5. The biomass of the individual tree is determined in two ways. The first method involves cutting down the tree, weighing it all, and converting it into dry biomass. The second method involves converting measured tree variables (Diameter, Height, and Crown) into biomass using an allometric equation. The second approach is feasible and environmentally sound.

An allometric equation is a method that establishes a relation between some easily measured tree dimensions (height, diameter, and crown diameter) and the dependent variable, total above-ground biomass. It gives an insight into the carbon sequestration potential of woody vegetation. As well, it also helps to calculate the costs and benefits associated with forest carbon projects and improves bio-energy systems and sustainable forest management6. It can be a generic, site-specific, or species-specific allometric equation. Some scholars7,8,9,10 are in favor of using species-specific allometric equations. Additionally, studies indicate that the application of a generic equation leads to uncertainty in the estimation of biomass11,12,13,14,15. Species-specific allometric equations can reflect biomass variance caused by differences in tree nature (number of stems, height to branches), age, diameter, stand density, cultivars, site environment (climate and soil), and management approach.

In Ethiopia there is growing interest in developing allometric equations, for example, some researchers develop site-specific allometric equations11,12,13,14,15 and species-specific12,16. In Moist Evergreen Afromontane Forest, only a limited number of tree species developed species-specific allometrics. For example, Olea europaea by Kebede and Soromessa17; Diospyros abyssinica by Daba and Soromessa18; Albizia grandibracteata and Trichilia dregeana by Daba and Soromessa19. Even though the country has made significant efforts to develop allometric equations at site and species level. According to Sebrala et al.20, Ethiopia lacks sufficient national models for assessing forest biomass and carbon stocks and monitoring changes using Tier-2 and Tier-3 methodologies recommended by the IPCC. Additionally, there are issues with the developed model's inclusion of wood basic density as a predictor variable in species-specific allometric equations, taxonomic naming issues, and the notion that models are best fitted when they have negative parameter estimators. So, the present study aims to develop allometric equations for estimating above-ground biomass for five selected tree species in the Moist Evergreen Afromontane Forest of South-Central Ethiopia. The specific objectives were to develop species-specific allometric equations for the estimation of above-ground biomass for five native tree species; to develop a biomass model for the tree compartment of this selected tree; to evaluate and compare the newly developed allometric equations with the existing site-specific and generic allometric equations.

Materials and methods

Study site location

The study site (or area) was located in the Wondo Genet natural forest on the southeast part of Ethiopia (Fig. 1), [N 7° 5.4′–N 7° 7.2′, and E 38° 38.4′–E 38° 40.4′]. The altitude ranges from 1850 to 2400 m.a.s.l. It is categorized in the remnant Moist Evergreen Afromontane Forest21 located in the protected and inaccessible mountain chains of Abaro. The area has a mean annual rainfall and temperature of 1200 mm and 22 °C respectively22 with bimodal rainfall distribution with longer precipitation from June to October and lower from March to April23. The topography of the study area has 43.5% mountains and hills, 36.25% flat areas, and 20.25% undulating parts of the district24. The soils are young and of volcanic origin, characterized by well-drained loam or sandy loam. The soil pH of the study area is between 5.6 and 6.525.

Figure 1
figure 1

Map of the study area.

Data collection

Sampling design and sample tree selection

To estimate the number of native trees to be included in the development of the allometric equation and to observe how tree species are distributed over the diameter range, tree inventory data collected by Asrat et al.26 were used. Based on this inventory result, the dominance of tree species was calculated based on the following formula.

$${\text{Basel}}\;{\text{area}} = \left( {\pi d^{2} } \right)/4$$
(1)

where “d’” is the diameter of the tree. Accordingly, five tree species (i.e.: Albizia gummifera, Bersama abyssinica, Croton macrostachyus, Vepris dainellii, and Syzygium guineense) were selected for the development of the allometric equations. A total of 59 trees were harvested with a minimum diameter of 5 cm and a maximum diameter of 106.5 cm for the development of the allometric equation and covers 32.0% of the basal area. . The diameter classes were formulated with a 10 cm diameter interval for each tree species. Representative sample trees were distributed in diameter class based on the basal area proportion and sample trees were selected systematically within the diameter class. Trees having unusual forms such as broken crowns and stem knots were removed from the selection in model development unless they represent a significant portion of the forest, and trees grown in the unrepresentative site such as forest edge were not included. Hence, trees that are free from broken branches and defects were selected for harvesting27. Before the tree felled, each sample tree from each species was identified and located with a GPS coordinate point and marked by the researcher and one local guide. Number and statistical summary of sampled trees; and distribution of the harvested tree with diameter class for each tree species presented in (Table 1).

Table 1 Summary (Range, Mean, and Standard error) of biometric attributes of the harvested sample trees.

Biomass determination

The destructive method was employed for the determination of the biomass of individual trees. After the tree diameter at stump height (0.3 m), diameter at breast height (1.3 m) (if buttress occurred tree diameter measured above buttress 0.3), and crown diameter was recorded. The tree was cut down closest to the ground and the total tree height using a tape meter was measured. The felled tree is sorted into three main sections: stem (stump plus to top > 10 cm diameter), branches (tree parts apart from the main stem and diameter > 2 cm), and foliage (leaves, twigs, small branches diameter < 2 cm, and fruit part). The section of all felled trees weighed independently in the field using a hanging balance (200 kg capacity). The weight of the stump was determined by using the volume of the stump and the wood's basic density.

For the determination of foliage dry to fresh ratio, the foliage 200-250 g sample was taken from each tree. Additionally, four disks constitute three from the stem part, and one from the branch to determine the dry-to-fresh weight ratio of stem and branch. The fresh weight of the sample was measured immediately in the field to avoid moisture loss. Then after labeling, the sample was transported to the WGCF-NR laboratory for oven-drying at 72 C for foliage, and 103 C for the wood part until it reached a constant weight27. The dry weight of the sample was determined by digital balance (± 0.1 g). Finally, the dry weight of each section was determined by taking the dry-to-fresh weight biomass ratio.

Data analysis

The data analysis was undertaken in R software version 4.0128 by using 'nlstools' package29. Prior to conducting the analysis, an investigation was performed by plotting a scatter plot (Fig. 2) to examine the relationship between dependent variable (above ground biomass) and independent variable (diameter at breast height). Accordingly, nonlinear relationship between the independent and dependent variables were observed. As a result, nonlinear regression methods were established on power model. Consequently, power models used in several studies14,15,26 were tested in the present study. Based on this we formulated six different model forms for testing the Species-specific biomass models, by using dbh and dsh as sole predictors and combined with a stepwise inclusion of ht and crw.

Figure 2
figure 2

Scatter plot dbh (cm) against Total above-ground biomass (TAGB) (kg).

Additionally, weighted regression was employed to reduce the heteroscedasticity in nonlinear regression30. It is the method of data transformation used in our data set to remove the error variance. Based on the procedure adopted by Picard et al.27, the weighting Factor (“c”) will be developed for each tree species. Finally, the weight will be = \(\frac{1}{{({\text{dbh}})}^{{\text{c}}}}\) ; where dbh, is the diameter of the tree, and “c” is the weight factor.

$${\text{AGB }} = a{* }\left( {dbh} \right)^{b} \quad \left( {{\text{Model 1}},{\text{ M1}}} \right)$$
$${\text{AGB }} = a{* }\left( {dbh} \right)^{b} {*}\left( {ht} \right)^{c} \quad \left( {{\text{Model 2}},{\text{ M2}}} \right)$$
$${\text{AGB }} = a{* }\left( {dsh} \right)^{b} \quad \left( {{\text{Model 3}},{\text{ M3}}} \right)$$
$${\text{AGB }} = a{* }\left( {dsh} \right)^{b} {*}\left( {ht} \right)^{c} \quad \left( {{\text{Model 4}},{\text{ M4}}} \right)$$
$${\text{AGB}} = {\text{a}}* \left( {{\text{dbh}}} \right)^{{\text{b}}} *\left( {{\text{cd}}} \right)^{{\text{c}}} \quad \left( {{\text{Model}}\;5,{\text{M}}5} \right)$$
$${\text{AGB }} = a{* }\left( {dbh} \right)^{b} {*}\left( {ht} \right)^{c} {* }\left( {cw} \right)^{d} \quad \left( {{\text{Model 6}},{\text{ M6}}} \right)$$

where: AGB is the biomass of the tree in (kg), dbh is the tree diameter at breast height (cm), ht is tree height in (m), dsh is the diameter at stump height (cm), cd is the crown diameter in (m) and a, b, c, d, are model parameters.

Model evaluation and comparison

For evaluating the models' performance, cross-validation specifically leave one out cross-validation (LOOCV, where a model fitted to the ‘n-1’ dataset and then performance is assessed on the single observation left out and repeats the procedure n times until all observations covered by the process) was used. It is an efficient method of model validation, where every data set is used for training and test data31. This kind of model validation is important when a small data set exists36. Furthermore, it has no randomness since each observation is used as a training and validation. Then the developed allometric models were evaluated through goodness-of-fit measures such as mean prediction error (MPE), root mean square error (RMSE), Akiaka information criterion (AIC), True Relative Error (TRE), and R-square. Thus, models that recorded the lowest value of MPE, RMSE, AIC, and the higher values − R2 were selected. Paired t-tests were used to see the difference between observed and predicted values. Pearson’s correlation test was also used for testing the correlation between AGB and independent variables (dbh, dsh, crown diameter, and height).

$$\mathrm{RMSE }=\sqrt{\frac{1}{n}(\sum_{i=1}^{n}(yi-}{\widehat{ yi})}^{2}$$
(2)
$${\text{RMSE}}\% = \frac{RMSE}{{\overline{y}}}* 100$$
(3)
$${\text{MPE}} = \frac{1}{n}{ }\mathop \sum \limits_{i = 1}^{n} (yi - \widehat{{{ }yi}})$$
(4)
$$\mathrm{MPE\% }=\frac{{\text{MPE}}}{\overline{{\text{y}}} }* 100$$
(5)
$$\mathrm{TRE\% }=\frac{\sum \widehat{ {\varvec{y}}}i-yi}{\sum yi}*100$$
(6)
$$R^{2} = 1 - \frac{{\mathop \sum \nolimits_{i = 1}^{n} (y_{i} - \overbrace {{y}_{i}} )^{2} }}{{\mathop \sum \nolimits_{i = 1}^{n} (y_{i} - \overline{y})^{2} }}$$
(7)

where: MPE is the mean prediction error, RMSE is a root mean square error, \({\varvec{y}}i\) is the observed value of the ith sample tree, \(\widehat{{\varvec{y}}}i\) is the predicted value of the ith sample tree, \(\overline{\mathbf{y} }\) is the mean observed value and n is the number of observations.

To compare model performance, the best-ranked model in the present study was used to compare it with previously developed pantropic32,33,34 and site-specific models26. The models used in the comparison are presented in Table 2.

Table 2 Selected previously published model for comparison.

Ethical approval and consent to participate

The collection of plant material and the performance of experimental research on such plants complied with the national guidelines of Ethiopia.

Results

Correlation between tree variables and different biomass components of the tree

Spearman correlation between the independent and dependent tree variables is presented (Table 3). For all tree species tree dependent variable, has a strong relationship with dbh. For the C. macrostachyus tree, height has a weak relationship with the tree-dependent variable. Regarding crown diameter, B. abyssinica doesn’t strongly correlate with all tree’s dependent variables except merchantable stem biomass. One of the factors for the variation of the correlation between tree species and different biomass compartments nature of species and growth conditions. Based on the nature of the tree species and growing conditions the relation between the tree variable and the biomass component of the tree will be affected35.

Table 3 Correlation Matrix between dependent and independent tree variables.

Species-specific allometric equation for the selected tree species

The best-performed allometric equation with a measure of goodness-of-fit for the five tree species for all compartments is presented (Table 4) and all tested models are presented (Appendix A and B) The selected model holds the highest R-square, lowest RMSE, MPE, and AIC. The model that gives negative and insignificant parameters is not considered a valid model. Accordingly, for the total above-ground biomass (TAGB) model (M1), with dbh sole predictor gives significant parameter estimates for all tree species. And explained 99.3% of biomass variation for S. guineense whereas the lowest by B. abyssinica 89.3%. Whereas M3 with dsh as a single predictor variable explained 97.8% of the variation in the case of A. gummifera followed by C. macrostachyus (97%), and V. dainellii (72%). The addition of other tree variables in the model didn’t improve the model performance and resulted in a negative regression coefficient in some cases.

Table 4 Selected species-specific biomass model for all tree compartments and model performance evaluation.

Based on the analyzed result, M3overestimated the foliage biomass by 8.6 kg for A. gummifera, while M1 underestimated by 13.7 kg for S. guineense. For B. abyssinica including height with dbh results in better estimation by explaining 98.6% of the variation in Merchantable stem biomass. Correspondingly, M4 with dsh and height results in a better estimation of Merchantable stem biomass, by overestimating the biomass with 31.3 kg for S. guineense. In C. macrostachyus, all tested models for the tree like Foliage, Branch, and Merchantable stem resulted in insignificant regression coefficients.

The observed and predicted AGB for the selected model were plotted in (Fig. 3). The result showed that there is no significant difference between the observed AGB and predicted AGB for the best-selected model. Based on the P-value there is no proof to reject the null hypothesis (intercept = 0 and slope = 1). However, the discrepancy between observed and predicted varies between tree species.

Figure 3
figure 3

The relationship between the observed and predicted total ABG of the five tree species. The red line represents the line that best fits the residuals, while the black line represents the 1:1 line.

Biomass model comparison with the previous study

The comparisons were made by applying the previously published generic allometric model, site-specific model, and selected model M1 of the current study on our data, and the result is shown in (Table 5). The model by Brown36 overestimates the AGB For S. guineense (77.8%), C. macrostachyus (66.4%), B. abyssinica (43.2%), and A. gummifera (9.6%). On the other hand, a model developed by Chave et al.33,34 underestimate the AGB for A. gummifera (17.8% and 17.4%), V. dainellii (50.3% and 47.4%), and B. abyssinica (21.2% and 16.4%), respectively, whereas overestimate the AGB for S. guineense (15.8% and 17.1%) and overestimate for C. macrostachyus (9.7% and 12.4%), respectively. The site-specific model developed by Asrat et al.26 overestimates the biomass for C. macrostachyus (37.4%) and S. guineense (71.9%). In all tree species, the currently developed model M1 has the least prediction error with the highest for V. dainellii (7.3%) whereas the lowest is B. abyssinica (0.1%). The comparison based on the value of Total relative Error (TRE) and Mean prediction Error (MPE) indicates that for all tree species, the developed model in the current study outperformed the previously developed model.

Table 5 Comparison of the selected general models and previously published both generic and for each species.

Discussion

Species-specific allometric equation for the selected tree species

A considerable proportion of variations in the total above-ground biomass (TAGB) are explained by the dbh as a sole predictor variable in each tree species case. The highest explaining potential of dbh is present in S. guineense (99.3%), and the lowest in B. abyssinica (89.3%), This finding is similar to those reported in several studies12,17,37,38.

The addition of tree height in the model results in a negative regression coefficient and doesn’t improve model performance. This finding is inconsistent with some studies39,40,41. Tree allometry is affected by differences in tree nature (number of stems, height to branches), age, diameter, stand density, cultivars, site condition (climate and soil), and management practice42. For example, a tree that grows in an open forest will have a shorter height than a tree that grows in a closed forest for the given diameter. This also affects the relationship the biomass and tree height43,44. Due to this diameter is the most important tree variable in the estimation of biomass.

Regarding crown diameter, the inclusion of this predictor variable in the model did not improve the biomass estimation. This finding is inconsistent with some studies14,15,26. The forest ecosystem of Wondo Genet was exposed to disturbance from fuelwood collectors, illegal logging, and man-made fire21,23. This reduces competition for the upper canopy and in this type of forest, trees invest more in diameter than height45 and tree allometry will be changed. In this case, dbh will be an important predictor for the estimation of above-ground biomass. Besides, dsh is another important tree variable explaining the variation existing in biomass and performed better than dbh for tree species like C. macrostachyus, but because of the measurement difficulty in natural forests, models that include dsh are not recommended as the best option for further application.

Species-specific biomass model comparisons with previous study

The best-ranked selected model was compared with one site-specific developed generic allometric equation and three pantropic allometric equations. Additionally, different statistics are used as performance indicators to evaluate the performance of each model. A pantropic model developed by Brown36, overestimates the biomass by 9.6–77.8%. As well, Chave et al.33 gave a prediction error range of 12.4–47.4%; and Chave et al.34 resulted in an error range from 15.8—50.3%. This finding is in line with some reports12,15. However, for some tree species, the pantropical model performs well; for example, Chave et al.34 and Brown36 did not show a significant bias for tree species C. macrostachyus and V. dainellii respectively. This suggests that the bias of the pantropic generic allometric equation varies between tree species16. The tested site-specific allometric equation overestimated the biomass for tree species such as S. guineense (71.9%) and C. macrostachyus (37.4%) and did not exhibit significant bias towards the other tree species. The performance of the site-specific and pantropic generic model in the forest biomass estimation leads to some uncertainties46. Species-specific allometric equation plays an important role in reducing the uncertainty associated with the estimation of biomass. Whenever there is a lack of species-specific allometric equation site-specific model is more important47 for the estimation of above-ground biomass than a pantropical model. Tree allometry is affected by the environmental conditions and nature of the tree, while the pantropic model data was collected from outside the Ethiopia; Brown36, collected from Central and South America and Southeast Asia; Chave et al.34 collected Tropical America and Asia; but Chave et al.33 incorporated some tree species as part of Africa. Due to the above-mentioned factor, the application of the pantropical model leads to uncertainty, and using a species-specific allometric model reduces the uncertainty in biomass estimation.

Conclusions

Interest in the estimation of the biomass and carbon sequestration potential of the forest increased because of the result-based incentive in forest management and conservation. In this regard, allometric equations give an insight into the potential of an intervention, and how much biomass and carbon are stored in the forest. Species-specific allometric equations for the estimation of above-ground biomass were developed for five tree species. The developed models have a great role in improving the accuracy of biomass estimation. Models that used only dbh will have importance in reducing cost in the measurement. The best-ranked allometric equation compared with the allometric equation developed as a site-specific and pantropic generic equation; however, the developed species-specific allometric equation showed better accuracy in the estimation of aboveground tree biomass. The developed biomass model was applied to a Moist evergreen Afromontane Forest, considering the diameter range for each species. However, to improve the model further, it is necessary to include sample trees from various locations within the forest ecosystem.