Introduction

The species-area relationship (SAR) that links species richness with area is a fundamental concept as well as a useful tool in ecology and biodiversity conservation1. Since the SAR is affected by multiple factors at various scales, several functions, such as power law equation and random-placement model2,3,4, have been employed for this general convex-upward curve. However, much attention has been paid to statistical or mathematical formulation of the SAR, rather than its biological meaning and fundamental ecological theory3.

In addition, there has been a heated debate over the SAR and its derived concepts, such as the endemics-area relationship (EAR) and the extinction rate calculation5. He and Hubbell4 proposed that the sampling area difference between the SAR and the EAR was key for the overestimation of extinction rates and the backward SAR method (total species number in the total area A minus the species number remained in the area A − a after area a is cleared) was incorrect. For a long time, however, the effect of total area has been neglected, which is seldom shown in the SAR such as the power law format, even if it is an important parameter in the EAR calculation.

It requires the ecologists to clarify the relationship between the SAR and the EAR in the first place to settle the debate over the accurate calculation of extinction rate using the SAR and the EAR. In addition, as an important parameter in the SAR and EAR equations, area (including total area) should be carefully defined and measured. Here, I introduce the concept of the overlap-area relationship (OAR), the number of species that exist in both areas, to link the SAR and the EAR1. Two fundamental equations are derived to describe the general interrelations among the SAR, the EAR and the OAR.

Result

SAR, EAR and OAR

The general SAR equation, where the species number, Sa, is a function of area a, Sa = f(a), is expected to take the form of a convex-upward decelerating curve1,6. Similarly, SA−a = f(Aa) and SA = f(A), where A is the total area. The general EAR equation, Ea = g(a), where Ea is the number of species that disappear if area a is destroyed and g(a) is the function relating Ea and a. Similarly, EA−a = g(Aa) and EA = g(A). Obviously, SA = EA, where the total species number in area A equals the total number of species that will be removed if the whole area is destroyed. The OAR describes the number of species that exist in both areas a and A − a, thus, Oa = OA−a. In addition, OAR is also an important ecological concept to describe the species similarities of two (or many) habitats.

When area a is destroyed (species number for the SAR in the area a is Sa), some species are lost from the whole area A (species number for the EAR in the area a is Ea). However, some of the species Sa still exist in the area A − a (species number for the OAR is Oa). Thus, the general OAR equation is Oa = SaEa (or Oa = Sa + SAaSA), meaning that the number of species that are common to both areas a and A − a equals the species number in area a minus the endemics number in area a (or minus the difference of species number between area A and area A − a). From Oa = SaEa = Sa + SAaSA, we have Ea + SAa = SA, or Sa + EAa = SA, meaning that the sum of the species number in area a and the number of species that disappear when area A − a is destroyed is the total species number in area A. The interrelations among the SAR, the EAR and the OAR are shown in Fig. 1. In other words, SA is the union of sets Sa and SA−a, Oa is the intersection of sets Sa and SA−a and Ea is the difference set of Sa and SA−a.

Figure 1
figure 1

Schematic diagram of species distribution in areas a and A − a.

Sa, Ea and Oa refer to the functions for species-area relationship (SAR), endemics-area relationship (EAR) and overlap-area relationship (OAR), respectively.

For a + (Aa) = A and Sa + EAa = SA, the SAR and the EAR are rotationally symmetric in the interval [0, A] and the center point is (A/2, SA/2) (Fig. 2, Oa0, when A > a > 0). If Oa0, Oa = Sa − Ea, the SAR curve and the EAR curve will coincide. The discussion of mirror images (axial symmetry) between the SAR and the EAR in He and Hubbel's paper4 is misleading, because what has been discussed is rather “rotational symmetry (point symmetry)”.

Figure 2
figure 2

Rotational symmetry of species-area relationship (SAR) and endemics-area relationship (EAR).

The center point for SAR and EAR is (A/2, SA/2). Total species SA = EA.

For the widely used convex-upward decelerating curve, the SAR curve should be enveloped by the straight lines and Sa = SA in the interval [0, A] with Y axis (Fig. 2). And the EAR curve should be enclosed by the straight lines of and Ea = 0 in the interval [0, A] with X axis. OAR is introduced here to link the SAR and the EAR, which is the D-value of SAR and EAR changing with area and the distance between the curves of SAR and EAR in the space. For the power law equation, Oa = caz + c(A − a)z − cAz, the value of the convex curve in the interval [0, A] peaks at a = A/2.

Discussion

In Fig. 3, the area that is required to find n species is smaller than the area that is required to remove n species, although the species that are found or removed are not necessarily the same. Thus, the SAR curve will be higher than or equal to the EAR curve, or SaEa (Oa = SaEa ≥ 0), based on the comparison of their respective areas for the same species number. In addition, for the same species, although sampling difference between the SAR and the EAR areas exist in most cases, except that there is only one individual of the species, the extinction rate can still be calculated accurately based on either the backward SAR or EAR equations in the case of the random placement model, which will produce the same results4.

Figure 3
figure 3

Sampling difference for species-area relationship (SAR) and endemics-area relationship (EAR).

For the number of n species, the sampling area for the SAR is smaller than the area for the EAR.

Apart from spatial heterogeneity, several other factors, such as incomplete sampling, small data sets (few data points) and inappropriate model (fit of a power law model to the SAR), may lead to overestimation or underestimation of extinction rates. Since the calculation of Sa doesn't use SA and the total species number SA can be obtained from the SAR equation, one more parameter (total area A) needs to be considered in the EAR equation Ea = SASAa. For instance, in the power law equation for SAR, Sa = caz, the SAR has two parameters, c and z. The corresponding power law equation for the EAR, Ea = cAzc(Aa)z, has three parameters, c and z, and A. Thus an incorrect total area A can lead to inaccurate estimation of extinction rates. In Fig. 4 (Oa0, when A > a > 0), if total area is not identified accurately, when A′ < A (A the real total area, A′ is an incorrect total area), the calculation of extinction rate based on the backward SAR or EAR equation will be overestimated, no matter equation for the SAR or EAR is used. However, if the incorrect total area is larger than the real total area, the extinction rate based on the SAR or EAR is underestimated.

Figure 4
figure 4

Species-area relationship (SAR) and two different endemics-area relationships (EAR) due to different total area A′ and A. A′ is an underestimated total area and A is the actual total area.

The relationships among the SAR, the EAR and the OAR have been explored. Thus, two fundamental equations, namely, Sa + EAa = SA and Oa = SaEa, are derived for the species-area theory, which can numerically describe the species numbers in the areas a and A − a. In addition, the SAR and EAR curves are rotationally symmetric in the space of Area vs. Number of Species. Because the EAR equation includes one more parameter than the SAR, i.e., total area A and its corresponding total species number SA, incorrect real total area will overestimate or underestimate the extinction rate.