Introduction

Local biodiversity faces a serious threat from the introduction of alien invasive species, which can result in extinctions or population declines1,2,3,4,5. The trade of invasive species facilitates the establishment of non-native species in new areas6. In many cases, the intentional introduction of non-native species represents an important benefit to economies7. However, the trade of non-native species, e.g., Bombus terrestris, produces conflicts between the associated benefits and problems1,3,8,9,10.

In approximately 1900, a European bumblebee, Bombus terrestris (Hymenoptera: Apidae), was first introduced in New Zealand for pollination services. Currently, it is an invasive species and has spread well beyond its native range10,11,12,13. Since the 1980s, with the development of greenhouse agriculture, millions of colonies of bumblebees (B. terrestris) have been produced per year as managed pollinators to meet the demands of pollination services worldwide. Trade in bumblebee colonies has provoked concerns over the invasion of B. terrestris in regards to competitive displacement and the potential for horizontal disease transmission to local bumblebees5,14,15,16,17,18,19,20,21 via accidental escape of queens from greenhouses that establish colonies in areas where this bumblebee is non-native10. Recently, North America, Japan, Chile, and Argentina have faced challenges due to the establishment of B. terrestris and population declines of local bumblebee species3,16,22.

Among countries worldwide, China has the richest bumblebee diversity; 125 species are found in China, representing 50% of the total number of bumblebee species worldwide. Bumblebee pollinators play significant roles in agricultural and natural ecosystems23,24. Although B. terrestris is now part of the Chinese bumblebee biodiversity, it originated in Europe and has been expanding its range from Europe to Central Asia, including into northwest Xinjiang, the westernmost region of China bordering Kazakhstan25,26. Four decades ago, bumblebees collected in Xinjiang were considered B. lucorum, but recent DNA barcoding has confirmed that the B. lucorum complex in this region now includes B. terrestris27,28. The continued expansion of this species towards eastern China has been restricted by the natural barriers of the Takla Makan Desert, the Badain Jaran Desert and the high-altitude Tibetan Plateau between the western and eastern parts of China. In the past 10 years, commercial B. terrestris colonies from Europe have increasingly been introduced into China and some East Asian countries to meet the demands of crop pollination in greenhouses. This practice has raised the question of whether there are risks to the habitats of native bumblebees from this introduced species. They might pose a serious threat to native bumblebees considering the high likelihood that native bumblebees they will interact with B. terrestris around flowers29. Habitat or range contact (overlap) can enhance interspecific competition between species30, and B. terrestris introduction has already disturbed the natural mating of closely related local bumblebee species in Japan31. An analysis of habitat overlap risk is necessary to identify the areas that may have favourable conditions for interspecific competition between B. terrestris and vulnerable local bumblebee species. Such an analysis can be performed through spatial distribution modelling (SDM)32 and GIS (geographical information system) approaches. SDM techniques are considered good predictive tools and have been used previously to model the distribution of invasive or unknown species33,34. These approaches are among the most important for developing conservation strategies for declining bumblebee species19,35. These techniques combine climatic data and species occurrence records to identify the most suitable environmental conditions for population maintenance or possible species overlap. If the niches of different bumblebee species overlap, interspecific competition for food resources and habitats between invasive and local bumblebee species may occur. As overall biases and uncertainty are inherent to different prediction algorithm models, these models predict distributions differently36,37,38. Therefore, in the present study, five modelling approaches were used to identify the areas most suitable for B. terrestris; the results may guide the development of effective importation regulations for this species within East Asian countries, including China. This study attempts to answer the following questions: (1) Are all East Asian countries susceptible to B. terrestris invasion? (2) What regions of China offer potential habitats facilitating B. terrestris development and establishment? (3) Are some groups of local bumblebee species more sensitive than others to habitat overlap with B. terrestris within China?

Results

Modelling accuracy

The area-under-the-curve (AUC) values showed average to excellent model performances (0.77~0.98). However, variation in model performance from poor to excellent was observed based on the true skilled statistics (TSS) values (0~0.98). The ranges of TSS values were 0~0.86, 0.27~0.97, 0.35~0.95, 0.40~0.93 and 0~0.98 for Envelope Score, Environmental Distance, Genetic Algorithm for Rule Set Prediction (GARP), MaxEnt and Support Vector Machine (SVM), respectively (see Supplementary Table S1). The highest modelling accuracy was found with MaxEnt for almost all 25 species including B. terrestris, and the models with the next highest levels of accuracy were GARP and Environmental Distance, Envelope Score and SVM for 24, 19, 19, 7 and 1 species, respectively, under the receiver operating characteristics curves (ROC) threshold (see Supplementary Fig. S1). Different models (with TSS >0.5) were selected for each species to calculate the final “summed” distribution maps; however, for six species, B. ignitus, B. longipennis, B. lucorum, B. patagiatus, B. picipes and B. pyrosoma, only one modelling algorithm had a TSS >0.5 (see Supplementary Table S1).

Potential distributions of bumblebee species

Predicted “suitable” and “highly suitable” areas for B. terrestris were detected in all East Asian countries, with the highest risk of invasion occurring in China, followed by Japan, North Korea, South Korea and Mongolia (Figs 1 & 2). However, within China, three different regions, north-west, central to south-east and north-east, were determined to be at risk; these areas cover an area of 2,001,333 km2, representing 21% of the total study area. B. terrestris expanded originally from Europe to the north-west part of China, so the central to south-east and north-east parts of China are new potential habitats for B. terrestris to become established and develop. These three zones cover 24 provinces considered to be under the threat of invasion from B. terrestris: Xinjiang, Gansu, Ningxia, Shaanxi, Shanxi, Hebei, Henan, Shandong, Chongqing, Hubei, Hunan, Guizhou, Guangxi, Guangdong, Fujian, Jiangxi, Anhui, Jiangsu, Shanghai, Zhejiang, Neimenggu, Liaoning, Jilin and Heilongjiang (Fig. 3).

Figure 1
figure 1

Potential distribution of Bombus terrestris in East Asian countries. Red represents highly suitable habitat, yellow represents suitable habitat, and green represents unsuitable habitat for B. terrestris. The map was created with ArcGIS v 10.0 (www.arcgis.com).

Figure 2
figure 2

Suitable habitat areas (km2) for the invasion of Bombus terrestris in all East Asian countries.

Figure 3
figure 3

Map of the Chinese mainland showing the predicted distribution of Bombus terrestris with the names of the provinces in China. Red represents the highly suitable habitat, yellow represents suitable habitat, and green represents unsuitable habitat for B. terrestris. The map was created with ArcGIS v 10.0 (www.arcgis.com).

The modelling results revealed the potential distributions of all local bumblebee species across the different regions of China. The potential “suitable” and “highly suitable” areas for these 25 species ranged from 665,430 to 5,707,845 km2. The species with the largest distribution area, representing 59% of the total study area of China, was B. sibiricus, and B. longipennis had the smallest predicted distribution area, representing 7% of the total country area (Table 1). The optimum cut-off values for all 25 species ranged from 0.44~0.91, as shown in Table 139.

Table 1 Bumblebee species, their distribution, abundance, and occurrence records used for developing the models; Youden Index values; area under the curve (training and testing); potential suitable areas; overlapping areas; and vulnerability status within China.

Six of seven native Tibetan bumblebee species, B. convexus, B. minshaenensis, B. rufofasciatus, B. supremus, B. ladakhensis and B. longipennis, had potential suitable and highly suitable habitats towards the Tibetan Plateau of China. However, only one species from this region, B. friseanus, had a potential distribution towards both Tibet and south-east China. All five southern species, B. bicoloratus, B. braccatus, B. breviceps, B. picipes and B. trifasciatus, had potential distributions towards the southern and south-eastern parts of the country. Unlike the other regions, the species from the northern regions had four different types of distribution trends. Two of these species, B. hypocrita and B. sporadicus, had potential distributions towards the north-east of China. Four species, B. lantschouensis, B. patagiatus, B. longipes and B. ussurensis, were present from central to north-east China. Two species, B. ignitus and B. pyrosoma, had distributions towards central to northern China. The remaining four species, B. cryptarum, B. lucorum, B. muscorum and B. sibiricus, had distributions towards north-east and north-west China (see Supplementary Fig. S2).

Habitat overlap risk analysis

There were 19 of 24 (79%) native bumblebee species overlapping with B. terrestris (Table 1). On average, the potential area of overlap between B. terrestris and native species was 464,900 ± 360,527 km2 (mean ± standard deviation). The sensitivity of habitat overlap was found to be greater (89%) for the “closely related” group (biological group) of the same subgenus, Bombus s. str., with B. terrestris compared to the “non-closely related” group (73%) of other subgenera of bumblebee species. For regional species, species from both the “southern” and “northern” regions had maximum overlap (100%) compared to the species from the “Tibetan Plateau” (29%). However, in the overlap areas, the species from the southern region had greater overlap with B. terrestris than did the species from the northern region, and only two species from the Tibetan Plateau had (slight) overlap with B. terrestris (Figs 4 and 5).

Figure 4
figure 4

Habitat overlap risk analysis between Bombus terrestris and vulnerable native bumblebee species in China. Map was created with ArcGIS v 10.0 (www.arcgis.com).

Figure 5
figure 5

Potential suitable areas vs overlapping habitat areas (%) between native bumblebee species and B. terrestris. The three shades of grey representing the “southern”, “northern” and “Tibetan Plateau” regions revealed that the species from the southern region have the most overlapping habitat areas with B. terrestris followed by the northern region and the Tibetan Plateau.

The spatial distributions of overlapping habitats between native and invasive species were found in all three regions with suitable habitat for B. terrestris: north-west, central to south-east and north-east within China. All five “southern” species, including B. bicoloratus, B. braccatus, B. brevicepsB. picipes and B. trifasciatus, mainly had potential overlapping habitats in central to south-east China. Six of the 12 northern species, B. cryptarum, B. hypocrita, B. lantschouensis, B. longipes, B. lucorum and B. muscorum, had potential overlapping habitats in all three regions with habitats suitable for B. terrestris. Five other “northern” species, B. ignitus, B. patagiatus, B. pyrosoma, B. sporadicus and B. ussurensis, had potential overlapping habitats towards central to north-east China. However, only one “northern” species, B. sibiricus, had a potential overlapping habitat towards central to north-west China. Similarly, two “Tibetan Plateau” species, B. minshaenensis and B. rufofasciatus, had potential overlapping habitats towards central to north-west China.

Eight of the nine species from the “closely related” group had overlapping habitats, and the range of overlapping habitat area was 114,425~776,801 km2 (1~9%). However, 11 of the 15 species from the “non-closely” related group had overlapping habitats, and the range of overlapping habitat area was 109,520~1,160,506 km2 (1~12%). Although all species in the “southern” and “northern” regional groups had overlapping habitats, the species in the “southern” group had a greater area of overlapping habitats, from 389,307~1,160,507 km2 (4~12%), than that, from 132,155 km2~776,801 km2 (1~8%), of the species in the “northern” group. However, the species from the “Tibetan Plateau” had an overlapping habitat area that ranged from 109,520~114,425 km2, which was only 1% of the whole area of China (Table 1, Fig. 5).

Discussion

In comparison to a single-model approach, summed distribution maps can provide more accurate estimations of the spatial distributions of species40. In the present study, five modelling approaches were implemented here to accurately determine the spatial distributions of B. terrestris throughout East Asian countries and of 24 local bumblebee species within China. To assess the accuracy of such models, TSS, AUC and kappa statistics values can be used. However, there are inherent limitations to using kappa statistics41, and AUC approaches do not allow the prediction of detection differences between more than two models42. The prevailing problem in kappa statistics involves their dependency on the ratio between the proportion of correctly predicted presences (sensitivity) and the proportion of correctly predicted absences (specificity)41. To avoid these problems, TSS values were calculated to assess model accuracy in the present study. The different ranges of TSS values among the different modelling approaches are due to the different mechanistic features of the models, and typically, these different features lead to model differences in predicted distributions37,43. These TSS ranges facilitated our evaluation of the modelling approaches used here. MaxEnt attained the highest performance among the modelling approaches, which is consistent with previous studies40,44,45,46,47.

In contrast to the findings of Acosta19, where East Asian countries including Japan, South Korea, North Korea and south-eastern coastal parts of China were considered suitable for B. terrestris invasion, our habitat suitability modelling approaches predicted that Mongolia as well as north-west, central to south-east and north-east parts of China (covering 24 provinces) are also suitable for invasion by B. terrestris. These regions are physiographically similar to Europe and have already been predicted to be climatically suitable for B. terrestris48,49. However, none of our models predicted that habitat susceptibility to invasion would occur only towards the south-eastern area. Our results are supported by the presence of some new records of B. terrestris in the predicted suitable habitats in China as well as in neighbouring countries surrounding China.

Interspecific mating between “closely related” invasive and native bumblebee species may occur, disturbing reproduction and reducing local bumblebee diversity. Likewise, in Japan, the invasive species B. terrestris has been found to mate with its closely related local species B. hypocrita and is responsible for the reduction in the local bumblebee population50. In addition, hybrid-mating disturbance has been found between B. terrestris males and native queens of Chinese B. lantschouensis under artificial conditions and may also occur in the field51. Although eight of the nine species from the “closely related” group (species of the same subgenus, Bombus s. str.) within China overlapped with B. terrestris, four species of the “non-closely related” group from the subtropical region of south China, B. trifasciatus, B. braccatus, B. breviceps and B. bicoloratus, had greater areas of overlapping habitat than did any of the “closely related” species” (Fig. 5). This habitat overlap between the invasive species and native species from the “non-closely related” group might be due to their ecological similarities30. More suitable similar habitats are responsible for the maximum overlap between B. terrestris and local bumblebees. More importantly, B. muscorum from the “non-closely related” group is already categorised as vulnerable in Britain52 and is vulnerable in China. Given that the foraging distance of B. muscorum (which forages mostly within 500 m of its nest) is shorter than that of B. terrestris (which forages mostly over 2 km from its nest)53, Habitat overlap with B. terrestris may be harmful for this local species because B. terrestris may be able to outcompete B. muscorum for food resources within areas close to the latter’s nests.

Alarmingly, eight species endemic to China23, B. braccatus, B. convexus, B. longipes, B. minshaenensis, B. picipes, B. pyrosoma, B. supremus and B. trifasciatus, are threatened by habitat overlap with the invasive B. terrestris. These species are important pollinators of many wild flowers and crops within China24, and endemic species are considered more important to local biodiversity than are non-native species54. Therefore, a management strategy should be implemented that prioritises the conservation of these endemic bumblebees.

Similarly, some of the overlapping species, specifically, B. braccatus, B. cryptarum, B. hypocrita, B. lucorum and B. muscorum, were “rare” in terms of abundance in this study (Table 1). Two of them, B. lucorum and B. muscorum, are also “moderately rare” in Hungry and Russia55,56,57,58. These observations highlight the potential for declines in vulnerable species within a country. However, due to species differences in food selection, the conservation strategies used for other species will not be effective for these rare species. Thus, from a conservation point of view, specific conservation planning is urgently needed to conserve these bumblebee species59,60.

The native bumblebee species from the “southern”, “northern” and “Tibetan Plateau” regions were distributed at different altitudes (Fig. 6). All the native species from the southern and northern regions at mean elevations <2500 metres had overlapping habitats. However, for the high, cold species from the Tibetan Plateau at mean elevations >2500, only two species, B. minshaenensis and B. rufofasciatus, had a very small proportion of overlapping habitats (<1% areas) with B. terrestris, and the remaining species were not overlapping (Fig. 6). This scenario might indicate that the high, cold region of the Tibetan Plateau has the least amount of suitable habitat for the invasive species or that these species have ecological dissimilarities30.

Figure 6
figure 6

The proportion of contribution of the potential suitable habitat areas (%) of both B. terrestris and native bumblebee species that overlap at different mean elevations.

Our current findings regarding the distribution of B. terrestris and its overlapping habitats with different groups of local bumblebees showed that the local bumblebees of other East Asian countries might also be threatened by habitat overlap and loss of native biodiversity. Proper management strategies and coordinated implementation of regulations related to the introduction of B. terrestris61 should be considered for all East Asian countries to conserve the natural biodiversity of local bumblebee species within this region.

Methods

Modelled species

Records of B. terrestris were based on its natural distribution within China and surrounding countries, such as Kazakhstan, Kyrgyzstan and Russia, as well as within the areas close to China such as Japan and South Korea, where it is invasive (www.discoverlife.org) (Fig. 7). For the local bumblebees in China, we selected 24 native bumblebee species and B. terrestris with a total of 26,497 specimen records from the IAR (CAAS Institute of Apiculture Research) collection that covers the whole region of the Chinese mainland (Table 1). These species were represented in >50% of the records of the 125 bumblebee species in China and were selected by two principles, 1) representatives of two groups: species “closely related” to B. terrestris, i.e., from the subgenus Bombus s. str., all known in China (n = 10) and “non-closely related” species, i.e., species of other bumblebee subgenera (n = 14), and 2) representatives of the three major regions of China, i.e., the high cold “Tibetan Plateau” and the temperate “northern” and subtropical “southern” regions (Table 1).

Figure 7
figure 7

Relief map of the Chinese mainland showing the sample sites of the 25 bumblebee species with the different colourful spots representing the different species in this study. The records of B. terrestris for areas surrounding China are from (www.discoverlife.org). The international boundaries are shown in grey with the names of countries in capital letters. The map was created with ArcGIS v 10.0 (www.arcgis.com).

Environmental layers and modelling procedures

In our modelling, we used 19 bioclimatic and elevation layers obtained from www.worldclim.org with a grid cell size of 5 arc-min resolution (≈10 km2) of the WGS1984 projection62 with full coverage of East Asian countries. We converted all these layers into ASCII format to use in our modelling software with ArcGIS v 10.0. We calculated Pearson’s correlation coefficients with the species distribution modelling toolbox63 of ArcGIS v 10.0 to reduce the collinearity among these bioclimatic layers, and we retained only seven bioclimatic variables together with the elevation layers, which all yielded Pearson correlation coefficient (r) values < 0.8, to avoid overfitting the model (see Supplementary Table S2). Similarly, we used a spatial rarefy tool of the SDM toolbox63 to remove all the presence points of species that have distances <10 km2 to avoid spatial autocorrelation among these occurrence records of bumblebee species during modelling (Table 1). All the occurrence records were converted into comma separated value (CSV) comma delimited and text tab delimited formats for the use in MaxEnt v 3.3.3k and OpenModeller Desktop software v 1.1.0, respectively. For MaxEnt software, we prepared a biased file of all 25 species occurrence records for background point selection using ArcGIS v 10.0.

Five modelling algorithms, MaxEnt64, Envelope Score65, Environmental Distance66, GARP67 and SVM68, were implemented to increase the reliability of the potential distributions as well as to identify areas of overlap between the 24 native bumblebee species and B. terrestris40. These models were selected mainly for their different mechanistic features69. Due to the variation in the distribution data of a species, there is error in the estimation of the spatial distribution of species by different modelling approaches. As a result, similar data used in different modelling approaches can yield very different results70. Therefore, testing more than one model is preferable for addressing the error38. All modelling approaches have inherent differences in the generation of distribution predictions37,43. By comparing their results, one can retain only those modelling approaches that yielded similar and accurate species distributions37. Among the algorithms considered in our study, the envelope score and environmental distance algorithms are simpler algorithms that predict species distributions only on the basis of presence data. In contrast, the remaining three modelling approaches, MaxEnt, SVM and GARP, are more complex and involve artificial intelligence methods. These approaches have been shown to reliably predict species occurrence43. Two software programs, MaxEnt v 3.3.3k and OpenModeller Desktop v 1.1.0, were used to model the distributions of bumblebee species71. We developed our models using 75% of our data for training and the remaining 25% for testing our models.

In contrast to the Pearson72 approach, where the lowest presence threshold (LPT) is used, in this study, only receiver operating characteristics curve (ROC) threshold values were used to cut the modelled suitability matrices. This approach determines the balance between omission and commission errors while determining the spatial distributional ranges of a species40. Furthermore, TSS values were used (which range from −1 to 1) to evaluate the accuracies of the five modelling approaches26,41. Here, values close to zero or negative mean that the distribution is not much better than random, whereas values close to one indicate almost perfect agreement between the modelling prediction and distribution. Normally, TSS values ≥0.5 are considered acceptable40. Finally, we selected only the distribution modelling outputs that yielded TSS values ≥0.5 (see Supplementary Table S1) to develop the “summed” distribution maps within ArcGIS v 10.0 that produced relatively realistic and potentially accurate spatial distributions of the species. In this study, 10,000 random pseudo-absences were used. We categorised our final “summed” potential distribution maps into three categories, 1 = unsuitable habitats, 2 = suitable habitats and 3 = high suitable habitats, based on their specific optimum threshold cut off values within ArcGIS v 10.0. (Table 1).

Assessment of overlapping habitats

To assess the habitat overlap of both invasive and local bumblebee species, the Youden Index was applied, calculated with following the formula: sensitivity + specificity −139,73. This index is used to determine the cut-off point for the models. A high Youden Index value indicates good performance of the model39. The final “summed” potential distribution maps were divided into suitable (>threshold value of Youden Index) and unsuitable (<threshold value of Youden Index) categories within ArcGIS v 10.0. The threshold value for each species is calculated as the average TSS values of all those modelling algorithms with TSS values >0.5 (see Supplementary Table S1). Then, the distribution maps of the 24 local species were overlaid with the final “summed” distribution map of B. terrestris, and the cells were reclassified into three categories using the Arc toolbox of ArcGIS v 10.0: 1 = unsuitable habitat for both species (local and invasive species), 2 = suitable habitat for one species but unsuitable for the other species, and 3 = suitable habitat for both species (Fig. 4). The optimum cut off values for all the 25 bumblebee species are shown in Table 1. All the grid cells that have values below this average threshold are considered unsuitable for species presence or vice versa.