Abstract
The spread of American Bullfrog has a significant impact on the surrounding ecosystem. It is important to study the mechanisms of their spreading so that proper mitigation can be applied when needed. This study analyzes data from national surveys on bullfrog distribution. We divided the data into 25 regional clusters. To assess the spread within each cluster, we constructed temporal sequences of spatial distribution using the agglomerative clustering method. We employed Elementary Cellular Automata (ECA) to identify rules governing the changes in spatial patterns. Each cell in the ECA grid represents either the presence or absence of bullfrogs based on observations. For each cluster, we counted the number of presence location in the sequence to quantify spreading intensity. We used a Convolutional Neural Network (CNN) to learn the ECA rules and predict future spreading intensity by estimating the expected number of presence locations over 400 simulated generations. We incorporated environmental factors by obtaining habitat suitability maps using Maxent. We multiplied spreading intensity by habitat suitability to create an overall assessment of bullfrog invasion risk. We estimated the relative spreading assessment and classified it into four categories: rapidly spreading, slowly spreading, stable populations, and declining populations.
Similar content being viewed by others
Introduction
The American Bullfrog, Lithobates catesbeianus, has been introduced to more than 40 countries worldwide and is listed on the “100 of the World’s Worst Invasive Alien Species” 1. American Bullfrog was introduced to Korea in 1957 and cultivated for the purpose of establishing new food sources for human consumption, but due to its low economic efficiency and low demand as food, most farms gave up on farming and released them into rivers illegally, and bullfrogs were spread throughout the country the country2,3,4. Korea is originally an agricultural society, and even now, no crops other than rice can be grown on farmland5,6. Rice farming requires a lot of water, so the area around the farmland has a good environment for bullfrogs to live in, such as reservoirs, waterways, rice paddies, etc. Here, bullfrogs abandoned by farms find the best habitat and their population has rapidly increased. The negative effects of the bullfrog invasion on native species arise from competition, amphibian and fish predation, as well as the spread of ranavirus and the fungus Batrachochytrium dendrobatidis, which is systematically killing amphibians7. A variety of control methods are needed to prevent further invasions based on local ecology and land use8,9,10,11,12,13,14. In Korea at least 84% of native anurans (frogs and toads) were at moderate to extreme risks, which included all frogs but only 33% of toads. to set conservation priorities and strategies15. It is important to assess the extent of the spread of invasive species by integrating biotic and abiotic data collected at different spatial scales to assess where invasive species monitoring and management efforts should be focused focused16. Bullfrogs have continued to spread in an environment without natural enemies and have now spread nationwide except in some mountain areas in South Korea14,17. The species was reported to occur at 2716 sites, mainly distributed along the southern and western coasts, but rarely occurred in the northern part of Korea or along the eastern coast17. Future predictions suggest continued bullfrog spread14.
The Ministry of Environment and non-governmental organizations tried several approaches to eradicate L. catesbeianus populations which resulted in significant populations declines12,18,19,20,21,22,23,24. However, these actions were discontinued and populations were allowed to expand in some local regions12.
In this study, the intensity of spread by region was calculated using only spatial data, not temporal data. The analysis of the intensity of the spread of bullfrogs in this paper is based on decades of observational data and can be said to reflect the characteristics of each region. The biological and environmental conditions of the habitat vary from region to region and continue to change due to various socio-environmental factors. The population may temporarily decrease over time, but the population may change at any time depending on the characteristics of the region. In particular, although there is currently no significant population increase due to effective control, the population may increase rapidly in areas with high spread intensity at any time if vigilance is relaxed. Knowing the intensity of the regional spread of invasive alien species that cause changes in biodiversity is expected to be of great help in establishing and implementing management policies accordingly. Although observation data may have errors depending on the methods, the data includes environmental characteristics of the area where bullfrogs were observed and reflect many biological and ecological factors. However, it is impossible to observe a large area over time. In this paper, we estimate the intensity of regional spread only with accumulated spatial distribution data. In the process of estimating the spreading intensity, machine learning methods such as the clustering method, ECA method, and CNN method are used. Assessment of spreading is obtained by multiplying spread intensity by habitat suitability. Species distribution modeling software Maxent 3.4.1 was used to estimate habitat suitability by reflecting local environmental and ecological information16,25. The spreading assessments are scored by calculating the intensity of spread and habitat suitability in 25 regions. These are then classified into areas where the population is expected to continue to increase, areas where there is no significant change in the current population, and finally areas where the population is expected to decrease.
Material and methods
Biogeographic distribution patterns of amphibians are analyzed based on the clustering method26,27. Since we do not have time series data of bullfrog distribution, we analyze the spatial distribution using the hierarchical divisive clustering method using scikit-learn 1.3.026,27,28,29.
The entire data is clustered into small clusters, and the degree of spreading is estimated by the evolution rules from the elementary cellular automata scheme30,31,32 in each small cluster. Elementary cellular automata consist of cells with a value of 1 or 0 and are very useful for biological modeling consisting of presence or not data. It has been used for biological and ecological modeling since the 1980’s 28.
CNN is trained to learn the evolution rules 33,34,35,36. By recognizing small clusters as a single image of 0's and 1's, we count the number of 1's, representing the presence location. This allows us to define the spreading intensity as the ratio of the expected number of presence locations over 400 generations to the initial number of presence locations.
When calculating the intensity of spread using machine learning on accumulated observation data, biological and environmental factors were not reflected. We incorporated environmental factors by obtaining habitat suitability. The habitat suitability is achieved using Maxent software 9,16,25,37,38,39,40,41,42. Habitat suitability models can help to understand and predict the dynamics of invasions. MAXENT is a machine learning method that estimates the distribution of a species by finding the probability distribution of maximum entropy, subject to constraints representing our incomplete information about the distribution9. The model evaluates the suitability of each grid cell as a function of environmental variables. The estimated spreading intensity is multiplied by the habitat suitability to express the assessment of bullfrog spreading by region.
Observation data
All data is collected from the results of 3rd investigation of natural environment from 2006 to 2012, the National Wetland Center Report from 2011 to 2017, and the National Institute of Ecology from 2015 to 201717 (See the Supporting Documents and Tables S1, Tables S2). The surveys were conducted by amphibian and reptile experts between January and December of each year. For amphibian identification, daytime observations using fish pots, skimming nets, and visual observations of adults were conducted. Acoustic surveys for amphibian calls were conducted at night. Figure 1a shows the distribution of American Bullfrogs observed in recent decades in South Korea. Bullfrogs have rarely been observed in mountainous and urban areas. There is no time series data for the study area.
The Republic of Korea is a type of mountainous country rarely seen throughout the world and its mountainous area covers more than 70% of the land. Mountains in general are high to the north and to the east, and low to the west and to the south with the ridge of the spine lying inclined toward the east to form steep slopes along the east coast and slow slopes along the west coast in Fig. 1a 43.
Clustering
To estimate the intensity of spreading by region, a sequence of spatial distribution from the observed data in Fig. 1a is constructed using the divisive hierarchical clustering method. All observations start in one cluster of full data, and splits are performed recursively as one moves down the hierarchy by grouping neighboring data into the same cluster27. The scikit-learn clustering software29 is used, and clusters are numbered according to the order in which they are formed. Clustering is performed until 25 clusters are formed to roughly match the size of the administrative district. Rectangular images consisting of 20 by 20 cells are created by uniformly dividing the latitude and longitude including all observations in each cluster. If each cell had a bullfrog observation point, it is marked as 1, otherwise it is marked as 0. Here, the point density of each cell is inhomogeneous. Some cells have one observation point and some have many points. Those cells are equally treated as presence points. We assumed that bullfrogs had never been found outside of the cluster. In each rectangular image, some cells are not included in the cluster, and the corresponding cell value is assumed to be 0. Latitude and longitude information for all clusters is in Table 1. In Maxent Software, it is handled differently when using presence/absence data and when using presence-only data. It is recommended the logistic option for presence-only data, and the cloglog option for presence/absence data25. Hence, the cloglog (default in Maxent 3.4.1) option is used to treat occurrence records as points rather than grid cells to estimate relative habitat suitability25.
To estimate the spreading intensity of each cluster the agglomerate clustering method is performed in each cluster making the sequence of images, \({C}_{0}\to {C}_{1}\to \cdots {C}_{n-1}\to {C}_{n}\). Figure 2b illustrates the agglomerate clustering steps, taking cluster #5 in Fig. 1e as an example.
Learning elementary cellular automata rules
ECA is introduced to find rules in the sequences for each cluster. ECA is a one-dimensional array of cells, where each cell takes either 1 or 0, representing presence or not presence, respectively. It generates the next array depending on its state and the states of its two closest neighbors30,31,44,45. Hence, 256 rules numbering from 0 to 255 are available to represent the sequence evolution. In this study, only the even number rules are used. The odd number rules are excluded because it makes the next generation value 1 when both the current cell and the neighboring cells are 0, so the bullfrog appears after not being present, which is not suitable for the biological spreading model. In extreme cases, when a bullfrog is found in only one location, applying the odd number rule results in bullfrogs being found in the entire cell in the next generation. Each cell in an ECA row represents one generation, where ECA is a one-dimensional array. The next generation is generated by the ECA rules. By reconstructing a one-dimensional array into a 2D image, each generation can be made of a sequence of images that change according to the ECA rules in Fig. 2a.
The rules are learned by training the image change pattern using the Convolutional Neural Network(CNN) 46,47. CNNs are a subset of a class of deep learning algorithms, most commonly used for spatial pattern analysis in biology and ecology33,36,47. Additionally, CNN methods can efficiently classify the predicted distributions of many species35. In this simulation CNNs are trained with the Keras package in TensorFlow48.
Generate training data
The procedure is as follows:
-
Create a 20 by 20 matrix by random seeding of 1’s at 100, 200, and 300 initial points.
-
Reshape the 20 by 20 matrix to a 1 by 400 matrix
-
Generate the next generation of 1 by 400 matrix according to ECA rules.
-
Reshape two consecutive 1D matrices to two consecutive 2D matrices in Fig. 2a, which are considered as one set of images, such as \(({C}_{n-1}, {C}_{n})\) in Fig. 2b.
-
Generate sets of image data for all 128 possible even rules
-
Generate 500 sets of image data for each 100, 200, and 300 initial points for each rule
Hence, 500*3*128 = 192,000 sets of image data are generated.
Training the rules
-
Separate 80% of training data and 20% of test data from total data
-
Learning the rules using CNN(Convolution Neural Network) method
Spreading intensity
To estimate the intensity of spreading, the expected number of presence location variations depending on the rules governing the evolution of clusters is estimated. As an initial value, a value of 1 is randomly given to 100 cells out of 400 cells of the image, and then the number of 1 s in the image is counted while evolving over 400 generations according to all even-number rules of ECA. This process is repeated 10 times to get the expected number of 1 s. The expected number of presence locations over 400 generations with the initial value set to 100 for the rule 204, 206 and 220 is shown in Fig. 2b–d respectively. The mean of the expected number of presence location over 400 generations divided by the initial value of 100 is defined as the spreading intensity for each rule, which shows the growth rate of the expected number of presence locations. The results for all possible even rules are in Fig. S2. The mean of the expected number of presence locations according to each rule is multiplied by the percentile distribution of the rules to get the mean of the expected number of presence locations of the cluster. Here the spreading intensity is defined as the mean of the number of presence locations:
Assessment of spreading
Since the spreading intensity is evaluated based on the mean of the expected number of presence locations only without considering any other environmental and biological variables, the final predicted spreading intensity is weighted by the habitat suitability. The Maxent software (Maximum Entropy, version 3.4.1) is used in estimating the relative habitat suitability of sites by comparing environmental conditions at known observed sites to the available environmental conditions such as precipitation, temperature, elevation, and so on40,41. In this paper, correlation analysis between variables was applied using multicollinearity. As a result, 6 main environmental factors out of 19 factors were used, but there may be cases where the remaining factors should not be overlooked. A pairwise Pearson correlation was performed and highly correlated variables (|r| > 0.80) were excluded, to avoid collinearity in statistical models49. The main environmental factors when using Maxent software are annual mean temperature, mean diurnal range, temperature seasonality, annual precipitation, precipitation of wettest month, and precipitation of driest month.
Simulation results
Clustering
Using the hierarchical clustering method, the entire data is divided into 25 small clusters, and the size of the clusters became similar to the size of the administrative district in Fig. 1e. The number of clusters can be set to 1, 9 or 17, depending on the size of the region of interest in Fig. 1b–d. The common feature of the clusters is the high density mainly around the waterside and wetland. However, the shape of the cluster alone does not represent the spreading intensity for each cluster. Biological and environmental information are not taken into account when grouping the clusters.
Learning the rules using CNN
When the CNN was trained to learn ECA rules, the accuracy was over 99%. This would mean that the rules of change in the bullfrog distribution could be learned with very high confidence.
Spreading intensity
Figure S1 shows the distribution of rules predicted through CNN learning for each cluster in Fig. 1e. The expected number of presence locations for all 128 ECA rules estimated over 400 generations are in Fig. S2. Cluster 14 as an example, shows that 84.5% of the clusters are predicted to follow rule 204, 8.5% are predicted to follow rule 206, and 7.0% are predicted to follow rule 220 in Fig. 3b–d. The mean of the convergent number for rule 204 is 100, for the rule 206 is 323, and for the rule 220 is 322. Therefore, if bullfrogs are found in 100 cells now, the expected number of converged presence locations in cluster 14 can be calculated as 1.00*0.845 + 3.23*0.085 + 3.22*0.07 = 1.34495, which is the spreading intensity for the cluster 14. The spreading intensity of all clusters is shown in Table 1.
Spreading assessment
Figure 4 shows the Spreading Intensity (SI), Habitat Suitability (HS), and Spreading Assessment (SA) of 25 clusters. Figure 4b shows the SI distribution. It does not reflect environmental and biological variables, and it shows the spreading intensity calculated only by machine learning methods (clustering, CNN, etc.). Areas that are already saturated may have low SI values, and areas with low saturation, such as mountainous areas, may have large SI values. Figure 4c represents HS distributions. Habitat suitability values obtained using Maxent software reflect ecological environmental factors for the bullfrogs. Figure 4d shows the distribution of SA values obtained by multiplying SI values and HS values. The HS value ranges from 0 to 1, and the closer it is to 1, the more suitable. The spreading assessment values are categorized into four relative stages: strong spreading, weak spreading, strong retention, and weak retention.
Table 1 shows the result of calculating the spreading assessment. From left to right, each column represents the number of presence locations per cluster, spreading intensity, habitat suitability, spreading assessment, and geometric center latitude and longitude. The higher the value, the greater the probability of spreading. SI is a value calculated using machine learning based only on presence location data. Environmental and biological factors were reflected through Habitat Suitability (HS) to get a Spreading Assessment (SA).
Table 2 shows relative spreading assessments. Four cluster groups are created based on assessment scores. The clusters in groups (I) and (II) show spreading assessment scores greater than 2, which means that they will continue to spread. Clusters in group (III) show scores of 1 to 1.5, which can be considered as slow-spreading or maintaining the population. For group (IV) clusters, the spread appears to have stopped, and the population may decline, especially in clusters #4, #8, and #23.
Discussion
The study relies on static data from several surveys spanning different year but lacks true time series information to accurately track and model bullfrog spread. Obtaining time series data on bullfrogs requires a lot of manpower and budget, and in particular, it takes decades to obtain national distribution data, and it is almost impossible to obtain time series data especially when the distribution changes every year. If a space series is obtained using the unsupervised learning clustering method proposed in this paper, the intensity of spread by region can be estimated, but other observation methods must be continuously used to justify the results or prove the accuracy of the prediction. One way would be to select an area with strong spreading intensity and create time series data through continuous observation.
Methods in this study can be compared and modified using different data sources and various species distribution modeling methods. There are many sources of the occurrence data of L. catesbeianus from different archives such as GBIF, HerpNet49. Many spatial distribution models have been used in various combinations incorporating environmental and biological factors more comprehensively41,49,50,51. To evaluate projected range changes of L. catesbeianus in potentially suitable areas under current and future climate conditions, Johovic et al.49 used several algorithms combined. When implementing clustering, cases where areas with different biological and environmental conditions could not be grouped into the same cluster were not taken into consideration. Using constrained clustering, which uses biological and environmental factors as a limiting condition, will produce more meaningful results52. The number of clusters can be determined by determining the size of the area to be studied according to topographical, environmental, ecological, and biological characteristics.
If the number of clusters is changed, the spreading intensity, habitat suitability and spreading assessment is also changed, so the number of clusters should be adjusted to properly include the region of interest. Considering that local governments are responsible for habitat management, the size and number of clusters were determined according to the size of the administrative district.
When learning the rule, 20 by 20 cells with a maximum of 300 1 s were used, so if the number of clusters was selected when the maximum number of presence locations was less than 300, approximately 25 was appropriate. For 25 clusters the maximum number of presence locations is 267 in Table 1. Furthermore, if the maximum value exceeds 400, it becomes impossible to observe changes in the 20 by 20 cell image. Increasing the size of the image cells may allow for application to larger clusters. On the other hand, increasing the number of clusters may allow for observing more detailed changes in specific regions.
After dividing the observation data into 25 clusters, all presence sites are broadly categorized into four relative stages according to assessment sores, so the number of clusters can be set to around 20–30. When the number of clusters exceeds 20, the number of zero paddings is not very large because the sets in the cluster are relatively close together. Nonetheless, it can be a problem.
The size of cells is identical in each cluster. The density of observations in a cell is uniformly 0 or 1, where 1 represents the presence location. However, in each cell, the number of bullfrog observations is different. It is difficult to accurately measure the population density within a cluster. To increase precision, periodic observation is necessary and a method of observing the same point for a long time is also needed. The data used in this paper were assumed to exist if they were ever observed in a unit cell.
In this study, the numpy. reshape() function53 was used to rearrange two-dimensional images into a one-dimensional array. Future studies are needed to explore applying ECA with various array arrangements. When applying the ECA rules, zero padding was applied to both endpoints, that is, zeros are used for the cells at positions − 1 and 401 virtual cells. It is assumed that Bullfrog has been never found outside the cluster. If found, they should be included in other clusters.
In estimating spreading intensity, the mean of the expected number of presence locations is used, but the slope can be more useful in expressing the tendency of spreading in Fig. S2. Further research is needed to define the appropriate diffusion strength according to the variation pattern of the expected number of presence locations.
Since the spreading intensity is estimated based only on the data currently found it is relatively low in the region where spreading is already completed. Low spreading intensity may mean that it is already saturated, which is different from extinction. Alternatively, the carrying capacity may decrease from a population dynamics perspective due to the emergence of natural enemies or human quarantine.
Geographical characteristics and ecological characteristics are replaced by habitat suitability using Maxent but more detailed cultural characteristics should be applied. In addition to observations, appropriate detection methods for bullfrogs, such as eDNA method or audio recording devices, are required13.
In this paper, habitat suitability was used to reflect biological and environmental factors. Because the weight of habitat suitability was given directly as the value obtained by SDM, environmental and biological factors may not have been sufficiently reflected. Determining the weight of biological and environmental factors can be an important point. For example, if the habitat suitability obtained from Maxent is 0.6 or 0.8, and the spread intensity is 2 in some regions, then the spread assessment scores are 1.2 and 1.8, respectively and they are classified into the same group. There is no significant difference if the weight is directly assigned as a value. For a more accurate spreading assessment, the weights must be adjusted using an appropriate threshold function such as the sigmoid functions.
To classify the intensity of spread, we used 128 rules. It is possible to increase the number of rules, but it is still not possible to express patterns according to the overall environment variables. If environmental and biological variables are included, a clustering method with constraints must be applied from the cluster stage. In any case, continuous correction and supplementation work must be done through periodic observation, observation of specific areas, and various observation methods in parallel. Only the accuracy of machine learning is presented as a verification method. To verify its validity, it is necessary to select 3 or 4 regions and monitor the spreading intensity continuously for several years to generate time series data and compare it with the expected values from simulations.
Future work includes:
-
Design the threshold function: For a more accurate spreading assessment, the weights estimated by Maxent must be adjusted using an appropriate threshold function such as the sigmoid functions.
-
Comparative evaluation study on changes in SDM due to climate change and corresponding changes in regional spreading intensity
-
Study on clustering techniques considering environmental, biological, and ecological factors
-
Comparison study using other data and other SDM methods
Conclusion
In this paper, we used machine learning methods to assess the spreading of bullfrogs in areas where they have been frequently observed in South Korea in recent decades. The extent to which bullfrogs continue to spread at observation sites is quantified and assessed. Since there is no time series data, the accumulated data were used to evaluate the spread of bullfrogs by creating a spatial series using machine learning. In this process, biological and environmental factors were not considered at all.
Cells where bullfrogs are found, and presence, are assigned a value of 1, and the number of 1 s in 400 cells composed of 1 s and 0 s is counted and used as the spreading index of bullfrogs. The mean of the number of presence locations over 400 generations, divided by the initial value of 100, is assumed to be a measure of spreading intensity for each rule. The spreading intensity is weighted by the percentile of the rules estimated by the CNN method. Under the above assumption, the intensity of spread by region was calculated using only cumulative occurrence data, and then the spread of American Bullfrog was assessed using habitat suitability as a weight reflecting environmental, biological, and ecological characteristics. Habitat suitability obtained from Maxent software includes environmental and biological factors, which were applied in the form of weights to the final spreading assessment. For a more accurate spreading assessment, the weights obtained from Maxent need to be adjusted using an appropriate threshold function such as the sigmoid functions. The weights should be determined taking into account the impact of habitat suitability on spreading assessment.
The spreading intensity by region was calculated using only cumulative data, and the spreading assessment was scored by weighting the spreading intensity with the habitat suitability for each region obtained from Maxent. The spreading assessment is determined by multiplying spread intensity by habitat suitability, which can be used as an indicator of the risk of bullfrog spread in each area.
This paper is not to analyze and predict distribution changes due to various factors such as climate change, but to find out changes in the assessment of the spread of bullfrogs in the area where they are currently distributed.
Data availability
All data generated or analyzed during this study are included in this published article.
References
Invasive Species Specialist Group ISSG 2015. The Global Invasive Species Database. Version 2015.1 (2024) https://www.iucngisd.org/gisd/ .
Oh, H.-S. & Hong, C.-E. Current conditions of habitat for Rana catesbeiana and Trachemys scripta elegans imported to Jeju-do, including proposed management plans. Korean J. Environ. Ecol. 21, 311–317 (2007).
Kim, J. Taxonomic list and distribution of Korean amphibians. Korean J. Herpetol 1, 1–13 (2009).
Jang, H. & Suh, J. Distribution of amphibian species in South Korea. Korean J. Herpetol. 2, 45–51 (2010).
Kim, M. Rice in ancient Korea: Status symbol or community food?. Antiquity 89, 838–853. https://doi.org/10.15184/aqy.2015.52 (2015).
Jeong, O.-Y. et al. Review of rice in Korea: Current status, future prospects, and comparisons with rice in other countries. J. Crop. Sci. Biotechnol. 24, 1–11. https://doi.org/10.1007/s12892-020-00053-6 (2021).
Nori, J., Urbina-Cardona, J. N., Loyola, R. D., Lescano, J. N. & Leynaud, G. C. Climate change and American bullfrog invasion: What could we expect in South America?. PLoS ONE 6, e25718. https://doi.org/10.1371/journal.pone.0025718 (2011).
Schlaepfer, M. A., Sherman, P. W., Blossey, B. & Runge, M. C. Introduced species as evolutionary traps. Ecol. Lett. 8, 241–246. https://doi.org/10.1111/j.1461-0248.2005.00730.x (2005).
Ficetola, G. F., Thuiller, W. & Miaud, C. Prediction and validation of the potential global distribution of a problematic alien invasive species—The American bullfrog. Divers. Distrib. 13, 476–485. https://doi.org/10.1111/j.1472-4642.2007.00377.x (2007).
Giovanelli, J. G., Haddad, C. F. & Alexandrino, J. Predicting the potential distribution of the alien invasive American bullfrog (Lithobates catesbeianus) in Brazil. Biol. Invasions 10, 585–590. https://doi.org/10.1007/s10530-007-9154-5 (2008).
Iñiguez, C. A. & Morejón, F. J. Potential distribution of the American bullfrog (Lithobates catesbeianus) in Ecuador. S. Am. J. Herpetol. 7, 85–90. https://doi.org/10.2994/057.007.0211 (2012).
Groffen, J., Kong, S., Jang, Y. & Borzée, A. The invasive American bullfrog (Lithobates catesbeianus) in the Republic of Korea: History and recommendations for population control. Manag. Biol. Invasions 10, 517. https://doi.org/10.3391/mbi.2019.10.3.08 (2019).
Kamoroff, C. et al. Effective removal of the American bullfrog (Lithobates catesbeianus) on a landscape level: Long term monitoring and removal efforts in Yosemite Valley, Yosemite National Park. Biol. Invasions 22, 617–626. https://doi.org/10.1007/s10530-019-02116-4 (2020).
Koo, K. S. & Choe, M. Distribution change of invasive American Bullfrogs (Lithobates catesbeianus) by future climate threaten endangered Suweon Treefrog (Hyla suweonensis) in South Korea. Animals 11, 2865. https://doi.org/10.3390/ani11102865 (2021).
Park, H.-R. et al. Risk assessment for the native anurans from an alien invasive species, American bullfrogs (Lithobates catesbeianus), in South Korea. Sci. Rep. 12, 13143. https://doi.org/10.1038/s41598-022-17226-8 (2022).
Ficetola, G. F. et al. Knowing the past to predict the future: Land-use change and the distribution of invasive bullfrogs. Glob. Change Biol. 16, 528–537. https://doi.org/10.1111/j.1365-2486.2009.01957.x (2010).
Kang, H.-J., Koo, K. S. & Sung, H.-C. Current distribution of American bullfrog Rana catesbeiana Shaw, 1802 in the Republic of Korea. BioInvasions Rec. 8, 942–946. https://doi.org/10.3391/bir.2019.8.4.23 (2019).
Song, H.-R. et al. Monitoring of invasive alein species designated by the act on the conservation and use of biological diversity (III). (2016). https://ecolibrary.me.go.kr/nie/#/search/detail/5850270?offset=9
Kim, T.-S. et al. The 2nd Intensive Survey on Estuarine Ecosystem. (2016). https://doi.org/10.23000/TRKO201700008184
Kim, S.-H. et al. Nationwide Survey of Non-native Species in Korea (II). (Division of Ecological Conservation Bureau of Ecological Research National Institute of Ecology, 2016). https://ecolibrary.me.go.kr/nie/#/search/detail/5850270?offset=9
Kim, T. et al. The 4th Intensive Survey on National Inland Wetlands—Intensive survey for designation of wetland protective area. (National Wetlands Center National Institute of environmental Research, 2016). https://doi.org/10.23000/TRKO201700008183
Kim, T. et al. The 3rd Intensive Survey on the Wetland Protected Areas('16). (National Wetlands Center National Institute of Environmental Resarch, 2016). https://doi.org/10.23000/TRKO201700008185
No, S.-H., Jung, J.-S. & You, Y.-H. Ecological control of invasive alien species, American bullfrog (Rana catesbeiana) using native predatory species. Korean J. Environ. Ecol. 31, 54–61. https://doi.org/10.13047/KJEE.2017.31.1.054 (2017).
Chang, B., Kim, I., Choi, K., Cho, W. & Ko, D. W. Population dynamics of American bullfrog (Lithobates catesbeianus) and implications for control. Animals 12, 2827 (2022).
Phillips, S. J., Anderson, R. P., Dudík, M., Schapire, R. E. & Blair, M. E. Opening the black box: An open-source release of Maxent. Ecography 40, 887–893. https://doi.org/10.1111/ecog.03049 (2017).
da Silveira Vasconcelos, T., Rodríguez, M. Á. & Hawkins, B. A. Biogeographic distribution patterns of South American amphibians: A regionalization based on cluster analysis. J. Biogeogr. 39, 1720–1732. https://doi.org/10.4322/natcon.2011.008 (2011).
Patlolla, C. R. Understanding the concept of hierarchical clustering technique. Towards Data Sci. 21 (2018). https://towardsdatascience.com/understanding-the-concept-of-hierarchical-clustering-technique-c6e8243758ec
Ermentrout, G. B. & Edelstein-Keshet, L. Cellular automata approaches to biological modeling. J. Theor. Biol. 160, 97–133. https://doi.org/10.1006/jtbi.1993.1007 (1993).
Pedregosa, F. et al. Scikit-learn: Machine learning in Python. J. Mach. Learn. Res. 12, 2825–2830. https://doi.org/10.48550/arXiv.1201.0490 (2011).
Wolfram, S. Statistical mechanics of cellular automata. Rev. Modern Phys. 55, 601. https://doi.org/10.1103/RevModPhys.55.601 (1983).
Wolfram, S. A New Kind of Science. Vol. 5 (Wolfram media Champaign, IL, 2002). https://doi.org/10.1115/1.1553433
Nagatani, T. & Tainaka, K.-I. Cellular automaton for migration in ecosystem: Application of traffic model to a predator–prey system. Physica A Stat. Mech. Appl. 490, 803–807. https://doi.org/10.1016/j.physa.2017.08.151 (2018).
Brodrick, P. G., Davies, A. B. & Asner, G. P. Uncovering ecological patterns with convolutional neural networks. Trends Ecol. Evolut. 34, 734–745. https://doi.org/10.1016/j.tree.2019.03.006 (2019).
Qin, J., Pan, W., Xiang, X., Tan, Y. & Hou, G. A biological image classification method based on improved CNN. Ecol. Inform. 58, 101093. https://doi.org/10.1016/j.ecoinf.2020.101093 (2020).
Deneu, B. et al. Convolutional neural networks improve species distribution modelling by capturing the spatial structure of the environment. PLoS Comput. Biol. 17, e1008856. https://doi.org/10.1371/journal.pcbi.1008856 (2021).
Kattenborn, T., Leitloff, J., Schiefer, F. & Hinz, S. Review on Convolutional Neural Networks (CNN) in vegetation remote sensing. ISPRS J. Photogrammetry Remote Sensing 173, 24–49. https://doi.org/10.1016/j.isprsjprs.2020.12.010 (2021).
Elith, J. et al. Novel methods improve prediction of species’ distributions from occurrence data. Ecography. 29, 129–151. https://doi.org/10.1111/j.2006.0906-7590.04596.x (2006).
Phillips, S. J., Anderson, R. P. & Schapire, R. E. Maximum entropy modeling of species geographic distributions. Ecol. Model. 190, 231–259. https://doi.org/10.1016/j.ecolmodel.2005.03.026 (2006).
Phillips, S. J. & Dudík, M. Modeling of species distributions with Maxent: New extensions and a comprehensive evaluation. Ecography 31, 161–175. https://doi.org/10.1111/j.0906-7590.2008.5203.x (2008).
Venne, S. & Currie, D. J. Can habitat suitability estimated from MaxEnt predict colonizations and extinctions?. Diversity Distributions 27, 873–886. https://doi.org/10.1111/ddi.13238 (2021).
Tesfamariam, B. G., Gessesse, B. & Melgani, F. MaxEnt-based modeling of suitable habitat for rehabilitation of Podocarpus forest at landscape-scale. Environ. Syst. Res. 11, 1–12. https://doi.org/10.1186/s40068-022-00248-6 (2022).
Phillips, S. J. A Brief Tutorial on Maxent (2017). http://biodiversityinformatics.amnh.org/open_source/maxent/
Kang, K. Forest Policy and Reclamation in the Republic of Korea. (2017). https://ap.fftc.org.tw/article/1270
Martinez, G. J., Adamatzky, A. & Alonso-Sanz, R. Complex dynamics of elementary cellular automata emerging from chaotic rules. Int. J. Bifurcation Chaos 22, 1250023. https://doi.org/10.1142/S021812741250023X (2012).
Weisstein, Eric W. Elementary Cellular Automaton. From MathWorld—A Wolfram Web Resource. (2017). https://mathworld.wolfram.com/ElementaryCellularAutomaton.html
LeCun, Y., Bengio, Y. & Hinton, G. Deep learning. Nature 521, 436–444. https://doi.org/10.1038/nature14539 (2015).
Webb, S. Deep learning for biology. Nature 554, 555–557. https://doi.org/10.1038/d41586-018-02174-z (2018).
Abadi, M. et al. TensorFlow: Large-scale machine learning on heterogeneous systems. (2015). https://doi.org/10.48550/arXiv.1603.04467
Johovic, I., Gama, M., Banha, F., Tricarico, E. & Anastácio, P. M. A potential threat to amphibians in the European Natura 2000 network: Forecasting the distribution of the American bullfrog Lithobates catesbeianus. Biol. Conserv. 245, 108551. https://doi.org/10.1016/j.biocon.2020.108551 (2020).
Cho, K. H., Park, J.-S., Kim, J. H., Kwon, Y. S. & Lee, D.-H. Modeling the distribution of invasive species (Ambrosia spp.) using regression kriging and Maxent. Front. Ecol. Evolut. 10, 1036816. https://doi.org/10.3389/fevo.2022.1036816 (2022).
Jarnevich, C. et al. Invaders at the doorstep: Using species distribution modeling to enhance invasive plant watch lists. Ecol. Inform. 75, 101997. https://doi.org/10.1016/j.ecoinf.2023.101997 (2023).
Wagstaff, K., Cardie, C., Rogers, S. & Schrödl, S. Constrained k-means clustering with background knowledge. Icml 1, 577–584. https://doi.org/10.5555/645530.655669 (2001).
Harris, C. R. et al. Array programming with NumPy. Nature 585, 357–362. https://doi.org/10.1038/s41586-020-2649-2 (2020).
Acknowledgements
This work was supported by Korea Environment Industry & Technology Institute (KEITI) through Exotic Invasive Species Management Program, funded by Korea Ministry of Environment (MOE) (2018002270001). Additional funding was provided by the National Research Foundation of Korea (NRF), which is funded by the Ministry of Education, Republic of Korea (NRF-2020R1I1A3071769).
Author information
Authors and Affiliations
Contributions
Conceptualization and methodology, G.O., H.S., and H.J.; software and validation, G.O., Y.W., H.K., S.C., and H.J.; formal analysis, investigation and resources, Y.W.,H.K., Y.K., S.C., and H.J.; writing—original draft preparation, G.O. and H.J.; writing—review and editing, Y.K., Y.W., and H.J.; visualization, G.O. and H.J.; supervision, H.S. and H.J.; funding acquisition, H.S. and H.J.
Corresponding author
Ethics declarations
Competing interests
The authors declare no competing interests.
Additional information
Publisher's note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Supplementary Information
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Oh, G., Wi, Y., Kang, HJ. et al. Assessment of American Bullfrog (Lithobates catesbeianus) spreading in the Republic of Korea using rule learning of elementary cellular automata. Sci Rep 14, 11548 (2024). https://doi.org/10.1038/s41598-024-62139-3
Received:
Accepted:
Published:
DOI: https://doi.org/10.1038/s41598-024-62139-3
Keywords
Comments
By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.