Species Distribution Modelling: Contrasting presence-only models with plot abundance data

Gomes, Vitor H. F.; IJff, Stéphanie D.; Raes, Niels; Amaral, Iêda Leão; Salomão, Rafael P.; de Souza Coelho, Luiz; de Almeida Matos, Francisca Dionízia; Castilho, Carolina V.; de Andrade Lima Filho, Diogenes; López, Dairon Cárdenas; Guevara, Juan Ernesto; Magnusson, William E.; Phillips, Oliver L.; Wittmann, Florian; de Jesus Veiga Carim, Marcelo; Martins, Maria Pires; Irume, Mariana Victória; Sabatier, Daniel; Molino, Jean-François; Bánki, Olaf S.; da Silva Guimarães, José Renan; Pitman, Nigel C. A.; Piedade, Maria Teresa Fernandez; Mendoza, Abel Monteagudo; Luize, Bruno Garcia; Venticinque, Eduardo Martins; de Leão Novo, Evlyn Márcia Moraes; Vargas, Percy Núñez; Silva, Thiago Sanna Freire; Manzatto, Angelo Gilberto; Terborgh, John; Reis, Neidiane Farias Costa; Montero, Juan Carlos; Casula, Katia Regina; Marimon, Beatriz S.; Marimon, Ben-Hur; Coronado, Euridice N. Honorio; Feldpausch, Ted R.; Duque, Alvaro; Zartman, Charles Eugene; Arboleda, Nicolás Castaño; Killeen, Timothy J.; Mostacedo, Bonifacio; Vasquez, Rodolfo; Schöngart, Jochen; Assis, Rafael L.; Medeiros, Marcelo Brilhante; Simon, Marcelo Fragomeni; Andrade, Ana; Laurance, William F.; Camargo, José Luís; Demarchi, Layon O.; Laurance, Susan G. W.; de Sousa Farias, Emanuelle; Nascimento, Henrique Eduardo Mendonça; Revilla, Juan David Cardenas; Quaresma, Adriano; Costa, Flavia R. C.; Vieira, Ima Célia Guimarães; Cintra, Bruno Barçante Ladvocat; Castellanos, Hernán; Brienen, Roel; Stevenson, Pablo R.; Feitosa, Yuri; Duivenvoorden, Joost F.; Aymard C., Gerardo A.; Mogollón, Hugo F.; Targhetta, Natalia; Comiskey, James A.; Vicentini, Alberto; Lopes, Aline; Damasco, Gabriel; Dávila, Nállarett; García-Villacorta, Roosevelt; Levis, Carolina; Schietti, Juliana; Souza, Priscila; Emilio, Thaise; Alonso, Alfonso; Neill, David; Dallmeier, Francisco; Ferreira, Leandro Valle; Araujo-Murakami, Alejandro; Praia, Daniel; do Amaral, Dário Dantas; Carvalho, Fernanda Antunes; de Souza, Fernanda Coelho; Feeley, Kenneth; Arroyo, Luzmila; Pansonato, Marcelo Petratti; Gribel, Rogerio; Villa, Boris; Licona, Juan Carlos; Fine, Paul V. A.; Cerón, Carlos; Baraloto, Chris; Jimenez, Eliana M.; Stropp, Juliana; Engel, Julien; Silveira, Marcos; Mora, Maria Cristina Peñuela; Petronelli, Pascal; Maas, Paul; Thomas-Caesar, Raquel; Henkel, Terry W.; Daly, Doug; Paredes, Marcos Ríos; Baker, Tim R.; Fuentes, Alfredo; Peres, Carlos A.; Chave, Jerome; Pena, Jose Luis Marcelo; Dexter, Kyle G.; Silman, Miles R.; Jørgensen, Peter Møller; Pennington, Toby; Di Fiore, Anthony; Valverde, Fernando Cornejo; Phillips, Juan Fernando; Rivas-Torres, Gonzalo; von Hildebrand, Patricio; van Andel, Tinde R.; Ruschel, Ademir R.; Prieto, Adriana; Rudas, Agustín; Hoffman, Bruce; Vela, César I. A.; Barbosa, Edelcilio Marques; Zent, Egleé L.; Gonzales, George Pepe Gallardo; Doza, Hilda Paulette Dávila; de Andrade Miranda, Ires Paula; Guillaumet, Jean-Louis; Pinto, Linder Felipe Mozombite; de Matos Bonates, Luiz Carlos; Silva, Natalino; Gómez, Ricardo Zárate; Zent, Stanford; Gonzales, Therany; Vos, Vincent A.; Malhi, Yadvinder; Oliveira, Alexandre A.; Cano, Angela; Albuquerque, Bianca Weiss; Vriesendorp, Corine; Correa, Diego Felipe; Torre, Emilio Vilanova; van der Heijden, Geertje; Ramirez-Angulo, Hirma; Ramos, José Ferreira; Young, Kenneth R.; Rocha, Maira; Nascimento, Marcelo Trindade; Medina, Maria Natalia Umaña; Tirado, Milton; Wang, Ophelia; Sierra, Rodrigo; Torres-Lezama, Armando; Mendoza, Casimiro; Ferreira, Cid; Baider, Cláudia; Villarroel, Daniel; Balslev, Henrik; Mesones, Italo; Giraldo, Ligia Estela Urrego; Casas, Luisa Fernanda; Reategui, Manuel Augusto Ahuite; Linares-Palomino, Reynaldo; Zagt, Roderick; Cárdenas, Sasha; Farfan-Rios, William; Sampaio, Adeilza Felipe; Pauletto, Daniela; Sandoval, Elvis H. Valderrama; Arevalo, Freddy Ramirez; Huamantupa-Chuquimaco, Isau; Garcia-Cabrera, Karina; Hernandez, Lionel; Gamarra, Luis Valenzuela; Alexiades, Miguel N.; Pansini, Susamar; Cuenca, Walter Palacios; Milliken, William; Ricardo, Joana; Lopez-Gonzalez, Gabriela; Pos, Edwin; ter Steege, Hans

doi:10.1038/s41598-017-18927-1

Download PDF

Article
Open access
Published: 17 January 2018

Species Distribution Modelling: Contrasting presence-only models with plot abundance data

Vitor H. F. Gomes^1,2,
Stéphanie D. IJff^3,4,
Niels Raes³,
Iêda Leão Amaral⁵,
Rafael P. Salomão²,
Luiz de Souza Coelho⁵,
Francisca Dionízia de Almeida Matos⁵,
Carolina V. Castilho⁶,
Diogenes de Andrade Lima Filho⁵,
Dairon Cárdenas López⁷,
Juan Ernesto Guevara^8,9,
William E. Magnusson¹⁰,
Oliver L. Phillips¹¹,
Florian Wittmann^12,13,
Marcelo de Jesus Veiga Carim¹⁴,
Maria Pires Martins⁵,
Mariana Victória Irume⁵,
Daniel Sabatier¹⁵,
Jean-François Molino⁵,
Olaf S. Bánki³,
José Renan da Silva Guimarães¹⁴,
Nigel C. A. Pitman¹⁶,
Maria Teresa Fernandez Piedade¹⁷,
Abel Monteagudo Mendoza¹⁸,
Bruno Garcia Luize¹⁹,
Eduardo Martins Venticinque²⁰,
Evlyn Márcia Moraes de Leão Novo²¹,
Percy Núñez Vargas²²,
Thiago Sanna Freire Silva²³,
Angelo Gilberto Manzatto²⁴,
John Terborgh²⁵,
Neidiane Farias Costa Reis²⁶,
Juan Carlos Montero^27,5,
Katia Regina Casula²⁶,
Beatriz S. Marimon²⁸,
Ben-Hur Marimon²⁸,
Euridice N. Honorio Coronado^29,11,
Ted R. Feldpausch³⁰,
Alvaro Duque³¹,
Charles Eugene Zartman⁵,
Nicolás Castaño Arboleda⁷,
Timothy J. Killeen³²,
Bonifacio Mostacedo³³,
Rodolfo Vasquez¹⁸,
Jochen Schöngart¹⁷,
Rafael L. Assis¹⁷,
Marcelo Brilhante Medeiros³⁴,
Marcelo Fragomeni Simon³⁴,
Ana Andrade³⁵,
William F. Laurance³⁶,
José Luís Camargo³⁵,
Layon O. Demarchi¹⁷,
Susan G. W. Laurance³⁶,
Emanuelle de Sousa Farias^37,38,
Henrique Eduardo Mendonça Nascimento⁵,
Juan David Cardenas Revilla⁵,
Adriano Quaresma¹⁷,
Flavia R. C. Costa⁵,
Ima Célia Guimarães Vieira¹,
Bruno Barçante Ladvocat Cintra^17,11,
Hernán Castellanos³⁹,
Roel Brienen¹¹,
Pablo R. Stevenson⁴⁰,
Yuri Feitosa⁴¹,
Joost F. Duivenvoorden⁴²,
Gerardo A. Aymard C.⁴³,
Hugo F. Mogollón⁴⁴,
Natalia Targhetta⁴⁵,
James A. Comiskey^46,47,
Alberto Vicentini¹⁰,
Aline Lopes¹⁷,
Gabriel Damasco⁸,
Nállarett Dávila⁴⁸,
Roosevelt García-Villacorta^49,50,
Carolina Levis^51,52,
Juliana Schietti⁵,
Priscila Souza⁵,
Thaise Emilio^53,10,
Alfonso Alonso⁴⁷,
David Neill⁵⁴,
Francisco Dallmeier⁴⁷,
Leandro Valle Ferreira¹,
Alejandro Araujo-Murakami⁵⁵,
Daniel Praia¹⁷,
Dário Dantas do Amaral¹,
Fernanda Antunes Carvalho^10,56,
Fernanda Coelho de Souza^10,11,
Kenneth Feeley^57,58,
Luzmila Arroyo⁵⁵,
Marcelo Petratti Pansonato^5,59,
Rogerio Gribel⁶⁰,
Boris Villa¹⁷,
Juan Carlos Licona²⁷,
Paul V. A. Fine⁸,
Carlos Cerón⁶¹,
Chris Baraloto⁶²,
Eliana M. Jimenez⁶³,
Juliana Stropp⁶⁴,
Julien Engel^15,62,
Marcos Silveira⁶⁵,
Maria Cristina Peñuela Mora⁶⁶,
Pascal Petronelli⁶⁷,
Paul Maas³,
Raquel Thomas-Caesar⁶⁸,
Terry W. Henkel⁶⁹,
Doug Daly⁷⁰,
Marcos Ríos Paredes⁷¹,
Tim R. Baker¹¹,
Alfredo Fuentes^72,73,
Carlos A. Peres⁷⁴,
Jerome Chave⁷⁵,
Jose Luis Marcelo Pena⁷⁶,
Kyle G. Dexter^77,50,
Miles R. Silman⁷⁸,
Peter Møller Jørgensen⁷³,
Toby Pennington⁵⁰,
Anthony Di Fiore⁷⁹,
Fernando Cornejo Valverde⁸⁰,
Juan Fernando Phillips⁸¹,
Gonzalo Rivas-Torres^82,83,
Patricio von Hildebrand⁸⁴,
Tinde R. van Andel³,
Ademir R. Ruschel⁸⁵,
Adriana Prieto⁸⁶,
Agustín Rudas⁸⁶,
Bruce Hoffman⁸⁷,
César I. A. Vela⁸⁸,
Edelcilio Marques Barbosa⁵,
Egleé L. Zent⁸⁹,
George Pepe Gallardo Gonzales⁷¹,
Hilda Paulette Dávila Doza⁷¹,
Ires Paula de Andrade Miranda⁵,
Jean-Louis Guillaumet⁹⁰,
Linder Felipe Mozombite Pinto⁷¹,
Luiz Carlos de Matos Bonates⁵,
Natalino Silva⁹¹,
Ricardo Zárate Gómez⁹²,
Stanford Zent⁸⁹,
Therany Gonzales⁹³,
Vincent A. Vos^94,95,
Yadvinder Malhi⁹⁶,
Alexandre A. Oliveira⁵⁹,
Angela Cano⁴⁰,
Bianca Weiss Albuquerque¹⁷,
Corine Vriesendorp¹⁶,
Diego Felipe Correa^40,97,
Emilio Vilanova Torre^98,99,
Geertje van der Heijden¹⁰⁰,
Hirma Ramirez-Angulo⁹⁸,
José Ferreira Ramos⁵,
Kenneth R. Young¹⁰¹,
Maira Rocha¹⁷,
Marcelo Trindade Nascimento¹⁰²,
Maria Natalia Umaña Medina^40,103,
Milton Tirado¹⁰⁴,
Ophelia Wang¹⁰⁵,
Rodrigo Sierra¹⁰⁴,
Armando Torres-Lezama⁹⁸,
Casimiro Mendoza^106,107,
Cid Ferreira⁵,
Cláudia Baider¹⁰⁸,
Daniel Villarroel⁵⁵,
Henrik Balslev¹⁰⁹,
Italo Mesones⁸,
Ligia Estela Urrego Giraldo³¹,
Luisa Fernanda Casas¹¹⁰,
Manuel Augusto Ahuite Reategui¹¹¹,
Reynaldo Linares-Palomino¹¹²,
Roderick Zagt¹¹³,
Sasha Cárdenas¹¹⁰,
William Farfan-Rios⁷⁸,
Adeilza Felipe Sampaio²⁶,
Daniela Pauletto¹¹⁴,
Elvis H. Valderrama Sandoval^115,116,
Freddy Ramirez Arevalo¹¹⁶,
Isau Huamantupa-Chuquimaco²²,
Karina Garcia-Cabrera⁷⁸,
Lionel Hernandez³⁹,
Luis Valenzuela Gamarra¹⁸,
Miguel N. Alexiades¹¹⁷,
Susamar Pansini²⁶,
Walter Palacios Cuenca¹¹⁸,
William Milliken⁵³,
Joana Ricardo¹¹,
Gabriela Lopez-Gonzalez¹¹,
Edwin Pos^3,119 &
…
Hans ter Steege ORCID: orcid.org/0000-0002-8738-2659^3,120

Scientific Reports volume 8, Article number: 1003 (2018) Cite this article

45k Accesses
107 Citations
24 Altmetric
Metrics details

Subjects

Abstract

Species distribution models (SDMs) are widely used in ecology and conservation. Presence-only SDMs such as MaxEnt frequently use natural history collections (NHCs) as occurrence data, given their huge numbers and accessibility. NHCs are often spatially biased which may generate inaccuracies in SDMs. Here, we test how the distribution of NHCs and MaxEnt predictions relates to a spatial abundance model, based on a large plot dataset for Amazonian tree species, using inverse distance weighting (IDW). We also propose a new pipeline to deal with inconsistencies in NHCs and to limit the area of occupancy of the species. We found a significant but weak positive relationship between the distribution of NHCs and IDW for 66% of the species. The relationship between SDMs and IDW was also significant but weakly positive for 95% of the species, and sensitivity for both analyses was high. Furthermore, the pipeline removed half of the NHCs records. Presence-only SDM applications should consider this limitation, especially for large biodiversity assessments projects, when they are automatically generated without subsequent checking. Our pipeline provides a conservative estimate of a species’ area of occupancy, within an area slightly larger than its extent of occurrence, compatible to e.g. IUCN red list assessments.

Evaluation metrics and validation of presence-only species distribution models based on distributional maps with varying coverage

Article Open access 15 January 2021

Validating species distribution models to illuminate coastal fireflies in the South Pacific (Coleoptera: Lampyridae)

Article Open access 30 August 2021

Empirical estimation of habitat suitability for rare plant restoration in an era of ongoing climatic shifts

Article Open access 07 November 2023

Introduction

Species distribution models (SDMs) are widely used in the fields of macroecology, biogeography and biodiversity research for modelling species geographic distributions based on correlations between known occurrence records and the environmental conditions at occurrence localities^1,2. SDMs generate geographical maps of a species’ environmental suitability, its likelihood of being collected, and its local abundance³. Their application includes selecting conservation areas, predicting the effects of climate change on species ranges and determining the risk of species invasions^4,5. The wide use of SDMs in ecological and conservation research can partly be explained by the growing availability of georeferenced species records (e.g. GBIF, SpeciesLink) and environmental data (e.g. WorldClim, CliMond)^6,7 on the web, together with the user-friendly character of some of the modelling methods.

One of the most commonly used SDMs is MaxEnt, which has become increasingly popular since its introduction⁸. This machine-learning algorithm estimates a species’ probability distribution that has maximum entropy (closest to uniform), subject to a set of constraints based upon our knowledge of the environmental conditions at known occurrence sites¹. MaxEnt is a presence-only model, enabling scientists to utilize the abundant data sources of natural history collections (NHCs), avoiding the high costs of sampling the species throughout their extent of occurrence. Presence data are abundant, but absence data are hard to obtain and often unreliable due to insufficient survey effort. To counter the lack of absences, MaxEnt uses a background sample to contrast the distribution of presences along environmental gradients against the distribution background points, randomly drawing from the study area.

NHCs, however, may not be independently drawn from the investigated populations due to the non-random nature of collecting^9,10. Because collectors aim to collect as many species as possible, rare species are often overrepresented in herbaria, whereas common species are underrepresented, producing collectors’ bias¹¹. Therefore, the relative number of specimens per species in herbaria is not a good representation of the species’ relative abundance in the field. Additionally, NHCs have spatial bias due to geographical differences in survey effort, data storage and mobilization^9,10,12. This may have negative impacts on the performance of presence-only SDMs if this results in environmentally biased sampling^12,13,14,15. Negative impact of spatial bias is not always present, however^16,17.

MaxEnt has shown to outperform other SDMs in several studies^{18,19,20,21,22}. Nevertheless, some drawbacks have been identified. For example, MaxEnt may underestimate the probability of occurrence within areas of observed presence, while overestimating it in areas beyond the species’ known extent of occurence²³. Like other SDMs, one essential assumption of MaxEnt is that the presence-data are an independent sample from the species’ unknown probability distribution of occurrence over the study area¹. Given the shortcomings of NHCs due to collectors’ bias mentioned above, this assumption may not be met.

With a large set of plots with quantitative data, species abundances may be estimated by a spatial interpolation of local species’ abundances²⁴. Based on plot data, where all species are collected (regardless of commonness or rarity), the interpolation method arguably suffers less from the collectors’ bias and is exclusively based on location. The abundance maps may serve as the species’ estimated probability distribution and a higher local abundance implies a higher probability of collecting. That is, the chance of encountering a species is higher in a region where the relative abundance of that species is high, than where the relative abundance is low. With spatially interpolated abundances we may thus test whether NHCs can actually be considered a random sample of the unknown probability distribution.

Here we test how the geographic distribution of NHCs relates to the species relative abundance. To achieve this we address the following questions: (1) Do NHCs represent an independently drawn sample from the unknown probability distribution of a species? And (2) how does MaxEnt’s predicted environmental suitability compare to plot abundance data and spatial interpolation of species abundances? To answer these questions, we used NHCs and abundance plot data of 227 hyperdominant Amazonian tree species, which are the most common tree species that together make up half of all trees with a diameter (dbh) over 10 cm in Amazonia^24,25, the most biodiverse rainforest on Earth. We used NHCs and MaxEnt to construct presence-only SDMs for all 227 species and constructed the abundance maps by spatial interpolation of the plot abundance data for all species as well. To answer the first question, we compared the collection records to the interpolated abundance maps for each species. Secondly, we compared MaxEnt’s predicted environmental suitability maps to the same interpolated abundance maps for each of the 227 species.

Results

NHCs data distribution and relative abundance analysis

The analysis testing our first question, whether NHCs are an independent draw from the unknown probability distribution, resulted in a significant (P < 0.05), but very weak positive relationship for 149 (66%) species of the 227. For these species the chance of being collected indeed increased slightly with higher interpolated relative abundance. For the other 78 species (34%), this relationship was non-significant or negative (Appendix S1).

Predicted environmental suitability compared to species relative abundances

Further analyses were carried out using only 170 species. Species, that had MaxEnt’s predicted environmental suitability not significantly different from a random expectation tested with bias corrected null models, were excluded (57 species). For 161 of the 170 species (95%), MaxEnt’s predicted environmental suitability was also significantly correlated with interpolated abundance (P < 0.05). The correlations and, thus the biological significance, were low however, with a mean rho (Spearman rank correlation) of 0.26 (Fig. 1A). A linear 90^th quantile regression revealed that for 135 (79%) of the 175 species, the logistic output of MaxEnt could significantly (P < 0.05) predict the highest 10% of the local relative abundance values. The slope of the regression and thus the biological significance was very low, with a mean slope of only 0.01 (Fig. 1B).

We also investigated the performance of the IDW output against NHCs data and the MaxEnt output against plot presence (sensitivity), to check whether the models were accurate references to the occurrence data of each other (Appendix S2). Approximately 87% of the grid cells with species’ NHCs were correctly predicted as present by the IDW maps with a median true positive rate of 0.87 (Fig. 1C). The same analyses for MaxEnt showed that 88% of the grid cells with plot presence were correctly predicted by MaxEnt maps, with a median true positive rate of 0.88 (Fig. 1D). Sensitivity for both analyses was high.

We provide maps (combined MaxEnt and IDW maps [as in Fig. 2]) for all species in the Supplementary Material S3. The predicted environmentally suitable region and the abundance distribution were similar for very abundant species with a large extent of occurence, such as Brosimum rubescens (Fig. S3_14A), Conceveiba guianensis (Fig. S3_32A) and Eschweilera coriacea (Fig. S3_49A). The same was true in the case of the species Clathrotropis glaucophylla (Fig. S3_30A) and Cenostigma tocantinum (Fig. S3_26A), despite the fact that neither species has a wide extent of occurence.

Moreover, MaxEnt also correctly predicted the environmental unsuitability of non-forested savanna areas, which are located in the north (Brazil, Guianas and Venezuela) and south of the map (northern Bolivia). These close matches apply to very abundant species with a large extent of occurence, such as Licania micrantha (Fig. S3_87A), and Ocotea aciphylla (Fig. S3_111A).

For Triplaris weigeltiana, a species with a northern Amazonian distribution, MaxEnt also correctly predicted its absence in these northern non-forested areas (Fig. 2A, S3_160A,B). In this case MaxEnt was able to establish a relationship between species distribution and vegetation type, based on climate variables (temperature and precipitation) and species occurrence. For Macrolobium acaciifolium, a riverine species, the IDW presented limitations. This species is rarely recorded in plots, because the plots are mostly far from river edges. Thus, the species was found only in plots near to major rivers such as the Amazon. In this case NHCs provided better information about species occurrence, as collectors can reach areas closer to other smaller rivers aiming to collect more species. In such a case, MaxEnt maps presented a wider distribution for the species (Fig. 2B, S3_92A–C).

IDW maps predicted widespread distributions for palms, for which the MaxEnt estimates were in sharp disagreement. Palms species are more difficult to collect, which can result in a lack of specimens in NHCs²⁴. IDW maps appear to be more accurate for these species, because all species are recorded inside plots. In eastern Amazonia this was particularly severe because NHCs showed a large lack of occurrence in comparison with plot data but also proper locations were rejected by a kernel density estimate (KDE), because of the huge amount of palm occurrence data from the Aarhus University Palm Transect Database in western Amazonia. Some of the species affected were Attalea butyracea (Fig. S3_8A), Euterpe precatoria (Fig. S3_60A), Iriartea deltoidea (Fig. S3_72A), Oenocarpus bacaba (Fig. S3_113A), Oenocarpus bataua (Fig. S3_114A) and Socratea exorrhiza (Fig. S3_150A).

NHCs data cleaning treatment and MaxEnt map building

All 227 hyperdominant species had records excluded by the data cleaning treatment, the consequence of records that either lack geographic information, are duplicates at the used grid cell resolution of 0.5 degree or were outliers based on a kernel-density estimate (Appendix S4). An average of 50% of the records was excluded. The first twelve species with the most excluded records were palms, with a mean of 96% of excluded records. The total average of excluded records decreased to 43% when palms were taken out of the analyses (Appendix S4). This high percentage is due to the huge amount of palm occurrence data of the Aarhus University Palm Transect Database²⁶. At this moment this database contains 543,000 records, all available in GBIF. Most of these records represent observations in many plots inside the same grid cell, thus these records were removed and considered as a single observation.

After the kernel density estimate treatment the average of excluded records was 57%, presenting an increment of 6.7% in the total amount of records excluded. Eperua purpurea and Eperua leucantha collections were in good agreement with plot data distribution, after outliers were excluded by the kernel density estimate (Appendix S5). In the case of Eperua falcata, some occurrences in Colombia and Venezuela were in fact misidentifications of Eperua leucantha, since this species occurs only in the Guianas (H. ter Steege, pers. obs.). The kernel density estimate function correctly removed these occurrences outside the E. falcata cluster observed in the Guianas (Fig. 3A). Some occurrences of Licania alba in southeast Amazonia, an area with no plot data, were also removed by the kernel-density estimate function (Fig. 3B).

Because of the use of the buffer treatment to limit MaxEnt predictions around the species’ extent of occurrence, MaxEnt maps predicted an area of occupancy close to that of the IDW maps., The median value for MaxEnt’s area of occupancy was 1354 0.5-degree grid cells, and the median for IDW was 1217. For 98 (58%) of the 170 species, MaxEnt predicted an area of occupancy bigger than the that predicted by IDW, and for 115 (42%) of the species IDW had a predicted area of occupancy bigger than MaxEnt. In 15% of the cases (26 species) the size difference in area of occupancy was smaller than 5% (Appendix S2).

Discussion

Using NHCs for presence-only SDMs

Collection density was weakly related to relative abundance in most tree species, and for 34% there was no positive relationship between the chance of being collected and local abundance, violating the assumption of MaxEnt that collection localities are an independently drawn sample from a species’ unknown probability distribution¹. The differences between the distribution of NHCs and local abundance could limit the ability of presence-only SDMs to predict species probability of distribution as predicted by spatial interpolation of local species’ abundance.

MaxEnt’s premise that species occurrences are drawn randomly from the unknown probability distribution¹ may not be met for two reasons: (1) collections are spatially biased with regard to environmental conditions^13,14,; (2) and collections are spatially biased with regard to areas of high abundance, with underrepresentation of in areas of high abundance and overrepresentation in areas of low abundance. Much attention has been given to the possible impacts of spatial bias on the performance of presence-only SDMs, with some showing a negative impact on these models^13,14,15, and others arguing for the robustness of MaxEnt against spatial bias^17,27. However, little attention has been given to the second issue. With our plot dataset, we addressed the relationship between collection localities and the predicted spatial abundance distribution.

In 66% of the cases we found that a higher local relative abundance indeed increased the chance of being collected, although the correlations were very weak (Fig. 1A), and the majority of collections originated from areas with a low relative abundance due to the large areas where a given species’ abundance is low. Even hyperdominant species are usually only dominant in one or two of the six regions of Amazonia, most hyperdominant species have a large geographic extent of occurrence but are habitat specialists²⁴. Steege et al.²⁵ also found that abundance is a poor predictor for the number of collections of a species compared to the size of its extent of occurrence. Additionally, herbaria are characterized by the earlier discussed collectors’ bias¹¹. Although we addressed the spatial bias of survey effort by including a bias based background file in our MaxEnt modelling, the lack of a significant positive relationship between relative abundance estimated by IDW and collection density for many species suggests that this assumption of MaxEnt is not met because of the way species are collected.

MaxEnt maps vs. IDW maps

We also asked if MaxEnt maps would be a close match to the IDW maps. In general, environmental suitability does not reflect a species abundance. Presence-only SDMs, such as MaxEnt, are based on correlations between species presence and environmental conditions, predicting the environmental suitability for a species, and not their realized distribution⁵. Relative abundance in the other hand is based solely on abundance, estimating the number of trees belonging to each species in the grid cells²⁸. The Spearman’s rank correlation and the linear 90^th percentile quantile regression showed a very weak positive relationship between MaxEnt’s predicted environmental suitability and IDW relative abundance prediction at plot localities, contrary to the results of VanDerWal et al.²⁹, who found a strong relationship between the two. Their research differs in that they modelled a biogeographical region with tropical and subtropical rainforests, and also drier and warmer environments. The relationship between environmental suitability and local abundance is likely to be stronger when more (extreme) divergent conditions are included, such as areas from different biomes. Perhaps Amazonia’s less divergent conditions, representing perhaps a one single biome, in a much larger area, are potentially responsible for this weak relationship.

To test their further predictive performances we also converted both outputs to binary maps. Some studies have addressed questions about the transformation of SDM predictions into discrete representations such as binary maps, aiming to estimate area of occupancy, species richness and others applications^30,31,32,33. Binary maps can add more uncertainties to model predictions, especially because it is necessary to set a threshold to distinguish between species presence and absence, which can be selected arbitrarily or without taking into account the context of the study. However, we avoided thresholds based on specificity (prediction of absences) because of the lack of absence data³⁴. In many cases our MaxEnt binary maps presented an area of occupancy close to those made with IDW, presenting a high median sensitivity (88%). Moreover our MaxEnt binary maps also correctly predicted absence in naturally non-forested areas in northern Amazonia for many species (Appendix S3).

MaxEnt’s environmental suitability mostly predicted much larger area of occupancy than those predicted with the IDW relative abundance. We reduced this effect estimating species extent of occurrence using a convex hull around each species records, plus a buffer of 300 km. This approach minimized MaxEnt’s overestimation of the area of occupancy beyond the species’ known geographical range (extent of occurrence), over climatically suitable areas, by restricting the species’ predicted suitable habitat, providing a more conservative estimate for the species’ area of occupancy (Appendix S5)^35,36.

The IDW relative abundance models showed an opposite behaviour, underpredicting areas where collections are present but where no plots have recorded the species. The high sensitivity of the MaxEnt compared to those of the IDW is in agreement with a previous study³⁷, where models fit to presence-only data yielded higher sensitivity but a lower specificity than presence-absence models. Nevertheless, in our case, the IDW relative abundance yielded sensitivity rates based on collection localities that were as high as the sensitivity rates of MaxEnt’s predicted environmental suitability based on plot presence localities (87%). Thus, both models function similarly in predicting species presences.

Collections versus abundance plot data

In some cases, collections were located outside the species’ extent of occurrence predicted by the IDW maps. This divergence follows from the methodical differences between collections and plot assessments. The distributions as predicted by the IDW do not always cover the whole species’ extent of occurrence. Because there are only 550 individuals (on average) in one plot, and 16,000 tree species in Amazonia²⁴, one plot obviously cannot contain all species that are present in the surrounding area. Furthermore, many plots, lacking a given species, are within the extent of occurrence predicted by IDW, and many plots with absences are located in close proximity to plots with presence data. This results in low specificity values. NHCs comprise a species’ range including areas of low abundance; while plot data have information on abundance, but may miss areas of low abundance, and, thus, may miss rare species more easily.

Environmental suitability versus dispersal limitation

The second large difference between the two models is the theoretical principles they are based upon. MaxEnt is based on environmental suitability, which is appropriate since correlations between species’ distributions and climate are evident^5,36. Nevertheless, predicting actual (realized) distributions also requires information on biotic interactions, dispersal limitation, and other environmental variables, which are beyond presence-only SDM⁵. IDW, on the other hand, is based on location only. Thus, both models cover only one of the three explanatory variables for species distributions. Again, it will depend on the aim of the research which type of model is most suitable. In either case predicted species distributions need to be interpreted with caution.

Collection data and cleaning pipeline

We propose a cleaning pipeline to remove possible inconsistencies in collection data. Unlike species-specific approaches, many studies use large numbers of species, lacking correction because of the great number of references and specialists to be consulted³⁸. Collection data available in global datacentres, such as GBIF, cannot carry out thorough data-correction procedures, and the quality of the records has been debated and tested in some cases³⁹. Some records have no locality information, or coordinates are based on cities close to the observed distribution, and may contain duplicated data or zeros as information^38,39,40.

We used a pipeline that cleans collection data by removing records with a lack of geographic information³⁸, and we strongly recommend the use of analytical tools to correct inconsistencies present in global databases. The cleaning process also removed coordinates considered spatial outliers by a kernel-density estimate, omitting locations too far from the central part of the distribution, which we assume to be misidentifications.

Our results suggest that half of the species records are likely inconsistent, missing geographical information, such as latitude, longitude or locality. Palms were the most impacted species, because the huge amount of records available with high levels of redundancy.

We used a kernel density estimate (KDE) to remove geographical outliers of the NHCs. This function removed e.g. occurrences outside the Eperua falcata cluster observed in the Guianas (Fig. 3A), and Licania alba in southeast Amazonia (Fig. 3B). Although the KDE excluded only a small number of records compared to the previous cleaning step, it was able to identify some isolated occurrences, which we considered likely misidentifications. The KDE, however, showed limitations with palm species, removing some eastern Amazonia records, simply caused by the great number of collections in the Aarhus University Palm Transect Database in western Amazonia.

Conclusion

We have shown that the NHCs violate the assumption of MaxEnt that collection localities are an independently drawn sample from a species’ unknown probability distribution. Although we found a relationship between NHCs and relative abundance for some species, it was very weak. Additionally, we found that the majority of MaxEnt’s predicted environmental suitability values differ from those of the IDW relative abundance values, and its results cannot be interpreted as an abundance estimate. Nevertheless, MaxEnt predicts probability of occurrence well, and both models largely overlap and predict similar areas of occupancy, showing high sensitivities. Furthermore, NHCs data should undergo cleaning processes before being used to represent occurrences in species distribution models. We showed that, half of the species records are likely inconsistent, missing geographical information, such as latitude, longitude or locality, and it also may represents misidentifications of the species. We therefore conclude that distribution maps as generated by MaxEnt should be used with caution. Their application should not be based solely on unsupervised models, especially because their easily constructed distribution maps are tempting to utilize without indication of probable errors. This outcome is particularly important for biodiversity assessments, for which SDMs of a large number of species are automatically generated without subsequent checking. Our pipeline provides a conservative means to do so. As our pipeline removes inconsistencies from NHCs data and estimates area of occupancy in an area slightly larger than the extent of occurrence of a species, compatible with IUCN red list assessments^35,41.

Methods

Species

We focused our analysis on 227 hyperdominant Amazonian tree species. The hyperdominant species are the most common tree species in Amazon, and together make up half of all trees with a diameter (dbh) over 10 cm²⁴. We chose only hyperdominant species to reduce the emergence of too many ‘false absences’ when plot data are interpolated into abundance maps. They present the largest probability of occurrence in the plots where they are present in the surrounding area.

Collections

Species collections were downloaded from GBIF (August 2017, www.gbif.org). We used data from the species’ complete extent of occurrence to prevent deficiencies that are associated with SDMs based on a species’ partial geographic range, such as under-prediction⁴². All individuals were assigned to species level; intraspecific levels were ignored.

Taxonomic names were checked with the Taxonomic Name Resolution Service (TNRS, http://tnrs.iplantcollaborative.org/). Although misidentification may represent a major problem in tree plots, we assume it is less severe in common species such as the hyperdominants; which are better represented in herbaria and more likely to be collected fertile²⁵. We assume that misidentification is within acceptable limits.

Collections cleaning pipeline

The cleaning pipeline consisted of a two-step process to remove inconsistencies from GBIF downloaded data (GBIF records). The first step consisted of removing all records with missing latitude, longitude or locality information (imprecise georeferences)³⁹ and all duplicates at 0.5-degree spatial resolution⁴⁰. With the GeoClean function from speciesgeocodeR R Package³⁸ we also removed coordinates assigned to capital cities, coordinates with latitude equal to longitude, coordinates equal to exactly zero; coordinates based on centroids of provinces, and corrected country references (cleaned GBIF records).

In the second step we used a kernel-density estimate function to remove spatial outliers from the cleaned GBIF data, assuming that these are misidentifications or incorrect coordinates not filtered by the step described above. This function calculates a fixed-bandwidth kernel-density estimate of the point process density function that produced the point patterns⁴³, using the density.ppp function from spatstat R Package⁴⁴ to generate a kernel-density estimate. Outliers were identified and removed based on the kernel-density values for each species coordinate, using a threshold based on a quantile function from stats R Package⁴⁵ (kernel-density estimate GBIF records).

The quantile threshold was set according to the number of Amazonian regions in which a species occurred, six in total as defined by ter Steege et al.²⁴. The quantile threshold was larger for species with narrow distribution (occurring in one to three Amazonian regions) and smaller for species with wide distribution (occurring in more than three Amazonian regions). As some hyperdominants are very widely distributed in Amazonia a larger quantile threshold cuts off too many occurrences, removing not only outliers, but also potential correct occurrence or entire occurrence clusters. Both steps reduced the number of species collection records (Appendix S4), and the predicted area of occupancy (Appendix S5).

Plot abundance data

Abundance maps were constructed using 1675 1-ha tree inventory plots well distributed across Amazonia (defined as the tropical rain forest of the Amazon basin and the Guyana Shield) from the Amazon Tree Diversity Network (ATDN) (http://atdn.myspecies.info/). All individuals with ≥10 cm diameter at breast height (dbh) were recorded within the plots²⁴. Because a relatively small number of collections from these plots have been deposited in herbaria, they constitute a dataset nearly independent from the NHCs.

Constructing abundance maps

Inverse distance weighting (IDW) interpolation was used to create abundance maps from the plot abundance data. First, Amazonia was divided into 2193 0.5-degree grid cells. We then constructed the inverse distance weighting (IDW) models based on relative abundance following ter Steege et al.²⁸. Then, the relative abundance (RA) for each cell was defined as RA_i = n_i/N, where: n_i = the number of individuals of species i, and N = the total number of trees. IDW models were based on the nearest 150 plots within a limit of 300 km distance. Each plot weight was calculated by taking the square root of the distance in degrees. The 150 plots that were taken into account ensured that within an area consisting of absence plots only, the species is predicted to be absent. In addition, the 3-degree distance limit causes the model to predict the absence of a species when no occurrence plots are present within a radius of 3 degrees. This setting is based on the notion that within a non-environmental model a species’ extent of occurrence is restricted by dispersal limitation only⁴⁶. The maximum dispersal distance has been optimized to a 3-degree distance by determining the best match between the IDW maps and the Fisher’s Alpha diversity map of all species⁴⁷.

Constructing presence-only SDMs using MaxEnt

We used MaxEnt version 3.3.3 k^1,48, to construct presence-only SDMs for all the 227 species. Data of 19 environmental variables were downloaded from WorldClim⁶. These included variables related to temperature and precipitation. Since collinearity, the non-independence of predictor variables, potentially leads to the wrong identification of relevant predictors for the model, we used the common Spearman’s rank correlation coefficient threshold of |rho| >0.7 to identify correlated variables⁴⁹.

Subsequently, we selected least correlated variables (|rho| <0.7) based on biological relevance and their loadings in a principal component analysis (PCA). The PCA consisted of all environmental variables for all collection localities of the 227 species. For temperature, we selected isothermality, temperature seasonality, and maximum temperature of warmest month. For precipitation we chose annual precipitation, wettest month precipitation and driest month precipitation. All the environmental variables were cropped to the extent of the Neotropics⁴², and aggregated to a 0.5-degree spatial resolution, using the function ‘mean’ from R package ‘raster’⁵⁰. We used precipitation and temperature variables to assess MaxEnt’s predicted environmental suitability based on climate only. In the MaxEnt feature settings we excluded the product, threshold and hinge features given their lack of biological justification with the variables used^34,36.

Correcting for geographical sampling bias has been found to improve the predictive performance of MaxEnt¹⁴. Also, environmental bias can be assessed by environmental filtering, which improves MaxEnt discriminatory ability⁵¹. We produced a bias file to employ the target-group background method recommended by Phillips and Dudík⁵², an option which is implemented in MaxEnt. The bias file consisted of a binary raster grid based on all Amazon tree species collections²⁵, at each grid cell downloaded from GBIF, which reflects local survey effort. This is an essential step in the analysis, given MaxEnt’s assumption that the occurrences are independently drawn from the unknown probability distribution of the species. Without a bias file, sampling bias could severely reduce models accuracy. We used the bias file to produce a background file according the efforts of collection. Finally we used a convex hull around cleaned occurrences (kernel-density estimate GBIF records) of each species to estimate their extent of occurrence sensu³⁵, plus a buffer of 300 km, equal to the buffer set for the IDW analysis, to crop the area of predicted environmental suitability^29,41. The latter is our predicted area of occupancy.

Data analysis

We compared collection presences and absences to IDW relative abundance to answer our first question whether NHCs are independent drawn from the unknown probability distribution. A binomial generalized linear model (logit regression) was used to determine if a significant positive relationship existed between the probability of being collected and predicted local relative abundance.

To answer the second question, how MaxEnt’s predicted environmental suitability compares to IDW relative abundance, we first tested which species’ MaxEnt maps were significantly different from random expectation with a bias corrected null-model⁵³. For each species, 99 null-models were generated by randomly drawing n collection localities without replacement from the same spatial grid as the environmental layers, with n being the number of geographically unique collections for that species. Using an upper one-sided 95% confidence interval, we determined the probability value of the observed AUC as calculated by MaxEnt against those generated by the null distribution. If the species’ observed AUC value ranks 95 or above, the chance that a random set of n points could generate an equally good model is less than 5%, hence considered significantly different from random expectation. All species for which the SDM prediction did not deviate significantly from random expectation were excluded from further analysis.

Second, a Spearman Rank Correlation test was used to test the relationship between MaxEnt logistic output and IDW relative abundance at plot localities. Additionally, following VanDerWal et al.²⁹, we determined the linear 90th percentile quantile regression between the IDW relative abundance and MaxEnt logistic outputs at plot localities. The confidence intervals of the linear quantile regressions were calculated with the Markov chain marginal bootstrap method as suggested by Kocherginsky et al.⁵⁴. We computed the correlations and regressions for all plots separately, even if multiple plots were present in one grid square.

Third, we tested the predictive performance of MaxEnt and IDW. For MaxEnt, its logistic output was transformed into binary maps with a 10% training presence threshold. Although the maximum sum of sensitivity and specificity is considered to be the best threshold method for presence-only SDMs by Liu et al.⁵⁵, we followed the advice of Merow et al.³⁴ to avoid measures with specificity because they are based on absences that are unknown in this analysis. Then we tested its sensitivity by calculating true positive rate of the binary maps against plot presence. That is, the fraction of the grid cells with a plot for which MaxEnt predicted the species correctly to be present. Finally we calculated the median predicted area of occupancy.

For IDW, its output was transformed into binary maps by converting the grids cells with RA >0 into 1. Last, naturally non-forested areas were excluded from the maps based on Soares-Filho et al.⁵⁶. We then calculated its output true positive rate against collections presences and absences. That is, the fraction of the grid cells with a collection for which the IDW relative abundance predicted the species correctly to be present. Finally we also calculated the median predicted area of occupancy for IDW.

All calculations and analyses were performed with R version 3.0.3³, including the R packages raster⁵⁰, rgdal⁵⁷, gstat⁵⁸, dismo⁵⁹, vegan⁶⁰, quantreg⁶¹, sp⁶², rJava⁶³ and SDMTools⁶⁴.

References

Phillips, S. J., Anderson, R. P. & Schapire, R. E. Maximum entropy modeling of species geographic distributions. Ecol. Modell. 190, 231–259 (2006).
Article Google Scholar
Elith, J. & Leathwick, J. R. Species Distribution Models: Ecological Explanation and Prediction Across Space and Time. Annu. Rev. Ecol. Evol. Syst. 40, 677–697 (2009).
Article Google Scholar
Miller, J. Species Distribution Modeling. Geogr. Compass 4, 490–509 (2010).
Article Google Scholar
Pearson, R. G. Species’ distribution modeling for conservation educators and practitioners. Lessons Conserv. 3, 54–89 (2010).
Google Scholar
Araújo, M. B. & Peterson, A. T. Uses and misuses of bioclimatic envelope modeling. Ecology 93, 1527–1539 (2012).
Article PubMed Google Scholar
Hijmans, R. J., Cameron, S. E., Parra, J. L., Jones, P. G. & Jarvis, A. Very high resolution interpolated climate surfaces for global land areas. Int. J. Climatol. 25, 1965–1978 (2005).
Article Google Scholar
Kriticos, D. J. et al. CliMond: global high‐resolution historical and future scenario climate surfaces for bioclimatic modelling. Methods Ecol. Evol. 3, 53–64 (2012).
Article Google Scholar
Renner, I. W. & Warton, D. I. Equivalence of MAXENT and Poisson Point Process Models for Species Distribution Modeling in Ecology. Biometrics 69, 274–281 (2013).
Article MathSciNet PubMed MATH Google Scholar
Tobler, M., Honorio, E., Janovec, J. & Reynel, C. Implications of collection patterns of botanical specimens on their usefulness for conservation planning: an example of two neotropical plant families (Moraceae and Myristicaceae) in Peru. Biodivers. Conserv. 16, 659–677 (2007).
Article Google Scholar
Haripersaud, P. P. P. Collecting biodiversity. (Utrecht University, 2009).
ter Steege, H., Haripersaud, P. P., Bánki, O. S. & Schieving, F. A model of botanical collectors’ behavior in the field: Never the same species twice. Am. J. Bot. 98, 31–37 (2011).
Article PubMed Google Scholar
Beck, J., Böller, M., Erhardt, A. & Schwanghart, W. Spatial bias in the GBIF database and its effect on modeling species’ geographic distributions. Ecol. Inform. 19, 10–15 (2014).
Article Google Scholar
Phillips, S. J. et al. Sample selection bias and presence-only distribution models: implications for background and pseudo-absence data. Ecol. Appl. 19, 181–197 (2009).
Article PubMed Google Scholar
Syfert, M. M., Smith, M. J., Coomes, D. A., Meagher, T. R. & Roberts, D. L. The Effects of Sampling Bias and Model Complexity on the Predictive Performance of MaxEnt Species Distribution Models. PLoS One 8, e55158 (2013).
Article ADS CAS PubMed PubMed Central Google Scholar
Fourcade, Y., Engler, J. O., Rödder, D., Secondi, J. & Brooks, T. Mapping Species Distributions with MAXENT Using a Geographically Biased Sample of Presence Data: A Performance Assessment of Methods for Correcting Sampling Bias. PLoS One 9, e97122 (2014).
Article ADS PubMed PubMed Central Google Scholar
Kadmon, R., Farber, O. & Danin, A. Effect of Roadside Bias on the Accuracy of Predictive Maps Produced By Bioclimatic Models. Ecol. Appl. 14, 401–413 (2004).
Article Google Scholar
Loiselle, B. A. et al. Predicting species distributions from herbarium collections: does climate bias in collection sampling influence model outcomes? J. Biogeogr. 35, 105–116 (2008).
Google Scholar
Elith, J. et al. Novel methods improve prediction of species’ distributions from occurrence data. Ecography (Cop.). 29, 129–151 (2006).
Wisz, M. S. et al. Effects of sample size on the performance of species distribution models. Divers. Distrib. 14, 763–773 (2008).
Article Google Scholar
Giovanelli, J. G. R. R., de Siqueira, M. F., Haddad, C. F. B. B. & Alexandrino, J. Modeling a spatially restricted distribution in the Neotropics: How the size of calibration area affects the performance of five presence-only methods. Ecol. Modell. 221, 215–224 (2010).
Article Google Scholar
Merckx, B., Steyaert, M., Vanreusel, A., Vincx, M. & Vanaverbeke, J. Null models reveal preferential sampling, spatial autocorrelation and overfitting in habitat suitability modelling. Ecol. Modell. 222, 588–597 (2011).
Article CAS Google Scholar
Aguirre-Gutiérrez, J. et al. Fit-for-Purpose: Species Distribution Model Performance Depends on Evaluation Criteria – Dutch Hoverflies as a Case Study. PLoS One 8, e63708 (2013).
Article ADS PubMed PubMed Central Google Scholar
Fitzpatrick, M. C., Gotelli, N. J. & Ellison, A. M. MaxEnt versus MaxLike: empirical comparisons with ant species distributions. Ecosphere 4, art55 (2013).
Article Google Scholar
ter Steege, H. et al. Hyperdominance in the Amazonian Tree Flora. Science (80-). 342, (2013).
ter Steege, H. et al. The discovery of the Amazonian tree flora with an updated checklist of all known tree taxa. Sci. Rep. 6, 1–15 (2016).
Article Google Scholar
Cámara-Leret, R., Tuomisto, H., Ruokolainen, K., Balslev, H. & Kristiansen, S. M. Modelling responses of western Amazonian palms to soil nutrients. J. Ecol. 1–15, https://doi.org/10.1111/1365-2745.12708 (2016).
Graham, C. H. et al. The influence of spatial errors in species occurrence data used in distribution models. J. Appl. Ecol. 45, 239–247 (2008).
Article Google Scholar
Steege, H. et al. Estimating the global conservation status of more than 15,000 Amazonian tree species. 9–11 (2015).
VanDerWal, J., Shoo, L. P., Johnson, C. N. N. & Williams, S. E. Abundance and the Environmental Niche: Environmental Suitability Estimated from Niche Models Predicts the Upper Limit of Local Abundance. Am. Nat. 174, 282–291 (2009).
Article PubMed Google Scholar
Calabrese, J. M., Certain, G., Kraan, C. & Dormann, C. F. Stacking species distribution models and adjusting bias by linking them to macroecological models. Glob. Ecol. Biogeogr. 23, 99–112 (2014).
Article Google Scholar
Guillera‐Arroita, G. et al. Is my species distribution model fit for purpose? Matching data and models to applications. Glob. Ecol. Biogeogr. 24, 276–292 (2015).
Article Google Scholar
Lahoz‐Monfort, J. J., Guillera‐Arroita, G. & Wintle, B. A. Imperfect detection impacts the performance of species distribution models. Glob. Ecol. Biogeogr. 23, 504–515 (2014).
Article Google Scholar
Lawson, C. R., Hodgson, J. A., Wilson, R. J. & Richards, S. A. Prevalence, thresholds and the performance of presence–absence models. Methods Ecol. Evol. 5, 54–64 (2014).
Article Google Scholar
Merow, C., Smith, M. J. & Silander, J. A. A practical guide to MaxEnt for modeling species’ distributions: What it does, and why inputs and settings matter. Ecography (Cop.). 36, 1058–1069 (2013).
Article Google Scholar
IUCN. IUCN Red List Categories and Criteria: Version 3.1. Gland. Switz. Cambridge, UK IUCN iv 32pp (2012).
Boucher-Lalonde, V., Morin, A. & Currie, D. J. How are tree species distributed in climatic space? A simple and general pattern. Glob. Ecol. Biogeogr. 21, 1157–1166 (2012).
Article Google Scholar
Maher, S. P., Randin, C. F., Guisan, A. & Drake, J. M. Pattern-recognition ecological niche models fit to presence-only and presence-absence data. Methods Ecol. Evol. 5, 761–770 (2014).
Article Google Scholar
Zizka, A. & Antonelli, A. speciesgeocodeR: An R package for linking species occurrences, user-defined regions and phylogenetic trees for biogeography, ecology and evolution. bioRxiv 32755 (2015).
Maldonado, C. et al. Estimating species diversity and distribution in the era of Big Data: To what extent can we trust public databases? Glob. Ecol. Biogeogr. 24, 973–984 (2015).
Article PubMed PubMed Central Google Scholar
Boyle, B. et al. The taxonomic name resolution service: an online tool for automated standardization of plant names. BMC Bioinformatics 13, 14–16 (2013).
Google Scholar
Syfert, M. M. et al. Using species distribution models to inform IUCN Red List assessments. Biol. Conserv. 177, 174–184 (2014).
Article Google Scholar
Raes, N. Partial versus full species distribution models. Nat. a Conserv. 10, 127–138 (2012).
Article Google Scholar
Diggle, P. J. A Kernel Method for Smoothing Point Process Data. J. R. Stat. Soc. Ser. C (Applied Stat. 34, 138–147 (1985).
MATH Google Scholar
Baddeley, A., Rubak, E. & Turner, R. Spatial point patterns: methodology and applications with R. (CRC Press, 2015).
Team, R. C. R: A language and environment for statistical computing. R Foundation for Statistical Computing, Vienna, Austria. 2016. (2016).
Gaston, K. J. Geographic range limits: achieving synthesis. Proc. R. Soc. B 276, 1395–1406 (2009).
Article PubMed PubMed Central Google Scholar
ter Steege, H. et al. A spatial model of tree α-diversity and tree density for the Amazon. Biodivers. Conserv. 12, 2255–2277 (2003).
Article Google Scholar
Phillips, S. J., Dudík, M. & Schapire, R. E. A maximum entropy approach to species distribution modeling. Proc. Twenty-First Int. Conf. Mach. Learn. 83 (2004).
Dormann, C. F. et al. Collinearity: a review of methods to deal with it and a simulation study evaluating their performance. Ecography (Cop.). 36, 27–46 (2013).
Article Google Scholar
Hijmans, R. J. & van Etten, J. raster: Geographic Data Analysis and Modeling. R package version 2, 5–8 (2016).
Google Scholar
Varela, S., Anderson, R. P., García-Valdés, R. & Fernández-González, F. Environmental filters reduce the effects of sampling bias and improve predictions of ecological niche models. Ecography (Cop.). 37, 1084–1091 (2014).
Google Scholar
Phillips, S. J. & Dudík, M. Modeling of species distributions with Maxent: New extensions and a comprehensive evaluation. Ecography (Cop.). 31, 161–175 (2008).
Article Google Scholar
Raes, N. & Ter Steege, H. A null-model for significance testing of presence-only species distribution models. Ecography (Cop.). 30, 727–736 (2007).
Article Google Scholar
Kocherginsky, M., He, X. M. & Mu, Y. M. Practical confidence intervals for regression quantiles. J. Comput. Graph. Stat. 14, 41–55 (2005).
Article MathSciNet Google Scholar
Liu, C., White, M. & Newell, G. Selecting thresholds for the prediction of species occurrence with presence-only data. J. Biogeogr. 40, 778–789 (2013).
Article Google Scholar
Soares-Filho, B. S. et al. LBA-ECO LC-14 Modeled Deforestation Scenarios, Amazon Basin: 2002–2050 (Oak Ridge National Laboratory Distributed Active Archive Center, Oak Ridge, TN, 2013). (2013).
Bivand, R., Keitt, T. & Rowlingson, B. rgdal: Bindings for the Geospatial Data Abstraction Library. R package version 0.8–16, https://doi.org/10.1353/lib.0.0050 (2014).
Pebesma, E. & Graeler, B. gstat: spatial and spatio-temporal geostatistical modelling, prediction and simulation. R package version 1.0–19 (2014).
Hijmans, R. J., Phillips, S., Leathwick, J. & Elith, J. Species Distribution Modeling. Package ‘dismo’. dismo: Species Distribution Modeling. R package version 0.9–3, http://CRAN.R-project.org/package=dismo (2017).
Oksanen, A. J. et al. Package ‘ vegan’. (2015).
Koenker, R. quantreg: Quantile Regression. R package version 5.05 (2013).
Pebesma, E. J. & Bivand, R. sp: classes and methods for spatial data in R. R package version 1.0–15 (2014).
Urbanek, M. S. rJava: low-level R to Java interface. R package version 0.9–6 (2013).
VanDerWal, J., Falconi, L., Januchowski, S., Shoo, L. & Storlie, C. SDMTools: Species Distribution Modelling Tools: Tools for processing data associated with species distribution modelling exercises. R package version 1.1–20 1 (2014).

Download references

Acknowledgements

V. H. F. Gomes, H. ter Steege and R. P. Salomão are supported by grant 407232/2013-3 -PVE - MEC/MCTI/CAPES/CNPq/FAPs. We thank Jésus Aguirre Guttierez, Edwin Pos (and others) for constructive comments on the manuscript.

Author information

Authors and Affiliations

Coordenação de Botânica, Museu Paraense Emílio Goeldi, Av. Magalhães Barata 376, C.P. 399, Belém, PA, 66040-170, Brazil
Vitor H. F. Gomes, Ima Célia Guimarães Vieira, Leandro Valle Ferreira & Dário Dantas do Amaral
Programa de Pós-Graduação em Ciência Ambientais, Universidade Federal do Pará, Rua Augusto Corrêa 01, Belém, PA, 66075-110, Brazil
Vitor H. F. Gomes & Rafael P. Salomão
Biodiversity Dynamics, Naturalis Biodiversity Center, PO Box 9517, Leiden, 2300 RA, The Netherlands
Stéphanie D. IJff, Niels Raes, Olaf S. Bánki, Paul Maas, Tinde R. van Andel, Edwin Pos & Hans ter Steege
Marine and Coastal Management, Deltares, Boussinesqweg 1, Delft, 2629 HV, The Netherlands
Stéphanie D. IJff
Coordenação de Biodiversidade, Instituto Nacional de Pesquisas da Amazônia – INPA, Av. André Araújo 2936, Petrópolis, Manaus, AM, 69067-375, Brazil
Iêda Leão Amaral, Luiz de Souza Coelho, Francisca Dionízia de Almeida Matos, Diogenes de Andrade Lima Filho, Maria Pires Martins, Mariana Victória Irume, Jean-François Molino, Juan Carlos Montero, Charles Eugene Zartman, Henrique Eduardo Mendonça Nascimento, Juan David Cardenas Revilla, Flavia R. C. Costa, Juliana Schietti, Priscila Souza, Marcelo Petratti Pansonato, Edelcilio Marques Barbosa, Ires Paula de Andrade Miranda, Luiz Carlos de Matos Bonates, José Ferreira Ramos & Cid Ferreira
EMBRAPA – Centro de Pesquisa Agroflorestal de Roraima, BR 174 km 8, Distrito Industrial, Boa Vista, RR, 69301-970, Brazil
Carolina V. Castilho
Herbario Amazónico Colombiano, Instituto SINCHI, Calle 20 No 5-44, Bogotá, DC, Colombia
Dairon Cárdenas López & Nicolás Castaño Arboleda
Department of Integrative Biology University of California, Berkeley, CA, 94720-3140, USA
Juan Ernesto Guevara, Gabriel Damasco, Paul V. A. Fine & Italo Mesones
Universidad San Francisco de Quito, Colegio de Ciencias Biológicas, Diego de Robles y Vía Interoceánica, Quito, Pichincha, Ecuador
Juan Ernesto Guevara
Coordenação de Pesquisas em Ecologia, Instituto Nacional de Pesquisas da Amazônia – INPA, Av. André Araújo 2936, Petrópolis, Manaus, AM, 69067-375, Brazil
William E. Magnusson, Alberto Vicentini, Thaise Emilio, Fernanda Antunes Carvalho & Fernanda Coelho de Souza
School of Geography, University of Leeds, Woodhouse Lane, Leeds, LS2 9JT, UK
Oliver L. Phillips, Euridice N. Honorio Coronado, Bruno Barçante Ladvocat Cintra, Roel Brienen, Fernanda Coelho de Souza, Tim R. Baker, Joana Ricardo & Gabriela Lopez-Gonzalez
Department of Wetland Ecology, Institute of Geography and Geoecology, Karlsruhe Institute of Technology – KIT, Josefstr 1, Rastatt, D-76437, Germany
Florian Wittmann
Biogeochemistry, Max Planck Institute for Chemistry, Hahn-Meitner Weg 1, Mainz, 55128, Germany
Florian Wittmann
Departamento de Botânica, Instituto de Pesquisas Científicas e Tecnológicas do Amapá – IEPA, Rodovia JK Km 10, Campus do IEPA da Fazendinha, Amapá, 68901-025, Brazil
Marcelo de Jesus Veiga Carim & José Renan da Silva Guimarães
AMAP, IRD, Cirad, CNRS, INRA, Université de Montpellier, TA A-51/PS2, Bd. de la Lironde, Montpellier, 34398, France
Daniel Sabatier & Julien Engel
Science and Education, The Field Museum, 1400 S. Lake Shore Drive, Chicago, IL, 60605-2496, USA
Nigel C. A. Pitman & Corine Vriesendorp
Coordenação de Dinâmica Ambiental, Instituto Nacional de Pesquisas da Amazônia – INPA, Av. André Araújo 2936, Petrópolis, Manaus, AM, 69067-375, Brazil
Maria Teresa Fernandez Piedade, Jochen Schöngart, Rafael L. Assis, Layon O. Demarchi, Adriano Quaresma, Bruno Barçante Ladvocat Cintra, Aline Lopes, Daniel Praia, Boris Villa, Bianca Weiss Albuquerque & Maira Rocha
Jardín Botánico de Missouri, Oxapampa, Pasco, Peru
Abel Monteagudo Mendoza, Rodolfo Vasquez & Luis Valenzuela Gamarra
Departamento de Ecologia, Universidade Estadual Paulista – UNESP, Instituto de Biociências – IB, Av. 24 A 1515, Bela Vista, Rio Claro, SP, 13506-900, Brazil
Bruno Garcia Luize
Centro de Biociências, Departamento de Ecologia, Universidade Federal do Rio Grande do Norte, Av. Senador Salgado Filho 3000, Natal, RN, 59072-970, Brazil
Eduardo Martins Venticinque
Divisao de Sensoriamento Remoto – DSR, Instituto Nacional de Pesquisas Espaciais – INPE, Av. dos Astronautas 1758, Jardim da Granja, São José dos Campos, SP, 12227-010, Brazil
Evlyn Márcia Moraes de Leão Novo
Herbario Vargas, Universidad Nacional de San Antonio Abad del Cusco, Avenida de la Cultura, Nro 733, Cusco, Cuzco, Peru
Percy Núñez Vargas & Isau Huamantupa-Chuquimaco
Departamento de Geografia, Universidade Estadual Paulista – UNESP, Instituto de Geociências e Ciências Extas – IGCE, Bela Vista, Rio Claro, SP, 13506-900, Brazil
Thiago Sanna Freire Silva
Departamento de Biologia, Universidade Federal de Rondônia, Rodovia BR 364 s/n Km 9.5, Rural, Porto Velho, RO, 76.824-027, Brazil
Angelo Gilberto Manzatto
Center for Tropical Conservation, Duke University, Nicholas School of the Environment, Durham, NC, 27708, USA
John Terborgh
Programa de Pós-Graduação em Biodiversidade e Biotecnologia - PPG-Bionorte, Universidade Federal de Rondônia, Campus Porto Velho, Km 9.5, Rural, Porto Velho, RO, 76.824-027, Brazil
Neidiane Farias Costa Reis, Katia Regina Casula, Adeilza Felipe Sampaio & Susamar Pansini
Instituto Boliviano de Investigacion Forestal, Av. 6 de agosto 28 Km 14, Doble via La Guardia Casilla, 6204, Santa Cruz, Santa Cruz, Bolivia
Juan Carlos Montero & Juan Carlos Licona
Programa de Pós-Graduação em Ecologia e Conservação, Universidade do Estado de Mato Grosso, Nova Xavantina, MT, Brazil
Beatriz S. Marimon & Ben-Hur Marimon
Instituto de Investigaciones de la Amazonía Peruana – IIAP, Av. A. Quiñones km 2.5, Iquitos, Loreto, 784, Peru
Euridice N. Honorio Coronado
Geography College of Life and Environmental Sciences, University of Exeter, Exeter, EX4 4RJ, UK
Ted R. Feldpausch
Departamento de Ciencias Forestales, Universidad Nacional de Colombia, Calle 64 x Cra 65, Medellín, Antioquia, 1027, Colombia
Alvaro Duque & Ligia Estela Urrego Giraldo
Agteca-Amazonica, Santa Cruz, Bolivia
Timothy J. Killeen
Facultad de Ciencias Agrícolas Universidad Autónoma Gabriel René Moreno Santa Cruz, Santa Cruz, Bolivia
Bonifacio Mostacedo
Prédio da Botânica e Ecologia Embrapa Recursos Genéticos e Biotecnologia Parque Estação Biológica, Av. W5 Norte, Brasilia, DF, 70770-917, Brazil
Marcelo Brilhante Medeiros & Marcelo Fragomeni Simon
Projeto Dinâmica Biológica de Fragmentos Florestais, Instituto Nacional de Pesquisas da Amazônia - INPA, Av. André Araújo 2936, Petrópolis, Manaus, AM, 69067-375, Brazil
Ana Andrade & José Luís Camargo
Centre for Tropical Environmental and Sustainability Science and College of Science and Engineering James Cook University, Cairns, Queensland, 4870, Australia
William F. Laurance & Susan G. W. Laurance
Laboratório de Ecologia de Doenças Transmissíveis da Amazônia (EDTA), Instituto Leônidas e Maria Deane, Fiocruz, Rua Terezina 476, Adrianópolis, Manaus, AM, 69060-001, Brazil
Emanuelle de Sousa Farias
Programa de Pós-graduação em Biodiversidade e Saúde Instituto Oswaldo Cruz - IOC/FIOCRUZ Pav. Arthur Neiva – Térreo Av. Brasil 4365 – Manguinhos, Rio de Janeiro, RJ, 21040-360, Brazil
Emanuelle de Sousa Farias
Centro de Investigaciones Ecológicas de Guayana Universidad Nacional Experimental de Guayana, Calle Chile, urbaniz Chilemex, Puerto Ordaz, Bolivar, Venezuela
Hernán Castellanos & Lionel Hernandez
Laboratorio de Ecología de Bosques Tropicales y Primatología Universidad de los Andes Carrera 1 # 18a- 10, Bogotá, DC, 111711, Colombia
Pablo R. Stevenson, Angela Cano, Diego Felipe Correa & Maria Natalia Umaña Medina
Programa de Pós-Graduação em Biologia (Botânica) Instituto Nacional de Pesquisas da Amazônia - INPA, Av. André Araújo 2936, Petrópolis, Manaus, AM, 69067-375, Brazil
Yuri Feitosa
Institute of Biodiversity and Ecosystem Dynamics University of Amsterdam, Sciencepark 904, Amsterdam, 1098 XH, The Netherlands
Joost F. Duivenvoorden
Programa de Ciencias del Agro y el Mar Herbario Universitario (PORT) UNELLEZ-Guanare, Guanare, Portuguesa, 3350, Venezuela
Gerardo A. Aymard C.
Endangered Species Coalition, 8530 Geren Rd, Silver Spring, MD, 20901, USA
Hugo F. Mogollón
MAUA Working Group Instituto Nacional de Pesquisas da Amazônia - INPA Av. André Araújo 2936, Petrópolis, Manaus, AM, 69067-375, Brazil
Natalia Targhetta
Inventory and Monitoring Program National Park Service 120 Chatham Lane, Fredericksburg, VA, 22405, USA
James A. Comiskey
Center for Conservation Education and Sustainability Smithsonian Conservation Biology Institute, 1100 Jefferson Dr. SW, Suite 3123, Washington, DC, 20560-0705, USA
James A. Comiskey, Alfonso Alonso & Francisco Dallmeier
Biologia Vegetal, Universidade Estadual de Campinas, Caixa Postal 6109, Campinas, SP, 13.083-970, Brazil
Nállarett Dávila
Institute of Molecular Plant Sciences University of Edinburgh, Mayfield Rd, Edinburgh, EH3 5LR, UK
Roosevelt García-Villacorta
Royal Botanic Garden Edinburgh 20a Inverleith Row, Edinburgh, Scotland, EH3 5LR, UK
Roosevelt García-Villacorta, Kyle G. Dexter & Toby Pennington
Programa de Pós-Graduação em Ecologia, Instituto Nacional de Pesquisas da Amazônia - INPA, Av. André Araújo 2936, Petrópolis, Manaus, AM, 69067-375, Brazil
Carolina Levis
Forest Ecology and Forest Management Group Wageningen University Wageningen University & Research, Droevendaalsesteeg 3, Wageningen, P.O. Box 47, 6700 AA, The Netherlands
Carolina Levis
Natural Capital and Plant Health Royal Botanic Gardens, Kew, Richmond, Surrey, TW9 3AB, UK
Thaise Emilio & William Milliken
Ecosistemas Biodiversidad y Conservación de Especies Universidad Estatal Amazónica, Km. 2 1/2 vía a Tena (Paso Lateral), Puyo, Pastaza, Ecuador
David Neill
Museo de Historia Natural Noel Kempff Mercado Universidad Autónoma Gabriel Rene Moreno, Avenida Irala 565 Casilla Postal 2489, Santa Cruz, Santa Cruz, Bolivia
Alejandro Araujo-Murakami, Luzmila Arroyo & Daniel Villarroel
Centro de Biociências, Departamento de Botânica e Zoologia, Universidade Federal do Rio Grande do Norte, Campus Universitário, Lagoa Nova, Natal, RN, 59078-970, Brazil
Fernanda Antunes Carvalho
Department of Biology University of Miami Coral, Gables, FL, 33146, USA
Kenneth Feeley
Fairchild Tropical Botanic Garden Coral, Gables, FL, 33156, USA
Kenneth Feeley
Instituto de Biociências - Dept. Ecologia, Universidade de Sao Paulo - USP, Rua do Matão, Trav. 14, no. 321, Cidade Universitária, São Paulo, SP, 05508-090, Brazil
Marcelo Petratti Pansonato & Alexandre A. Oliveira
Diretoria de Pesquisas Científicas, Instituto de Pesquisas Jardim Botânico do Rio de Janeiro, Rio de Janeiro, RJ, Brazil
Rogerio Gribel
Escuela de Biología Herbario Alfredo Paredes Universidad Central, Ap. Postal 17.01.2177, Quito, Pichincha, Ecuador
Carlos Cerón
International Center for Tropical Botany (ICTB) Department of Biological Sciences Florida International University, 11200 SW 8th Street, OE 243, Miami, FL, 33199, USA
Chris Baraloto & Julien Engel
Grupo de Ecología de Ecosistemas Terrestres Tropicales, Universidad Nacional de Colombia Sede Amazonía, Leticia, Amazonas, Colombia
Eliana M. Jimenez
Institute of Biological and Health Sciences, Federal University of Alagoas, Av. Lourival Melo Mota, s/n, Tabuleiro do Martins, Maceio, AL, 57072-970, Brazil
Juliana Stropp
Museu Universitário/Centro de Ciências Biológicas e da Natureza/Laboratório de Botânica e Ecologia Vegetal, Universidade Federal do Acre, Rio Branco, AC, 69915-559, Brazil
Marcos Silveira
Universidad Regional Amazónica IKIAM, Km 7 via Muyuna, Tena, Napo, Ecuador
Maria Cristina Peñuela Mora
Cirad UMR Ecofog AgrosParisTech CNRS INRA Univ Guyane, Campus agronomique, Kourou Cedex, 97379, France
Pascal Petronelli
Iwokrama International Programme for Rainforest Conservation, Georgetown, Guyana
Raquel Thomas-Caesar
Department of Biological Sciences, Humboldt State University, 1 Harpst Street, Arcata, CA, 95521, USA
Terry W. Henkel
New York Botanical Garden 2900 Southern Blvd, Bronx, New York, NY, 10458-5126, USA
Doug Daly
Servicios de Biodiversidad EIRL, Jr. Independencia 405, Iquitos, Loreto, 784, Peru
Marcos Ríos Paredes, George Pepe Gallardo Gonzales, Hilda Paulette Dávila Doza & Linder Felipe Mozombite Pinto
Herbario Nacional de Bolivia, Universitario UMSA, Casilla 10077 Correo Central, La Paz, La Paz, Bolivia
Alfredo Fuentes
Missouri Botanical Garden, P.O. Box 299, St. Louis, MO, 63166-0299, USA
Alfredo Fuentes & Peter Møller Jørgensen
School of Environmental Sciences, University of East Anglia, Norwich, NR4 7TJ, UK
Carlos A. Peres
Laboratoire Evolution et Diversité Biologique CNRS and Université Paul Sabatier UMR 5174 EDB, Toulouse, 31000, France
Jerome Chave
Department of Forestry Management, Universidad Nacional Agraria La Molina, Avenido La Molina, Apdo. 456, La Molina, Lima, Peru
Jose Luis Marcelo Pena
School of Geosciences University of Edinburgh, 201 Crew Building, King’s Buildings, Edinburgh, EH9 3JN, UK
Kyle G. Dexter
Biology Department and Center for Energy, Environment and Sustainability, Wake Forest University, 1834 Wake Forest Rd, Winston Salem, NC, 27106, USA
Miles R. Silman, William Farfan-Rios & Karina Garcia-Cabrera
Department of Anthropology University of Texas at Austin, SAC 5.150, 2201 Speedway Stop C3200, Austin, TX, 78712, USA
Anthony Di Fiore
Andes to Amazon Biodiversity Program, Madre de Dios, Madre de Dios, Peru
Fernando Cornejo Valverde
Fundación Puerto Rastrojo, Cra 10 No. 24-76 Oficina 1201, Bogotá, DC, Colombia
Juan Fernando Phillips
Colegio de Ciencias Biológicas y Ambientales-COCIBA & Galapagos Institute for the Arts and Sciences-GAIAS, Universidad San Francisco de Quito-USFQ, Quito, Pichincha, Ecuador
Gonzalo Rivas-Torres
Department of Wildlife Ecology and Conservation University of Florida, 110 Newins-Ziegler Hall, Gainesville, FL, 32611, USA
Gonzalo Rivas-Torres
Fundación Estación de Biología, Cra 10 No. 24-76 Oficina 1201, Bogotá, DC, Colombia
Patricio von Hildebrand
Embrapa Amazônia Oriental, Trav. Dr. Enéas Pinheiro s/nº, Belém, PA, 66095-100, Brazil
Ademir R. Ruschel
Instituto de Ciencias Naturales, Universidad Nacional de Colombia, Apartado, 7945, Bogotá, DC, Colombia
Adriana Prieto & Agustín Rudas
Amazon Conservation Team, Doekhieweg Oost #24, Paramaribo, Suriname
Bruce Hoffman
Facultad de Ciencias Forestales y Medio Ambiente, Universidad Nacional de San Antonio Abad del Cusco, Jirón San Martín 451, Puerto Maldonado, Madre de Dios, Peru
César I. A. Vela
Laboratory of Human Ecology, Instituto Venezolano de Investigaciones Científicas - IVIC, Ado 20632, Caracas, Caracas, 1020A, Venezuela
Egleé L. Zent & Stanford Zent
Departement EV, Muséum national d’histoire naturelle de Paris, 16 rue Buffon, Paris, 75005, France
Jean-Louis Guillaumet
Instituto de Ciência Agrárias, Universidade Federal Rural da Amazônia, Av. Presidente Tancredo Neves 2501, Belém, PA, 66.077-901, Brazil
Natalino Silva
PROTERRA, Instituto de Investigaciones de la Amazonía Peruana (IIAP), Av. A. Quiñones km 2 5, Iquitos, Loreto, 784, Peru
Ricardo Zárate Gómez
ACEER Foundation, Jirón Cusco N° 370, Puerto Maldonado, Madre de Dios, Peru
Therany Gonzales
Universidad Autónoma del Beni José Ballivián, Campus Universitario Final Av. Ejercito, Riberalta, Beni, Bolivia
Vincent A. Vos
Regional Norte Amazónico, Centro de Investigación y Promoción del Campesinado, C/Nicanor Gonzalo Salvatierra N° 362, Riberalta, Beni, Bolivia
Vincent A. Vos
Environmental Change Institute, Oxford University Centre for the Environment, Dyson Perrins Building, South Parks Road, Oxford, England, OX1 3QY, UK
Yadvinder Malhi
School of Agriculture and Food Sciences - ARC Centre of Excellence for Environmental Decisions CEED, The University of Queensland, St. Lucia, QLD 4072, Australia
Diego Felipe Correa
Instituto de Investigaciones para el Desarrollo Forestal (INDEFOR), Universidad de los Andes, Conjunto Forestal, C.P. 5101, Mérida, Mérida, Venezuela
Emilio Vilanova Torre, Hirma Ramirez-Angulo & Armando Torres-Lezama
School of Environmental and Forest Sciences, University of Washington, Seattle, WA, 98195-2100, USA
Emilio Vilanova Torre
University of Nottingham, University Park, Nottingham, NG7 2RD, UK
Geertje van der Heijden
Geography and the Environment, University of Texas at Austin, 305 E. 23rd Street, CLA building, Austin, TX, 78712, USA
Kenneth R. Young
Laboratório de Ciências Ambientais, Universidade Estadual do Norte Fluminense, Av. Alberto Lamego 2000, Campos dos Goyatacazes, RJ, 28013-620, Brazil
Marcelo Trindade Nascimento
Department of Biology, University of Maryland, College Park, MD, 20742, USA
Maria Natalia Umaña Medina
GeoIS, El Día 369 y El Telégrafo, 3° Piso, Quito, Pichincha, Ecuador
Milton Tirado & Rodrigo Sierra
Environmental Science and Policy, Northern Arizona University, Flagstaff, AZ, 86011, USA
Ophelia Wang
FOMABO, Manejo Forestal en las Tierras Tropicales de Bolivia, Sacta, Cochabamba, Bolivia
Casimiro Mendoza
Escuela de Ciencias Forestales (ESFOR), Universidad Mayor de San Simon (UMSS), Sacta, Cochabamba, Bolivia
Casimiro Mendoza
Agricultural Services, Ministry of Agro-Industry and Food Security, Agricultural Services, Ministry of Agro-Industry and Food Security, The Mauritius Herbarium, Reduit, Mauritius
Cláudia Baider
Department of Bioscience, Aarhus University, Building 1540 Ny Munkegade, Aarhus C, Aarhus, DK-8000, Denmark
Henrik Balslev
Ciencias Biológicas, Universidad de Los Andes, Carrera 1 # 18a- 10, Bogotá, DC, 111711, Colombia
Luisa Fernanda Casas & Sasha Cárdenas
Medio Ambiente, PLUSPRETOL, Iquitos, Loreto, Peru
Manuel Augusto Ahuite Reategui
Center for Conservation Education and Sustainability, Smithsonian’s National Zoo & Conservation Biology Institute, National Zoological Park, 3001 Connecticut Ave, Washington, DC, 20008, USA
Reynaldo Linares-Palomino
Tropenbos International, Lawickse Allee 11 PO Box 232, Wageningen, 6700 AE, The Netherlands
Roderick Zagt
Instituto de Biodiversidade e Floresta, Universidade Federal do Oeste do Pará, Rua Vera Paz, Campus Tapajós, Santarém, PA, 68015-110, Brazil
Daniela Pauletto
Department of Biology, University of Missouri, St. Louis, MO, 63121, USA
Elvis H. Valderrama Sandoval
Facultad de Biologia, Universidad Nacional de la Amazonia Peruana, Pevas 5ta cdra, Iquitos, Loreto, Peru
Elvis H. Valderrama Sandoval & Freddy Ramirez Arevalo
School of Anthropology and Conservation, University of Kent, Marlowe Building, Canterbury, Kent, CT2 7NR, UK
Miguel N. Alexiades
Herbario Nacional del Ecuador, Universidad Técnica del Norte, Quito, Pichincha, Ecuador
Walter Palacios Cuenca
Ecology & Biodiversity Group, Utrecht University, Padualaan 8, Utrecht, 3584 CH, The Netherlands
Edwin Pos
Systems Ecology, Free University, De Boelelaan 1087, Amsterdam, 1081 HV, Netherlands
Hans ter Steege

Authors

Vitor H. F. Gomes
View author publications
You can also search for this author in PubMed Google Scholar
Stéphanie D. IJff
View author publications
You can also search for this author in PubMed Google Scholar
Niels Raes
View author publications
You can also search for this author in PubMed Google Scholar
Iêda Leão Amaral
View author publications
You can also search for this author in PubMed Google Scholar
Rafael P. Salomão
View author publications
You can also search for this author in PubMed Google Scholar
Luiz de Souza Coelho
View author publications
You can also search for this author in PubMed Google Scholar
Francisca Dionízia de Almeida Matos
View author publications
You can also search for this author in PubMed Google Scholar
Carolina V. Castilho
View author publications
You can also search for this author in PubMed Google Scholar
Diogenes de Andrade Lima Filho
View author publications
You can also search for this author in PubMed Google Scholar
Dairon Cárdenas López
View author publications
You can also search for this author in PubMed Google Scholar
Juan Ernesto Guevara
View author publications
You can also search for this author in PubMed Google Scholar
William E. Magnusson
View author publications
You can also search for this author in PubMed Google Scholar
Oliver L. Phillips
View author publications
You can also search for this author in PubMed Google Scholar
Florian Wittmann
View author publications
You can also search for this author in PubMed Google Scholar
Marcelo de Jesus Veiga Carim
View author publications
You can also search for this author in PubMed Google Scholar
Maria Pires Martins
View author publications
You can also search for this author in PubMed Google Scholar
Mariana Victória Irume
View author publications
You can also search for this author in PubMed Google Scholar
Daniel Sabatier
View author publications
You can also search for this author in PubMed Google Scholar
Jean-François Molino
View author publications
You can also search for this author in PubMed Google Scholar
Olaf S. Bánki
View author publications
You can also search for this author in PubMed Google Scholar
José Renan da Silva Guimarães
View author publications
You can also search for this author in PubMed Google Scholar
Nigel C. A. Pitman
View author publications
You can also search for this author in PubMed Google Scholar
Maria Teresa Fernandez Piedade
View author publications
You can also search for this author in PubMed Google Scholar
Abel Monteagudo Mendoza
View author publications
You can also search for this author in PubMed Google Scholar
Bruno Garcia Luize
View author publications
You can also search for this author in PubMed Google Scholar
Eduardo Martins Venticinque
View author publications
You can also search for this author in PubMed Google Scholar
Evlyn Márcia Moraes de Leão Novo
View author publications
You can also search for this author in PubMed Google Scholar
Percy Núñez Vargas
View author publications
You can also search for this author in PubMed Google Scholar
Thiago Sanna Freire Silva
View author publications
You can also search for this author in PubMed Google Scholar
Angelo Gilberto Manzatto
View author publications
You can also search for this author in PubMed Google Scholar
John Terborgh
View author publications
You can also search for this author in PubMed Google Scholar
Neidiane Farias Costa Reis
View author publications
You can also search for this author in PubMed Google Scholar
Juan Carlos Montero
View author publications
You can also search for this author in PubMed Google Scholar
Katia Regina Casula
View author publications
You can also search for this author in PubMed Google Scholar
Beatriz S. Marimon
View author publications
You can also search for this author in PubMed Google Scholar
Ben-Hur Marimon
View author publications
You can also search for this author in PubMed Google Scholar
Euridice N. Honorio Coronado
View author publications
You can also search for this author in PubMed Google Scholar
Ted R. Feldpausch
View author publications
You can also search for this author in PubMed Google Scholar
Alvaro Duque
View author publications
You can also search for this author in PubMed Google Scholar
Charles Eugene Zartman
View author publications
You can also search for this author in PubMed Google Scholar
Nicolás Castaño Arboleda
View author publications
You can also search for this author in PubMed Google Scholar
Timothy J. Killeen
View author publications
You can also search for this author in PubMed Google Scholar
Bonifacio Mostacedo
View author publications
You can also search for this author in PubMed Google Scholar
Rodolfo Vasquez
View author publications
You can also search for this author in PubMed Google Scholar
Jochen Schöngart
View author publications
You can also search for this author in PubMed Google Scholar
Rafael L. Assis
View author publications
You can also search for this author in PubMed Google Scholar
Marcelo Brilhante Medeiros
View author publications
You can also search for this author in PubMed Google Scholar
Marcelo Fragomeni Simon
View author publications
You can also search for this author in PubMed Google Scholar
Ana Andrade
View author publications
You can also search for this author in PubMed Google Scholar
William F. Laurance
View author publications
You can also search for this author in PubMed Google Scholar
José Luís Camargo
View author publications
You can also search for this author in PubMed Google Scholar
Layon O. Demarchi
View author publications
You can also search for this author in PubMed Google Scholar
Susan G. W. Laurance
View author publications
You can also search for this author in PubMed Google Scholar
Emanuelle de Sousa Farias
View author publications
You can also search for this author in PubMed Google Scholar
Henrique Eduardo Mendonça Nascimento
View author publications
You can also search for this author in PubMed Google Scholar
Juan David Cardenas Revilla
View author publications
You can also search for this author in PubMed Google Scholar
Adriano Quaresma
View author publications
You can also search for this author in PubMed Google Scholar
Flavia R. C. Costa
View author publications
You can also search for this author in PubMed Google Scholar
Ima Célia Guimarães Vieira
View author publications
You can also search for this author in PubMed Google Scholar
Bruno Barçante Ladvocat Cintra
View author publications
You can also search for this author in PubMed Google Scholar
Hernán Castellanos
View author publications
You can also search for this author in PubMed Google Scholar
Roel Brienen
View author publications
You can also search for this author in PubMed Google Scholar
Pablo R. Stevenson
View author publications
You can also search for this author in PubMed Google Scholar
Yuri Feitosa
View author publications
You can also search for this author in PubMed Google Scholar
Joost F. Duivenvoorden
View author publications
You can also search for this author in PubMed Google Scholar
Gerardo A. Aymard C.
View author publications
You can also search for this author in PubMed Google Scholar
Hugo F. Mogollón
View author publications
You can also search for this author in PubMed Google Scholar
Natalia Targhetta
View author publications
You can also search for this author in PubMed Google Scholar
James A. Comiskey
View author publications
You can also search for this author in PubMed Google Scholar
Alberto Vicentini
View author publications
You can also search for this author in PubMed Google Scholar
Aline Lopes
View author publications
You can also search for this author in PubMed Google Scholar
Gabriel Damasco
View author publications
You can also search for this author in PubMed Google Scholar
Nállarett Dávila
View author publications
You can also search for this author in PubMed Google Scholar
Roosevelt García-Villacorta
View author publications
You can also search for this author in PubMed Google Scholar
Carolina Levis
View author publications
You can also search for this author in PubMed Google Scholar
Juliana Schietti
View author publications
You can also search for this author in PubMed Google Scholar
Priscila Souza
View author publications
You can also search for this author in PubMed Google Scholar
Thaise Emilio
View author publications
You can also search for this author in PubMed Google Scholar
Alfonso Alonso
View author publications
You can also search for this author in PubMed Google Scholar
David Neill
View author publications
You can also search for this author in PubMed Google Scholar
Francisco Dallmeier
View author publications
You can also search for this author in PubMed Google Scholar
Leandro Valle Ferreira
View author publications
You can also search for this author in PubMed Google Scholar
Alejandro Araujo-Murakami
View author publications
You can also search for this author in PubMed Google Scholar
Daniel Praia
View author publications
You can also search for this author in PubMed Google Scholar
Dário Dantas do Amaral
View author publications
You can also search for this author in PubMed Google Scholar
Fernanda Antunes Carvalho
View author publications
You can also search for this author in PubMed Google Scholar
Fernanda Coelho de Souza
View author publications
You can also search for this author in PubMed Google Scholar
Kenneth Feeley
View author publications
You can also search for this author in PubMed Google Scholar
Luzmila Arroyo
View author publications
You can also search for this author in PubMed Google Scholar
Marcelo Petratti Pansonato
View author publications
You can also search for this author in PubMed Google Scholar
Rogerio Gribel
View author publications
You can also search for this author in PubMed Google Scholar
Boris Villa
View author publications
You can also search for this author in PubMed Google Scholar
Juan Carlos Licona
View author publications
You can also search for this author in PubMed Google Scholar
Paul V. A. Fine
View author publications
You can also search for this author in PubMed Google Scholar
Carlos Cerón
View author publications
You can also search for this author in PubMed Google Scholar
Chris Baraloto
View author publications
You can also search for this author in PubMed Google Scholar
Eliana M. Jimenez
View author publications
You can also search for this author in PubMed Google Scholar
Juliana Stropp
View author publications
You can also search for this author in PubMed Google Scholar
Julien Engel
View author publications
You can also search for this author in PubMed Google Scholar
Marcos Silveira
View author publications
You can also search for this author in PubMed Google Scholar
Maria Cristina Peñuela Mora
View author publications
You can also search for this author in PubMed Google Scholar
Pascal Petronelli
View author publications
You can also search for this author in PubMed Google Scholar
Paul Maas
View author publications
You can also search for this author in PubMed Google Scholar
Raquel Thomas-Caesar
View author publications
You can also search for this author in PubMed Google Scholar
Terry W. Henkel
View author publications
You can also search for this author in PubMed Google Scholar
Doug Daly
View author publications
You can also search for this author in PubMed Google Scholar
Marcos Ríos Paredes
View author publications
You can also search for this author in PubMed Google Scholar
Tim R. Baker
View author publications
You can also search for this author in PubMed Google Scholar
Alfredo Fuentes
View author publications
You can also search for this author in PubMed Google Scholar
Carlos A. Peres
View author publications
You can also search for this author in PubMed Google Scholar
Jerome Chave
View author publications
You can also search for this author in PubMed Google Scholar
Jose Luis Marcelo Pena
View author publications
You can also search for this author in PubMed Google Scholar
Kyle G. Dexter
View author publications
You can also search for this author in PubMed Google Scholar
Miles R. Silman
View author publications
You can also search for this author in PubMed Google Scholar
Peter Møller Jørgensen
View author publications
You can also search for this author in PubMed Google Scholar
Toby Pennington
View author publications
You can also search for this author in PubMed Google Scholar
Anthony Di Fiore
View author publications
You can also search for this author in PubMed Google Scholar
Fernando Cornejo Valverde
View author publications
You can also search for this author in PubMed Google Scholar
Juan Fernando Phillips
View author publications
You can also search for this author in PubMed Google Scholar
Gonzalo Rivas-Torres
View author publications
You can also search for this author in PubMed Google Scholar
Patricio von Hildebrand
View author publications
You can also search for this author in PubMed Google Scholar
Tinde R. van Andel
View author publications
You can also search for this author in PubMed Google Scholar
Ademir R. Ruschel
View author publications
You can also search for this author in PubMed Google Scholar
Adriana Prieto
View author publications
You can also search for this author in PubMed Google Scholar
Agustín Rudas
View author publications
You can also search for this author in PubMed Google Scholar
Bruce Hoffman
View author publications
You can also search for this author in PubMed Google Scholar
César I. A. Vela
View author publications
You can also search for this author in PubMed Google Scholar
Edelcilio Marques Barbosa
View author publications
You can also search for this author in PubMed Google Scholar
Egleé L. Zent
View author publications
You can also search for this author in PubMed Google Scholar
George Pepe Gallardo Gonzales
View author publications
You can also search for this author in PubMed Google Scholar
Hilda Paulette Dávila Doza
View author publications
You can also search for this author in PubMed Google Scholar
Ires Paula de Andrade Miranda
View author publications
You can also search for this author in PubMed Google Scholar
Jean-Louis Guillaumet
View author publications
You can also search for this author in PubMed Google Scholar
Linder Felipe Mozombite Pinto
View author publications
You can also search for this author in PubMed Google Scholar
Luiz Carlos de Matos Bonates
View author publications
You can also search for this author in PubMed Google Scholar
Natalino Silva
View author publications
You can also search for this author in PubMed Google Scholar
Ricardo Zárate Gómez
View author publications
You can also search for this author in PubMed Google Scholar
Stanford Zent
View author publications
You can also search for this author in PubMed Google Scholar
Therany Gonzales
View author publications
You can also search for this author in PubMed Google Scholar
Vincent A. Vos
View author publications
You can also search for this author in PubMed Google Scholar
Yadvinder Malhi
View author publications
You can also search for this author in PubMed Google Scholar
Alexandre A. Oliveira
View author publications
You can also search for this author in PubMed Google Scholar
Angela Cano
View author publications
You can also search for this author in PubMed Google Scholar
Bianca Weiss Albuquerque
View author publications
You can also search for this author in PubMed Google Scholar
Corine Vriesendorp
View author publications
You can also search for this author in PubMed Google Scholar
Diego Felipe Correa
View author publications
You can also search for this author in PubMed Google Scholar
Emilio Vilanova Torre
View author publications
You can also search for this author in PubMed Google Scholar
Geertje van der Heijden
View author publications
You can also search for this author in PubMed Google Scholar
Hirma Ramirez-Angulo
View author publications
You can also search for this author in PubMed Google Scholar
José Ferreira Ramos
View author publications
You can also search for this author in PubMed Google Scholar
Kenneth R. Young
View author publications
You can also search for this author in PubMed Google Scholar
Maira Rocha
View author publications
You can also search for this author in PubMed Google Scholar
Marcelo Trindade Nascimento
View author publications
You can also search for this author in PubMed Google Scholar
Maria Natalia Umaña Medina
View author publications
You can also search for this author in PubMed Google Scholar
Milton Tirado
View author publications
You can also search for this author in PubMed Google Scholar
Ophelia Wang
View author publications
You can also search for this author in PubMed Google Scholar
Rodrigo Sierra
View author publications
You can also search for this author in PubMed Google Scholar
Armando Torres-Lezama
View author publications
You can also search for this author in PubMed Google Scholar
Casimiro Mendoza
View author publications
You can also search for this author in PubMed Google Scholar
Cid Ferreira
View author publications
You can also search for this author in PubMed Google Scholar
Cláudia Baider
View author publications
You can also search for this author in PubMed Google Scholar
Daniel Villarroel
View author publications
You can also search for this author in PubMed Google Scholar
Henrik Balslev
View author publications
You can also search for this author in PubMed Google Scholar
Italo Mesones
View author publications
You can also search for this author in PubMed Google Scholar
Ligia Estela Urrego Giraldo
View author publications
You can also search for this author in PubMed Google Scholar
Luisa Fernanda Casas
View author publications
You can also search for this author in PubMed Google Scholar
Manuel Augusto Ahuite Reategui
View author publications
You can also search for this author in PubMed Google Scholar
Reynaldo Linares-Palomino
View author publications
You can also search for this author in PubMed Google Scholar
Roderick Zagt
View author publications
You can also search for this author in PubMed Google Scholar
Sasha Cárdenas
View author publications
You can also search for this author in PubMed Google Scholar
William Farfan-Rios
View author publications
You can also search for this author in PubMed Google Scholar
Adeilza Felipe Sampaio
View author publications
You can also search for this author in PubMed Google Scholar
Daniela Pauletto
View author publications
You can also search for this author in PubMed Google Scholar
Elvis H. Valderrama Sandoval
View author publications
You can also search for this author in PubMed Google Scholar
Freddy Ramirez Arevalo
View author publications
You can also search for this author in PubMed Google Scholar
Isau Huamantupa-Chuquimaco
View author publications
You can also search for this author in PubMed Google Scholar
Karina Garcia-Cabrera
View author publications
You can also search for this author in PubMed Google Scholar
Lionel Hernandez
View author publications
You can also search for this author in PubMed Google Scholar
Luis Valenzuela Gamarra
View author publications
You can also search for this author in PubMed Google Scholar
Miguel N. Alexiades
View author publications
You can also search for this author in PubMed Google Scholar
Susamar Pansini
View author publications
You can also search for this author in PubMed Google Scholar
Walter Palacios Cuenca
View author publications
You can also search for this author in PubMed Google Scholar
William Milliken
View author publications
You can also search for this author in PubMed Google Scholar
Joana Ricardo
View author publications
You can also search for this author in PubMed Google Scholar
Gabriela Lopez-Gonzalez
View author publications
You can also search for this author in PubMed Google Scholar
Edwin Pos
View author publications
You can also search for this author in PubMed Google Scholar
Hans ter Steege
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

H.t.S. conceived the study. H.t.S., N.R., S.D.I.J. and V.H.F.G. designed the study. V.H.F.G. carried out the GBIF data collection. V.H.F.G. and S.D.I.J. carried out the analyses and wrote the R scripts. All co-authors contributed to checking and re-checking of the species list and all co-authors contributed to the writing.

Corresponding author

Correspondence to Hans ter Steege.

Ethics declarations

Competing Interests

The authors declare that they have no competing interests.

Additional information

Publisher's note: Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Electronic supplementary material

How does abundance affect the chance of being found?

Comparing the results of modelling the area of occupancy with MaxEnt and with inverse distance weighting (IDW).

Maps of combined MaxEnt IDW predictions.

Cleaning pipeline results for all 227 hyperdominant species.

Area of occupancy predicted by MaxEnt for each step of the modelling pipeline.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Gomes, V.H., IJff, S.D., Raes, N. et al. Species Distribution Modelling: Contrasting presence-only models with plot abundance data. Sci Rep 8, 1003 (2018). https://doi.org/10.1038/s41598-017-18927-1

Download citation

Received: 22 May 2017
Accepted: 20 December 2017
Published: 17 January 2018
DOI: https://doi.org/10.1038/s41598-017-18927-1

This article is cited by

One sixth of Amazonian tree diversity is dependent on river floodplains
- John Ethan Householder
- Florian Wittmann
- Hans ter Steege
Nature Ecology & Evolution (2024)
Ecological niche modelling of a critically endangered species Commiphora wightii (Arn.) Bhandari using bioclimatic and non-bioclimatic variables
- Manish Mathur
- Preet Mathur
- Harshit Purohit
Ecological Processes (2023)
Ecological niche model transferability of the white star apple (Chrysophyllum albidum G. Don) in the context of climate and global changes
- Jean Cossi Ganglo
Scientific Reports (2023)
Vulnerability of the Cerrado–Atlantic Forest ecotone in the Espinhaço Range Biosphere Reserve to climate change
- Thaís Ribeiro Costa
- Ludmila Aglai da Silva
- Anne Priscila Dias Gonzaga
Theoretical and Applied Climatology (2023)
Identifying the drivers of silky shark distribution and an evaluation of protection measures
- Shona Murray
- Jessica J. Meeuwig
- David Mouillot
Environmental Biology of Fishes (2023)

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.