Abstract
We present a flora and fauna dataset for the Mira-Mataje binational basins. This is an area shared between southwestern Colombia and northwestern Ecuador, where both the Chocó and Tropical Andes biodiversity hotspots converge. We systematized data from 120 sources in the Darwin Core Archive (DwC-A) standard and geospatial vector data format for geographic information systems (GIS) (shapefiles). Sources included natural history museums, published literature, and citizen science repositories across 13 countries. The resulting database has 33,460 records from 6,821 species, of which 540 have been recorded as endemic, and 612 as threatened. The diversity represented in the dataset is equivalent to 10% of the total plant species and 26% of the total terrestrial vertebrate species in both hotspots. The dataset can be used to estimate and compare biodiversity patterns with environmental parameters and provide value to ecosystems, ecoregions, and protected areas. The dataset is a baseline for future assessments of biodiversity in the face of environmental degradation, climate change, and accelerated extinction processes.
Similar content being viewed by others
Background & Summary
With less than 1% of the planet’s surface, the Tropical Andes and Chocó biodiversity hotspots shelter the richest and most endemically dense biota of flora and fauna on Earth1. Both, in northwestern South America, are a high priority for conservation at local, regional, and global scales. They require transnational collaborative networks to conserve their biodiversity1,2,3,4,5.
In the transboundary basins of the Mira and Mataje rivers, the tropical forests of the Chocó converge with the tropical Andes of northern Colombia and Ecuador. These watersheds are of significance to conservationists and researchers in northwestern South America. Several lineages described as new species show the importance of these basins in the mechanisms of speciation and geographic distribution of flora and fauna5,6,7,8,9,10,11,12,13,14,15,16. Nevertheless, there have been no documented efforts to consolidate the available information on species occurrences into a unified dataset for this important conservation region.
The armed conflict and geopolitical setting in this national border, between southern Colombia and northern Ecuador, have prevented bioexploration in recent decades17. Valuable remnants of natural vegetation in both basins require urgent conservation actions9. It is within this context that the project consortium named “Biodiversity Conservation in the Binational Basins of the Mira-Mataje Rivers (MMRB): Construction of Biophysical and Socio-environmental Bases for the Conservation and Adaptive Management of Ecosystem Services”, has the purpose of influencing and participating in decision-making. It aspires to improve the capacity of stakeholders and decision-makers to conserve ecosystems and optimize the management of its environmental services. One main component of the MMRB was to establish flora and fauna datasets of collected and observed specimens from the basins in Colombia and Ecuador. The provided datasets can be enriched with additional records through new field expeditions for exploring current sampling gaps. We encourage the use and updating of the presented datasets.
Methods
Study area
The Mira and Mataje binational basins are in northwestern Ecuador and southwestern Colombia; they cover an area of 1,045,965 hectares. They originate in Ecuador, in the Andean Massif of the Eastern Cordillera of the Andes in the Pimampiro region. To the north, they connect through the inter-Andean mountains of Tulcán, with the Western Andes in the El Ángel paramo. To the south, they are delimited by the Mojanda and Cajas crossroads, connecting to the west with the Cotacachi-Yanahurco-Piñan mountain system (Fig. 1). Their waters flow through moorlands, foothill cloud forests, and rainforests until reaching Cabo Manglares in Tumaco, where they empty into the Pacific Ocean in Colombia. They subdivide into six sub-basins, have more than 15 ecological systems, and their elevation gradient ranges from 0 to 4800 m.
Data sources and data compilation
We obtained a collection of 35 data tables for records of fauna and a collection of 36 data tables for records of flora (Supplementary Tables S1 and S2). We accessed these data tables from October 2019 to October 2022. The flora collection included records of vascular plants, while the fauna collection included records of mammals, birds, reptiles, and amphibians. We obtained the data tables from various sources, including government agencies, non-governmental organizations, and academic institutions. Data was delimited geographically to those records belonging to the Mira-Mataje binational basins.
Data organization
We systematized and reorganized the information on all the data tables to create a new dataset based on the Darwin Core Archive (DwC-A) standard18. The DwC-A provides a way to package biodiversity data and metadata in a self-contained and machine-readable format. The DwC-A serves for sharing biodiversity data and is primarily based on the occurrence in nature of taxa, as documented by observations, specimens, samples, and related information18. An accompanying shape file is also provided. The shapefile delimits all ecosystems in the Mira-Mataje binational basins, and provides a tabulated measure of total records, total recorded species, number of endemic species, and number of threatened species. Data organization was carried out by hand and by a single researcher who applied available filters to the selected source data tables and extracted the corresponding data records. Data organization followed the DataONE Best Practices as a guideline19
Data cleaning
To ensure accuracy, completeness, and consistency, we did manual and computational processes for quality control. We identified and corrected errors, such as misspellings, incorrect georeferences, and mismatched taxonomic names. We also checked for missing or incomplete data, such as absent or vague locality descriptions or missing occurrence dates. We resolved all inconsistencies and missing data by consulting the primary literature, museum specimens, and reference databases at the National Institute of Biodiversity of Ecuador (INABIO) (http://inabio.biodiversidad.gob.ec/) and the Pacific Research Institute of Colombia (https://iiap.org.co/). For additional details regarding computational strategies to avoid duplicate records, please refer to the “Technical validation” section below.
Data integration
We integrated the data from the individual datasets into a single, unified dataset using the DwC-A common set of terms and standards. According to the informed criterion of the research team, we mapped the individual terms of each dataset to the DwC-A. The mapping process was assisted by the available definition of the Darwin Core terms (https://dwc.tdwg.org/terms/). We used this mapping process to transform the systematized data into the required standardized format.
Geodatabase
We consolidated a shapefile for the basin that contains layers for vegetation ecosystems and the total number of occurrences, number of species, and number of endemic and threatened species for each ecosystem. This spatial information, including the digital elevation model, was used to generate dynamic three-dimensional shaded relief maps in Rayshader20 in R21. Endemism is a biogeographical concept, and the construction of the geodatabase provided an opportunity to complement the endemism data recorded in the DwC-A term “establishmentMeans”, which was recorded by the original data operators of the source data files. For this, the species distribution ranges were considered according to the categories of political endemism (i.e. species unique to Colombia or Ecuador) and regional endemism (i.e. species restricted to southern Colombia and northern Ecuador). As a result, we included an additional term (column) in the data tables for plants and terrestrial vertebrates, “Distribution”, that determined the condition of endemism for the recorded species and according to our geographical analysis.
Data Records
The resulting dataset (i.e. data tables and geodatabase) is available for download at The Knowledge Network for Biocomplexity (KNB) (https://doi.org/10.5063/F14F1P6H)22, under the Creative Commons Attribution 4.0 International License. The dataset was modified during the manuscript review process; thus, we present the latest version. Although KNB stores previous versions of datasets, these are neither shown nor numbered. Besides the commonly known .csv format, we also compiled the reviewed data tables in the .rds format. The .rds file extension saves a single object that can be interpreted in R21 and provides better compression, serialization, and integration within the R environment, when compared to plain .csv files21. The available dataset includes three data tables for flora (“m_m_flora_total_2021_ecuador.rds” or “m_m_flora_total_2021_ecuador.csv”, “m_m_flora_voucher_2021_ecuador.rds” or “m_m_flora_voucher_2021_ecuador.csv”, “m_m_flora_observation_2021_ecuador.rds” or “m_m_flora_observation_2021_ecuador.csv”) and one data table for fauna ("m_m_fauna_2021_ecuador.rds” or “m_m_fauna_2021_ecuador.csv”). Two main kinds of records are present in the provided dataset. The first are records based on voucher specimens preserved at herbaria or zoology collections. The second are field observations or records obtained from the literature, without voucher specimens. All literature sources point to its corresponding references in the DwC-A term “References”. Data tables have been encoded in Unicode UTF-8. Additionally, we provide a geodatabase as a shapefile, named “biodiversity_measures_Mira_Mataje.shp” with its accessory files. Because of unavailable coordinates for reported localities, the geodatabase contains 33,027 records, this is 433 fewer records than the total in the presented data tables (33,460 records).
The 33,460 terrestrial records belong to 23,368 flora records and 10,092 fauna records (Fig. 1). The dataset is intended for researchers, policymakers, and stakeholders in biodiversity conservation and management. We encourage users to cite the dataset and acknowledge the original data sources and contributors in their publications and presentations. We also plan to maintain and periodically update the dataset to ensure its relevance and usefulness.
The dataset includes data from 13 countries (Fig. 2), with 82.40% of the records from natural history museums and 17.60% from citizen science. For flora, the majority of species and records were obtained from repositories in USA (83.15%), followed by Ecuador (15%) and literature sources (7%) (Fig. 2 and Fig. 3a,b). For fauna, the majority of species and records were obtained from repositories in Ecuador (62%), followed by the USA (34%), and Colombia (3%) (Fig. 2 and Fig. 4a,b). The contrasting differences between the contributions of USA and Ecuador for the flora and fauna datasets, where USA contributes more to flora and Ecuador contributes more to fauna may mirror particular trajectories in the development of biodiversity studies at both countries.
The data records in both data tables (i.e. “m_m_flora_2021_ecuador.rds” and “m_m_fauna_2021_ecuador.rds”) represent 15 ecosystems in the Mira-Mataje binational basins. The greatest number of records (11,612) came from the Western Foothill Forests (WFOF). Terrestrial fauna records were more abundant in the Western Low Montane Forest (WLMF) ; in contrast, flora records reached higher numbers in the Western Foothill Forests (WFOF) (Fig. 5). We estimated 6,821 species, corresponding to 5,553 plant species and 1,268 terrestrial vertebrate species. According to the DwC-A terms “establishmentMeans” (endemic condition) and “threatStatus-IUCN” (threat status), the dataset has recorded an estimated 540 endemic species and 612 species at risk of extinction (Table 1). The results of our geographical assessment of endemism are described in section “Information on diversity, endemism, and risk of extinction” (Fig. 6).
Data table for fauna
In the data table “mira_mataje_fauna_2021_ecuador.rds”, records came from 35 repositories(e.g. museums) in seven countries, with literature as an additional source23,24,25,26,27,28,29,30,31,32,33,34,35,36,37,38,39,40,41,42,43,44,45,46,47,48,49,50,51,52,53,54,55,56,57,58,59,60,61,62,63,64,65,66,67,68,69,70,71,72,73,74,75,76 (Supplementary Table S1). The most relevant repositories, both in the number of records and species, include QCAZ77,78,79, INABIO, MLZ, and GADPC (Fig. 4). The fauna data table corresponds to 1268 species, from which 69.08% are birds, 13.41% mammals, 11.36% amphibians, and 6.15% reptiles (Fig. 7). A total of 670 records have information on endemism, that represent 59 vertebrate species, of which 59.32% are amphibians, 22.03% reptiles, and 18.64% mammals (Fig. 7). A total of 1302 records have information on threatened conservation status, that represent 295 species at risk of extinction; corresponding to 71.19% birds, 15.25% amphibians, 9.15% reptiles, and 4.41% mammals (Fig. 7).
Data tables for flora
In the data table “mira_mataje_flora_2021_ecuador.rds”, flora records came from 36 repositories, in 12 countries, with literature as an additional source24,30,74,80,81,82,83,84,85,86,87,88,89,90,91,92,93,94,95,96,97,98,99,100,101,102,103,104,105,106,107,108,109,110,111 (Supplementary Table S2). The most relevant repositories, both in the number of records and species, include MO-Tropicos, QCA-PUCE112, and INABIOEC-QCE, (Fig. 3). Literature sources also provided a representative number of records and species. The flora data table represents 60.18% species of Magnoliopsida, 28.24% of Liliopsida, and 9.95% of Polypodiopsida. A total of 1073 records have information on endemism, from which 481 species are registered as endemic to the southwest of Colombia and northern Ecuador; 52.60% of the species in Magnoliopsida, 43.03% in Liliopsida, and 3.74% in Polypodiopsida are endemic (Fig. 7). A total of 758 records have information on threatened conservation status (vulnerable, endangered, and critical) from which 317 flora species are at risk of extinction, corresponding to 47.31% Magnoliopsida, 46.69% Liliopsida, and 5.36% Polypodiopsida (Fig. 7).
Information on diversity, endemism, and risk of extinction in the flora and fauna data tables
The diversity of flora (5,553 spp.) and fauna (1,268 spp.) of the Mira- Mataje binational basin is equivalent to 10% of the plants and 26% of the vertebrates in the Tropical Andes and Chocó biodiversity hotspots as defined by Myers et al.1. However, the Mira- Mataje binational basin covers an area of 0.07% of both biodiversity hotspots combined. The ecosystemic mosaic of the Mira-Mataje binational basin influences the distribution of the flora and fauna11. It includes the evergreen biomes of the Chocó lowlands (i. e. lower Mira basin) and the low montane forests of the western foothills of the Andes (i.e. middle Mira basin). The altitudinal bands corresponding to the former ecosystems have the highest diversity of the gradient (Fig. 8). The highest number of species (3732) and endangered species (462) occurs in the Western Foothill Forest (WFOF) (Fig. 8). As a complement to Fig. 8, we have made available a set of dynamic 3D shaded-relief map renders in Figshare (https://doi.org/10.6084/m9.figshare.23499180.v4)113.
Terrestrial fauna diversity is the richest in the Western Low Montane Forest (WLMF), with a total of 653 species and endemic species are more numerous in the Western Foothill Forest, with 893 species (Figs. 5 and 6). In contrast, the number of flora species reaches 3330 in the (WFOF), and 103 endemic species in the WLMF (Figs. 5 and 6). The highest number of endangered species in terrestrial fauna is 299 in the Western Montane Forest (WMF). The highest number of endangered species in plants is 266 in the WFOF (Fig. 6).
Technical Validation
One significant challenge to the presentation of curated biodiversity data sets is the handling of duplicates. Here we define duplicated records as multiple specimens that represent a single object type or collection event114. Duplicated records are present at both global online biodiversity databases, like GBIF, and museum repositories, particularly in botanical collections. The same collection event could be found duplicated in various herbaria collections or within a single herbarium. For example, from a total of 58,860 specimens in the Dipterocarpaceae from nine herbaria, 21,075 specimens were selected after discriminating duplicates115. Although duplicates, frequently populating the history and constitution of world herbaria, are useful as currency for the interinstitutional exchange of museum material, as a buffer to catastrophic losses, or as foundational material for new herbaria collections114,116,117, they also represent problematic data for biodiversity estimates if their condition as duplicates has not been properly recorded, and therefore cannot be discriminated from single data events. Thus, to circumnavigate the issue of duplicates in our data set, we designed a strategy and algorithm in R21, and its reference is found in the “Code availability” section, below. The code in R pertains to a data preprocessing and cleaning pipeline designed for botanical and zoological occurrence data. This pipeline comprises several steps aimed at harmonizing taxonomic names, standardizing collector names, identifying and handling duplicate records, and segmenting the dataset for further analysis (Fig. 9). Because duplicates occurred more intensively in the botanical data tables, our focus was on this particular group of data; however, duplicates were also screened in the fauna dataset.
Data preprocessing and standardization
The raw botanical data underwent a series of preprocessing steps to ensure data quality and standardization. This included the removal of non-standard characters, filtering to include only specimen records, and standardization of taxonomic and locality information for consistency across the dataset. Packages tydyverse118 and taxisze119,120 where used along the pipeline. The original raw data tables were in .csv format and imported to R. Taxonomic thesauri from The Catalogue of Life (TCF)121, World Checklist of Vascular Plants (WCVP)122, and World Flora Online (WFO)123 were loaded into data frames for taxonomic standardization purposes. By applying regular expressions, unnecessary characters and words were removed or replaced from all data columns which could hinder subsequent taxonomic harmonization and the discrimination of duplicate data. We cleaned, replaced, and standardized specific character patterns in the data tables; such as various representations of null records, whitespace characters or extra spaces, accents, Latin letters, and converted character strings to sentence case. Non-ASCII characters were transformed to ASCII, and unnecessary punctuation marks, separators, and special characters were removed. We placed emphasis on taxonomic and locality information in key DwC-A columns, such as “scientificName”, “recordedBy”, and “locality”.
Harmonization of scientific and collector names
Scientific names for the plant data set ("scientificName” in the DwC-A terminology) were standardized according to the International Code of Nomenclature for algae, fungi, and plants124. All unique 6,808 plant taxa names that were found in our dataset were compared with the WCVP -Version 12122. Several scientific names were not found in the WCVP database, so these were compared with the WFO version December 2023123, the TCF121, and the Tropicos database of the Missouri Botanical Garden (accessed in December 2023, https://tropicos.org), the latter using the API provided by the taxize package119,120. Once applied, this approach allowed the identification of multiple synonyms that were updated to accepted scientific names. Besides the DwC-A “scientificName” term, the harmonization also updated the “scientificNameAuthorship”, “family”, “genus”, “specificEpithet”, and “infraspecificEpithet” terms.
To resolve distinct taxonomic determinations across herbaria for duplicates of the same collection event. we applied a prioritization scheme to achieve standardized scientific names. This process was necessary due to potential inconsistencies, synonyms, illegitimate names, misapplied names, and outdated names often present in botanical datasets115,125. This prioritization typically followed a hierarchy where accepted names were given the highest priority, followed by synonyms, illegitimate names, misapplied names, and finally invalid names. In some cases, synonyms were updated to their corresponding accepted names based on the WCVP taxonomy. The harmonized scientific names underwent validation and standardization checks to ensure completeness, correctness, and adherence to established nomenclatural rules and conventions. This validation step involved cross-referencing with taxonomic thesauri to confirm the accuracy and validity of the scientific names.
Collector names were parsed and cleaned to extract consistent surnames, ensuring harmonization and standardization across varied collector name formats and irregularities. The the DwC-A term “recordedBy” column was standardized by extracting the collectors’ surnames using regular expressions, and in the few cases where automatic extraction was not possible, the surnames were extracted manually.
Duplicate records handling
Duplicate records were identified mainly on collector names ("recordedBy” in DwC-A terminology) and record numbers ("recordNumber in DwC-A terminology). When any of both DwC-A terms were available, we used geographic ("decimalLatitude” and “decimalLongitude” in DwC-A terminology) and locality ("locality” in DwC-A terminology) data. Yet, caution and informed decisions are required when using location data, as botanical collectors often tend to apply generalized locality descriptions for collection batches, a practice that obscures the precise identity of recalcitrant duplicate records. Therefore, to obtain a robust flora data table, we filtered out all occurrences that did not have data for the “recordedBy” and “recordNumber” columns, and which were not voucher specimens. This discrimination was performed to ensure the taxonomic reliability of the compiled occurrences. Through this first filtering step, the preliminary flora data table went from 238,219 to 60,094 occurrences.
Duplicate records were prioritized based on the identification method, herbarium information, and taxonomic rank. This prioritization ensured the retention of the most informative and reliable duplicate records while discarding redundant or less informative duplicates. This framework included the following prioritization criteria. 1) Identified by: Records with explicit identification information, such as those identified by experts or through detailed documentation, were prioritized over records without clear identification sources. 2) Herbarium information: Records associated with reputable herbarium institutions, indicated by institution codes, were given higher priority due to the reliability and authenticity typically associated with specimens stored in established herbaria. 3) Taxonomic rank: The taxonomic rank of the specimens, categorized as species, genus, or family, was considered in the prioritization process. Records with higher taxonomic resolution (e.g., species-level identification) were prioritized over records with lower taxonomic resolution (e.g., family-level identification). The prioritization algorithm involved sorting the duplicate records based on the aforementioned criteria in a hierarchical manner. Specifically, records were sorted first by the identification source (identified by or not), then by the associated herbarium (institution code), and finally by the taxonomic rank of the specimen. Duplicate records were evaluated based on the prioritization algorithm, with higher-priority records retained in the dataset. Redundant duplicates, i.e., records that did not offer additional information or value beyond the retained records, were systematically removed from the dataset. False negatives and mismatches were identified for further review and resolution. The final product, after the described approach of filtering and discrimination of records, resulted in a final data table containing 23,368 plant records.
Voucher specimens and observations
For the plant data set, a complementary data table was generated to encompass occurrences lacking collector names, record numbers, or preservation status, providing a comprehensive overview of both records based on preserved specimens and records based on observations or literature sources (i.e. voucherless records). The DwC-A term “basisOfRecord” provides a controlled vocabulary that differentiates between those records that possess a museum voucher and those that are observations or based on literature sources. For the former, the term “PreservedSpecimen” was used, and for the latter the most common term used was “HumanObservation”. The DwC-A term “basisOfRecord” should be interpreted with the DwC-A term “References” to understand the source of observation records. An additional DwC-A term, “institutionCode”, provides differentiation for those records that belong to institutions and those that belong to literature sources. Both data tables, preserved specimens and observations, provided a comprehensive view of the available botanical occurrence data. These two tables were then merged for the presentation of a global botanical data table. Thus, three tables were deemed necessary, the first with voucher museum specimens, the second with voucherless observations, and the third with merged records from the previous two tables. We present records in this way to facilitate future use and research.
Usage Notes
To the best of our knowledge, the datasets we have presented for the Mira-Mataje binational basins are unique, as it is the first time a single systematized and consolidated dataset is available for the region. Because of the diverse formats and approaches to tabulating and classifying data in the reviewed sources, a manual validation was necessary to systematize all dispersed data into a single and coherent dataset under the universal and common DwC-A standard. The R statistical programming language21 is required to open the provided data tables under the .rds extension, with the advantage of a consistent character codification and a proper analytical environment. The provided comma-separated values files allow universal access to the data. The accompanying geodata is in the common geospatial vector data, which consists of the main file, index file, and database table, and is readable in all common geoinformatics platforms. The open-access release of this Neotropical biodiversity dataset is anticipated to unite stakeholders interested in preserving this highly endangered region.
Code availability
The code for the “Technical validation” section is publicly available at https://github.com/roberto-roman/mira-mataje_duplicates/tree/main. This code requires and works with large taxonomic data tables that have been mentioned in the “Technical Validation” section and that are available for download at The Knowledge Network for Biocomplexity (KNB) (https://doi.org/10.5063/F14F1P6H)22. The code for generating the 3D shaded-relief maps is publicly available at https://github.com/PJV-Ecu/3D-shaded-relief-maps-using-Rayshader.git.
References
Myers, N., Mittermeier, R. A., Mittermeier, C. G., Fonseca, G. A. Bd & Kent, J. Biodiversity hotspots for conservation priorities. Nature 403, 853–858 (2000).
Sarkar, S., Sánchez-Cordero, V., Londoño, M. C. & Fuller, T. Systematic conservation assessment for the Mesoamerica, Chocó, and Tropical Andes biodiversity hotspots: a preliminary analysis. Biodiversity and Conservation 18, 1793–1828 (2009).
Cuesta, F., Peralvo, M. & Valarezo, N.Los Bosques Montanos de los Andes Tropicales. Una Evaluación Regional de su Estado de Conservación y de su Vulnerabilidad a Efectos del Cambio Climático (Programa Regional ECOBONA - INTERCOOPERATION, 2009).
Londoño-Murcia, M. C. & Sánchez-Cordero, V. Distribución y conservación de especies amenazadas en Mesoamérica, Chocó y Andes tropicales. Revista Mexicana de Biodiversidad 82, 926–950 (2011).
Pérez-Escobar, O. A. et al. The origin and diversification of the hyperdiverse flora in the Chocó biogeographic region. Frontiers in Plant Science 10, 1328 (2019).
Croat, T. B. A review of studies of neotropical Araceae. Aroideana 38E, 44–54 (2015).
Arteaga, A. et al. Comparative phylogeography reveals cryptic diversity and repeated patterns of cladogenesis for amphibians and reptiles in Northwestern Ecuador. PLoS ONE 11, e0151746 (2016).
Baquero, R. L. E. & Zuchan, K. Platystele pamelae (Orchidaceae: Pleurothallidinae), a new species from Ecuador. Lankesteriana 17, 245–250 (2017).
Gomez, L. F., Gallego, B. & Naranjo, L.Atlas Socioambiental de las Cuencas Transfronterizas Mira y Mataje: Aportes para su Ordenamiento y Gestión Integral Colombia-Ecuador (Cali: WWF-Colombia, 2017).
Yánez-Muñoz, M. H. et al. A new cryptic species of Anolis lizard from northwestern South America (Iguanidae, Dactyloinae). ZooKeys 794, 135–163 (2018).
Yánez-Muñoz, M. et al. Anfibios en los Ecosistemas Andino-Tropicales de la provincia del Carchi. Publicación Miscelanea número 14. Serie de Publicaciones del INABIO (2020).
Yánez-Muñoz, M. H. et al. A new Andean treefrog (Amphibia: Hyloscirtus bogotensis group) from Ecuador: an example of community involvement for conservation. PeerJ 9, e11914 (2021).
Yánez-Muñoz, M. H., Torres-Carvajal, O., Reyes-Puig, J. P., Urgiles-Merchán, M. A. & Koch, C. A new and very spiny lizard (Gymnophthalmidae: Echinosaura) from the Andes in northwestern Ecuador. PeerJ 9, e12523 (2021).
Reyes-Puig, C., Yánez-Muñoz, M. H., Ortega, J. A. & Ron, S. R. Relaciones filogenéticas del subgénero Hypodictyon (Anura: Strabomantidae: Pristimantis) con la descripción de tres especies nuevas de la región del Chocó. Revista Mexicana de Biodiversidad 91, 913013 (2020).
Reyes-Puig, C. et al. Two extremely rare new species of fossorial salamanders of the genus Oedipina (Plethodontidae) from northwestern Ecuador. PeerJ 8, e9934 (2020).
Brito, J. et al. A new genus of oryzomyine rodents (Cricetidae, Sigmodontinae) with three new species from montane cloud forests, western Andean cordillera of Colombia and Ecuador. PeerJ 8, e10247 (2020).
Zou, F., Bhuiyan, M. A., Crovella, T. & Paiano, A. Analyzing the borderlands: a regional report on the Colombia-Ecuador border on political, economic, social, legal, and environment aspects. International Migration Review 58, 881–897 (2024).
Wieczorek, J. et al. Darwin Core: an evolving community-developed biodiversity data standard. PLoS ONE 7, e29715 (2012).
Allard, S. Special issues in data management. ACS Symposium Series 47–67 (2012).
Morgan-Wall, T. rayshader: Create maps and visualize data in 2d and 3d. https://github.com/tylermorganwall/rayshader (2023).
R Core Team. R: A Language and Environment for Statistical Computing. R Foundation for Statistical Computing, Vienna, Austria (2022).
Jarrín-V., P. & Yánez-Muñoz, M. H. The tropical andes biodiversity hotspot: A comprehensive dataset for the mira-mataje binational basins. Knowledge Network for Biocomplexity https://doi.org/10.5063/F14F1P6H (2024).
McCormack, J. MLZ Bird Collection (Arctos). Version 33.82. Moore Laboratory of Zoology. Occurrence dataset https://knb.ecoinformatics.org/view/doi:10.5063/F1RV0M5K (2023).
Symbiota INABIO https://bndb.sisbioecuador.bio/bndb/ (2021).
A, B. & R, B. KUBI Herpetology Collection. Version 31.76. University of Kansas Biodiversity Institute. Occurrence dataset https://doi.org/10.15468/ubdwdc (2023).
A, B. & R, B. KUBI Ornithology Collection. University of Kansas Biodiversity Institute. Occurrence dataset https://doi.org/10.15468/hlh5ud (2023).
Yánez-Muñoz, M. H. et al. Anfibios en los Ecosistemas Andino-Tropicales de la provincia del Carchi. (Serie de Publicaciones del Gobierno Autónomo Descentralizado Provincial del Carchi y el Instituto Nacional de Biodiversidad. Publicación Miscelánea N°14:1-340. INABIO - GADPC., 2020).
Vellinga, W. Xeno-canto - Bird sounds from around the world. Xeno-canto Foundation for Nature Sounds. Occurrence dataset https://doi.org/10.15468/qv0ksn (2023).
Academy of Natural Sciences. ANSP. Academy of Natural Sciences of Drexel University. Occurrence dataset https://doi.org/10.15468/s55f2k (2023).
Orrell, T. NMNH Extant Specimen Records (USNM, US). Version 1.74. National Museum of Natural History, Smithsonian Institution. Occurrence dataset https://doi.org/10.15468/hnhrg3 (2023).
Borja-Acosta, K. G. & Galeano Muñoz, S. Colección de Anfibios del Instituto de Investigación de Recursos Biológicos Alexander von Humboldt (IAvH-Am). Version 37.6. Instituto de Investigación de Recursos Biológicos Alexander von Humboldt. Occurrence dataset https://doi.org/10.15472/dma300 (2023).
Borja-Acosta, K. G. & Bravo Mora, G. Colección de Aves del Instituto de Investigación de Recursos Biológicos Alexander von Humboldt (IAvH-A). Version 30.7. Instituto de Investigación de Recursos Biológicos Alexander von Humboldt. Occurrence dataset https://doi.org/10.15472/maeoiz (2023).
Borja-Acosta, K. G. & Galeano Muñoz, S. Colección de Reptiles del Instituto de Investigación de Recursos Biológicos Alexander von Humboldt (IAvH-R). Version 23.5. Instituto de Investigación de Recursos Biológicos Alexander von Humboldt. Occurrence dataset https://doi.org/10.15472/bg0gnd (2023).
Borja-Acosta, K. G. & Reyes-Amaya, N. Colección de Mamíferos del Instituto de Investigación de Recursos Biológicos Alexander von Humboldt (IAvH-M). Version 32.11. Instituto de Investigación de Recursos Biológicos Alexander von Humboldt. Occurrence dataset https://doi.org/10.15472/aghjkw (2023).
Calderón-Leyton, J. J., Arcos-Patiño, O. E., Rosero-Calderón, C. D. & Fernández-Gómez, R. A. Colección Ornitológica de la Universidad de Nariño. Version 4.7. Universidad de Nariño. Occurrence dataset https://doi.org/10.15472/dbowdj (2021).
UMMZ Birds Data Group & LSA IT A. University of Michigan Museum of Zoology, Division of Birds. Version 14.63. University of Michigan Museum of Zoology. Occurrence dataset https://doi.org/10.15468/lns1qm (2023).
UMMZ Mammals Data Group & LSA IT A. University of Michigan Museum of Zoology, Division of Mammals. Version 8.65. University of Michigan Museum of Zoology. Occurrence dataset https://doi.org/10.15468/dx3rcj (2023).
Corado, R. & Hall, L. WFVZ Bird Collections. Version 19.3. Western Foundation of Vertebrate Zoology. Occurrence dataset https://doi.org/10.15468/hqrtbv (2023).
Fernández, R. Caracterización Aves IAI-GAICA: Proyecto Cambio Climático. Version 5.1. Asociación GAICA. Occurrence dataset https://doi.org/10.15472/z8ik4j (2018).
Rogers, S. CM Birds Collection. Version 9.3. Carnegie Museums. Occurrence dataset https://doi.org/10.15468/dv1ojv (2022).
Rogers, S. CM Herps Collection. Version 8.5. Carnegie Museums. Occurrence dataset https://doi.org/10.15468/lxe6h4 (2017).
Grant, S., Webbink, K. & Marks, B. Field Museum of Natural History (Zoology) Bird Collection. Version 14.21. Field Museum. Occurrence dataset https://doi.org/10.15468/exkxdx (2022).
Grant, S., Webbink, K., Jones, J. & Ferguson, A. Field Museum of Natural History (Zoology) Mammal Collection. Version 9.18. Field Museum. Occurrence dataset https://doi.org/10.15468/n4zgxw (2020).
Grant, S., Webbink, K. & Resetar, A. Field Museum of Natural History (Zoology) Amphibian and Reptile Collection. Version 12.9. Field Museum. Occurrence dataset https://doi.org/10.15468/u2pzhj (2023).
Mertz, W. LACM Vertebrate Collection. Version 18.13. Natural History Museum of Los Angeles County. Occurrence dataset https://doi.org/10.15468/77rmwd (2023).
Tubaro, P. & Rodríguez, D. Museo Argentino de Ciencias Naturales “Bernardino Rivadavia” (MACN). Ornithology National Collection (MACNOr). Museo Argentino de Ciencias Naturales. Occurrence dataset https://doi.org/10.15468/w4svvx (2022).
Teta, P. & Rodríguez, D. Museo Argentino de Ciencias Naturales “Bernardino Rivadavia” (MACN). Mammalogy National Collection (MACNMa). Museo Argentino de Ciencias Naturales. Occurrence dataset https://doi.org/10.15468/ukdxqp (2022).
Parker, S. LSUMZ Herps Collection. Louisiana State University Museum of Natural Science. Occurrence dataset https://doi.org/10.15468/iogakq (2017).
Esselstyn, J. LSUMZ Mammals Collection. Louisiana State University Museum of Natural Science. Occurrence dataset https://doi.org/10.15468/wxiqf6 (2017).
Cardiff, S. & Mason, N. LSUMZ Birds Collection. Louisiana State University Museum of Natural Science. Occurrence dataset https://doi.org/10.15468/d90jm5 (2021).
Coloma, L. A. & Terán, A. Banco de Vida/ CJ Museum/ Amphibian collection. Version 1.8. Centro Jambatu. Occurrence dataset https://doi.org/10.15468/e5ijd1 (2021).
Calderon, M., Lynch, J., Raz, L. & Agudelo, H. Colección de Herpetología del Instituto de Ciencias Naturales (ICN-MHN-Her). Version 2.4. Universidad Nacional de Colombia. Occurrence dataset https://doi.org/10.15472/cip0kf (2023).
Stiles, F., Raz, L. & Agudelo, H. Colección Ornitológica del Instituto de Ciencias Naturales (ICN-MHN-Or). Version 2.4. Universidad Nacional de Colombia. Occurrence dataset https://doi.org/10.15472/di7zed (2023).
López, H., Raz, L. & Agudelo, H. Colección de Mamíferos del Instituto de Ciencias Naturales (ICN-MHN-Ma). Version 2.16. Universidad Nacional de Colombia. Occurrence dataset https://doi.org/10.15472/lrjbmd (2023).
Scheinberg, L. & Fong, J. CAS Herpetology (HERP). Version 33.108. California Academy of Sciences. Occurrence dataset https://doi.org/10.15468/bvoyqy (2023).
Flannery, M. & Fong, J. California Academy of Sciences: CAS Ornithology (ORN). Version 126.386. California Academy of Sciences. Occurrence dataset https://doi.org/10.15468/ozz2u5 (2023).
Millen, B. Ornithology Collection Non Passeriformes - Royal Ontario Museum. Version 16.13. Royal Ontario Museum. Occurrence dataset https://doi.org/10.15468/s0pqqg (2018).
Millen, B. Ornithology Collection Passeriformes - Royal Ontario Museum. Version 15.10. Royal Ontario Museum. Occurrence dataset https://doi.org/10.15468/ahiyvz (2018).
Solari, S. Colección Teriológica Universidad de Antioquia. Version 1.2. Universidad de Antioquia. Occurrence dataset https://doi.org/10.15472/hkwiav (2021).
Daza Rojas, J. M. Colección de anfibios - Museo de Herpetología de la Universidad de Antioquia. Version 7.3. Universidad de Antioquia. Occurrence dataset https://doi.org/10.15472/zn2bkv (2021).
Daza Rojas, J. M. Colección de Reptiles - Museo de Herpetología de la Universidad de Antioquia. Version 3.2. Universidad de Antioquia. Occurrence dataset https://doi.org/10.15472/z6yhzr (2022).
iNaturalist contributors, iNaturalist. iNaturalist Research-grade Observations. iNaturalist.org. Occurrence dataset https://doi.org/10.15468/ab3s5x (2023).
Tamayo Quintero, J. & Cruz Bernate, L. Colección Ornitológica de la Universidad del Valle. Version 8.2. Universidad del Valle. Occurrence dataset https://doi.org/10.15472/rhzrcp (2021).
Sedano-Cruz, R. E., Gil-Lopez, D. L., Cediel, J. L. & Lopez, A. Colección de anfibios y reptiles, Universidad del Valle (Colección UV-C). Version 14.2. Universidad del Valle. Occurrence dataset https://doi.org/10.15472/4nryow (2021).
Cárdenas Hincapié, J. S. & Lozano Bernal, M. F. Colección de Saurios Museo de La Salle Bogotá (MLS). Version 3.11. Universidad de la Salle. Occurrence dataset https://doi.org/10.15472/1ymc6r (2022).
Cárdenas Hincapié, J. S. Colección de Mamíferos Museo de La Salle Bogotá (MLS). Version 4.5. Universidad de la Salle. Occurrence dataset https://doi.org/10.15472/rpxp1w (2021).
Cárdenas Hincapié, J. S. & Lozano Bernal, M. F. Colección de Quelonios Museo de La Salle Bogotá (MLS). Version 5.6. Universidad de la Salle. Occurrence dataset https://doi.org/10.15472/df7ko5 (2022).
Cárdenas Hincapié, J. S. & Lozano Bernal, M. F. Colección de Ofidios Museo de La Salle Bogotá (MLS). Version 4.5. Universidad de la Salle. Occurrence dataset https://doi.org/10.15472/gnumbe (2022).
Lozano Bernal, M. F. Colección de Anfibios Museo de La Salle Bogotá (MLS). Version 4.8. Universidad de la Salle. Occurrence dataset https://doi.org/10.15472/imntpn (2022).
Posse, V. & Montalvo, E. Colección de Ornitología del Museo de Historia Natural Gustavo Orcés V. Version 1.8.Museo de Historia Natural “Gustavo Orcés V.” Occurrence dataset https://doi.org/10.15468/atvsmk (2023).
Dickey, D. AMNH Herpetology Collections. American Museum of Natural History. Occurrence dataset https://doi.org/10.15468/jfkgyh (2016).
Red Nacional de Observadores de Aves & Naranjo Maury, G. DATAVES. Version 7.5. Red Nacional de Observadores de Aves - RNOA. Occurrence dataset https://doi.org/10.15472/iqnpse (2022).
Colecciones Científicas de la Fundación Miguel Lillo. https://www.lillo.org.ar/colecciones-cientificas (2021).
Natural History Museum. Natural History Museum (London) Collection Specimens. Occurrence dataset https://doi.org/10.5519/qd.zqujp4ov (2023).
Harvard University M & Morris, P. J. Museum of Comparative Zoology, Harvard University. Version 162.397. Museum of Comparative Zoology, Harvard University. Occurrence dataset https://doi.org/10.15468/p5rupv (2023).
Cicero, C. MVZ Bird Collection (Arctos). Version 43.77. Museum of Vertebrate Zoology. Occurrence dataset https://doi.org/10.15468/r1woj3 (2023).
Base de Datos de la Colección de Anfibios del Museo de Zoología QCAZ. Versión 2021.0. https://bioweb.puce.edu.ec/QCAZ/anfibios (2021).
Base de Datos de la Colección de Mamíferos del Museo de Zoología QCAZ. Versión 2021.0. https://bioweb.puce.edu.ec/QCAZ/contenido/InicioMamiferos (2021).
Base de Datos de la Colección de Reptiles del Museo de Zoología QCAZ. Versión 2021.0. https://bioweb.puce.edu.ec/QCAZ/reptiles (2021).
Teisher, J. & Stimmel, H. Tropicos MO Specimen Data. Missouri Botanical Garden. Occurrence dataset https://doi.org/10.15468/hja69f (2023).
León Yánez, S. et al. Colección de plantas vasculares y no vasculares del Herbario de la Universidad Católica del Ecuador (QCA). Version 1.3.Herbario QCA de la Pontificia Universidad Católica del Ecuador. Occurrence dataset https://doi.org/10.15468/hzdeb9 (2023).
Ramirez, J., Watson, K., McMillin, L., Gjieli, E. & Sessa, E. The New York Botanical Garden Herbarium (NY). Version 1.62. The New York Botanical Garden. Occurrence dataset https://doi.org/10.15468/6e8nje (2023).
Borja-Acosta, K. G. & Díaz, A. Colección Herbario Federico Medem Bogotá - FMB. Version 32.2. Instituto de Investigación de Recursos Biológicos Alexander von Humboldt. Occurrence dataset https://doi.org/10.15472/ighftu (2023).
Carnegie Museums. Carnegie Museum of Natural History Herbarium. Occurrence dataset https://doi.org/10.15468/d51v1f (2023).
CSIC-Real Jardín Botánico & Castilla, F. CSIC-Real Jardín Botánico-Colección de Plantas Vasculares (MA). Real Jardín Botánico (CSIC). Occurrence dataset https://doi.org/10.15468/mug7kr (2023).
Grant, S. & von Konrat, M. Field Museum of Natural History (Botany) Seed Plant Collection. Version 11.16. Field Museum. Occurrence dataset https://doi.org/10.15468/nxnqzf (2023).
Lundberg, J. Phanerogamic Botanical Collections (S). Swedish Museum of Natural History. https://doi.org/10.15468/yo3mmu (2023).
Gaisberger, H. & Endresen, D. Bioversity Collecting Mission Database. Version 1.10. Bioversity International. Occurrence dataset https://doi.org/10.15468/ulk1iz (2019).
Borchsenius, F. The AAU Herbarium Database. Herbarium of the University of Aarhus. Occurrence dataset https://doi.org/10.15468/7uigwo (2023).
Teixeira de Souza Chies, T & Prado, J. F. ICN - Herbário do Instituto de Ciências Naturais. Version 1.103. Universidade Federal do Rio Grande do Sul. Occurrence dataset https://doi.org/10.15468/gwuezt (2023).
Naturalista Colombia. https://colombia.inaturalist.org/ (2023).
University of California, Davis. DAV - UC Davis Herbarium. Occurrence dataset https://doi.org/10.15468/on4axg (2023).
Natural History Museum. Natural History Museum (London) Herbarium Collection Specimens. Occurrence dataset. https://doi.org/10.5519/qd.zqujp4ov (2023).
Royal Botanic Gardens, Kew. Royal Botanic Gardens, Kew - Herbarium Specimens. Occurrence dataset https://doi.org/10.15468/ly60bx (2021).
Jennings, L. University of British Columbia Herbarium (UBC) - Vascular Plant Collection. Version 16.12. University of British Columbia. Occurrence dataset https://doi.org/10.5886/rtt57cc9 (2023).
Hopkins, M. & Campos de Oliveira, D. Herbarium - Instituto Nacional de Pesquisas da Amazônia (INPA). Instituto Nacional de Pesquisas da Amazônia - INPA. Occurrence dataset https://doi.org/10.15468/5ictpz (2015).
Martínez Figueroa, Y. M., Arango Vásquez, D., Cardona Naranjo, F. A., Calderón Arias, A. M. & Benavides, A. M. Herbario Universidad de Antioquia (HUA). Version 7.7. Universidad de Antioquia. Occurrence dataset https://doi.org/10.15472/esjdio (2023).
Magnaghi, E. & Fong, J. CAS Botany (BOT). Version 157.523. California Academy of Sciences. Occurrence dataset https://doi.org/10.15468/7gudyo (2023).
Gustafsson, C. & Holston, K. Herbarium GB, University of Gothenburg. Version 1.7. University of Gothenburg. Occurrence dataset https://doi.org/10.15468/asgd85 (2022).
Forzza, R., Silva, L. A. & Dalcin, E. RB - Rio de Janeiro Botanical Garden Herbarium Collection. Version 84.252. Instituto de Pesquisas Jardim Botanico do Rio de Janeiro. Occurrence dataset https://doi.org/10.15468/7ep9i2 (2023).
Labiak Evangelista, P. H. & Goldenberg, R. UPCB herbarium - Universidade Federal de Paraná - Herbário Virtual REFLORA. Version 1.185. Instituto de Pesquisas Jardim Botanico do Rio de Janeiro. Occurrence dataset https://doi.org/10.15468/fpf5j6 (2023).
University of Vienna - Herbarium WU. University of Vienna, Institute for Botany - Herbarium WU. Occurrence dataset https://doi.org/10.15468/tnj8wm (2023).
Gall, L. Botany Division, Yale Peabody Museum. Yale University Peabody Museum. Occurrence dataset https://doi.org/10.15468/hrztgn (2023).
Australia’s Virtual Herbarium. National Herbarium of Victoria (MEL) AVH data. Occurrence dataset https://doi.org/10.15468/rhzrxw (2023).
Arizona State University Biocollections. Arizona State University Vascular Plant Herbarium. Occurrence dataset https://doi.org/10.15468/a2o8vy (2023).
Yatskievych, G. Plant Resources Center. Version 1.123. University of Texas at Austin, Biodiversity Collections. Occurrence dataset https://doi.org/10.15468/g85t8z (2023).
Zuluaga, A., Arango Vásquez, D, Martínez, A. L. & Cardona, M. E. Colección de referencia del Herbario CUVC de la Universidad del Valle. Version 8.6. Universidad del Valle. Occurrence dataset https://doi.org/10.15472/lapeh5 (2023).
Kennedy, J. Harvard University Herbaria: All Records. Harvard University Herbaria. Occurrence dataset https://doi.org/10.15468/o3pvnh (2023).
Conservatoire et Jardin botaniques de la Ville de Genève - G. Geneva Herbarium - General Collection (G). Occurrence dataset https://doi.org/10.15468/rvjdu1 (2023).
Viana, P. Museu Paraense Emilio Goeldi Herbarium. Version 4.2. Museu Paraense Emílio Goeldi. Occurrence dataset https://doi.org/10.15468/igjr8k (2020).
Cordi, L. UEC - Herbário da Universidade Estadual de Campinas. Version 1.103. Universidade Estadual de Campinas - Instituto de Biologia. Occurrence dataset https://doi.org/10.15468/4hnsz8 (2023).
Base de Datos del Herbario QCA. Versión 2021.0. https://bioweb.puce.edu.ec/QCA/ (2021).
Jarrín, P. & Yánez-Muñoz, M. H. Dynamic 3D shaded-relief map renders used in the publication: The Tropical Andes Biodiversity Hotspot: A Comprehensive Dataset for the Mira-Mataje Binational Basins. Figshare https://doi.org/10.6084/m9.figshare.23499180.v3 (2023).
Heumann, I., MacKinney, A. G. & Buschmann, R. Introduction: the issue of duplicates. The British Journal for the History of Science 55, 257–278 (2022).
Goodwin, Z. A., Harris, D. J., Filer, D., Wood, J. R. & Scotland, R. W. Widespread mistaken identity in tropical plant collections. Current Biology 25, R1066–R1067 (2015).
Cornish, C. & Driver, F. ‘Specimens Distributed’ The circulation of objects from Kew’s Museum of Economic Botany, 1847-1914. Journal of the History of Collections 32, 327–340 (2019).
Cotterill, F. P. D. Toward exorcism of the ghost of W. T. Thistleton Dyer: a comment on “over duplication” and the scientific properties, uses and values of natural science specimens. TAXON 48, 35–39 (1999).
Wickham, H. et al. Welcome to the tidyverse. Journal of Open Source Software 4, 1686 (2019).
Scott Chamberlain & Eduard Szocs. taxize - taxonomic search and retrieval in r. F1000Research https://f1000research.com/articles/2-191/v2 (2013).
Chamberlain, S. et al. taxize: Taxonomic information from around the web https://github.com/ropensci/taxize. R package version 0.9.98 (2020).
Bánki, O. et al. Catalogue of life checklist (version 2024-03-26), https://doi.org/10.48580/dfz8d (2024).
Govaerts, R., Lughadha, E. N., Black, N., Turner, R. & Paton, A. The World Checklist of Vascular Plants, a continuously updated resource for exploring global plant diversity. Scientific Data 8, 215 (2021).
The World Flora Online Consortium et al. World flora online plant list december 2023 https://doi.org/10.5281/zenodo.10425161 (2023).
Turland, N. et al. International Code of Nomenclature for algae, fungi, and plants (Shenzhen Code) adopted by the Nineteenth International Botanical Congress Shenzhen, China, July 2017. http://hdl.handle.net/10141/622572 (Koeltz Botanical Books, 2018).
Cardoso, D. et al. Amazon plant diversity revealed by a taxonomically verified species list. Proceedings of the National Academy of Sciences 114, 10695–10700 (2017).
Acknowledgements
We thank the John D. & Catherine T. MacArthur Foundation for supporting the project titled “Biodiversity Conservation in the Mira and Mataje Rivers’ Basins”, without which this work would not have been possible. We extend our gratitude to our colleagues at INABIO, Pablo Sánchez, Diana Fernández, Glenda Pozo, Gabriela Lagla, and Julio Carrión for their valuable comments and suggestions. We acknowledge the efforts of over 1,258 collectors and scientists from 13 countries that have contributed to documenting the biodiversity of the Mira-Mataje binational basins. We must also thank two anonymous reviewers and our editor for their supporting insights to improve the quality of our work. We extend due credit to Melisa Ingala (Piper), Alexander Schmidt-Lebuhn (Anthurium), Olegivvit (Pteridium), Gabriela Palomo-Munoz (Dasypus), and José Carlos Arenas-Monroy (Sarcohyla) for the use of silhouette images under CC BY 3.0 or CC BY-NC-SA 3.0 licenses in www.phylopic.org. Our special recognition to INaturalist, BioWeb (Pontificia Universidad Católica del Ecuador), the Global Biodiversity Information Facility (GBIF), and Tropicos (Missouri Botanical Garden). Their initiatives of open data have enabled us to elaborate this study.
Author information
Authors and Affiliations
Contributions
M.Y.M., J.B.M., and D.J.I. designed the research. P.J.V. provided technical assistance, analysis, and design for the final version of the manuscript. R.R.-RR designed and applied the data cleaning strategy and algorithm in R code. P.A.B., J.G.P., D.J.I., W.K., F.P.A., G.R., M.Y.M., and Z.M. coordinated the binational data collection initiative. M.Y.M., C.P., P.B.M., Z.M., J.M.A., and E.C. compiled the data. M.Y.M., J.B.M., E.F., C.G., P.M.V., M.A.U., L.M.R., R.M., L.E.L., Y.M., Y.D.M., E.N., C.O.P., R.F., L.E.R., Z.Q., and L.V., validated the flora and fauna information of the database. M.Y.M, P.B.M., C.P., and E.C. systematized the first database version. M.Y.M., J.B.M., and D.J.I. wrote the first version of the manuscript with inputs from W.M., S.P., and P.A.B. M.Y.M, G.N., and C.V. coordinated the survey and consolidation of the geodatabase. All authors contributed substantially to providing data and checking the information on the distribution and status of the species.
Corresponding author
Ethics declarations
Competing interests
The authors declare no competing interests.
Additional information
Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Supplementary information
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Yánez-Muñoz, M.H., Jarrín-V., P., Brito M., J. et al. The Tropical Andes Biodiversity Hotspot: A Comprehensive Dataset for the Mira-Mataje Binational Basins. Sci Data 11, 782 (2024). https://doi.org/10.1038/s41597-024-03463-1
Received:
Accepted:
Published:
DOI: https://doi.org/10.1038/s41597-024-03463-1