Abstract
Trait datasets are increasingly being used in studies investigating eco-evolutionary theory and global conservation initiatives. Reptiles are emerging as a key group for studying these questions because their traits are crucial for understanding the ability of animals to cope with environmental changes and their contributions to ecosystem processes. We collected data from earlier databases, and the primary literature to create an up-to-date dataset of reptilian traits, encompassing 40 traits from 12060 species of reptiles (Archelosauria: Crocodylia and Testudines, Rhynchocephalia, and Squamata: Amphisbaenia, Sauria, and Serpentes). The data were gathered from 1288 sources published between 1820 and 2023. The dataset includes morphological, physiological, behavioral, and life history traits, as well as information on the availability of genetic data, IUCN Red List assessments, and population trends.
Similar content being viewed by others
Background & Summary
Species traits are fundamental to macroecological and macroevolutionary investigations. Trait datasets allow for integrating a diverse range of physiological, ecological, morphological, and life history data to explore organismal ecology and evolution1,2,3,4. Comparative studies regularly use trait data to study topics such as animal physiology, ecology, and behaviour. These studies have a rich history of aggregating extensive trait datasets and examining diverse hypotheses at the species level. Analyses may focus, for example, on factors influencing rapid morphological diversification, or the role of divergent adaptation in speciation5,6,7,8. The consolidation of trait data into comprehensive databases enhances ongoing research efforts by centralizing scattered information into a unified repository. Unrestricted access to such a repository has the potential to significantly streamline future investigations into animal diversity, ecology, evolution, and conservation.
Reptiles are a highly interesting group of animals that demonstrate a strong sensitivity to environmental factors, including temperature, precipitation, landscape features, and soils9,10,11,12,13,14. A growing body of literature describes the physiological, morphological, and performance traits of reptiles15,16,17,18,19, contributing to global analyses that aim to enhance our understanding of the life history and evolution of these creatures. Recently, Meiri20 published a comprehensive database containing basic physiological and ecological traits for lizards. However, this database excludes other groups of reptiles, such as archelosaurs (crocodiles and turtles), Rhynchocephalia (the tuatara), and other squamates (i.e. snakes). While ecological databases have been published for turtles, snakes, and crocodiles over the past decades, they are often small and limited to specific species or countries21,22,23,24.
Therefore, we have compiled a database summarizing a vast amount of data for all reptiles, including amphisbaenians, lizards, crocodiles, snakes, and turtles. This database is designed to be user-friendly and easily updated as the literature grows. In this article, we present this dataset, which contains ecological and physiological traits of all reptile species (Archelosauria: Crocodylia and Testudines, and Lepidosauria: Rhynchocephalia, and Squamata: Amphisbaenia, Sauria, and Serpentes). We collected data for 40 traits central to many ecological questions. For most traits, we incorporated lizard data from Meiri20 published database and added new data not covered in that dataset from various literature sources. Our data collection involved published and online databases, as well as primary and secondary literature25,26,27. The Reptile Database28 served as our taxonomic backbone. We anticipate that making this data available will reveal both gaps and errors inherent in compiling a dataset of this size, enabling efforts to address these shortcomings.
Methods
We compiled a dataset for 12060 species following the taxonomy in the Reptile Database28, including 40 physiological, morphological, ecological and behavioural traits, and habitat variables (Table S1). We divided traits into six categories: habitat; behaviour; morphology; ife history; physiology; and conservation (see Fig. 2). We collected data for four reptile Orders: Crocodylia (alligators and crocodiles, n = 27 species), Testudines (turtles and tortoises, n = 361 species); Rhynchocephalia (tuatara, n = 1 species), and Squamata, comprised of three Sub-Orders: Sauria (lizards, n = 7415 species), Amphisbaenia (n = 202 species), and Serpentes (snakes, n = 4073 species). We used the “one-row-per-species” format because information on within-species variation is very limited for most species. Our dataset compilation consisted of several steps. First, we identified sources of trait data. Second, we manually extracted the data and transcribed them into a comma-separated values (CSV) file (a “raw” data file) and retained the measurement units as published. Then, we read all raw data files, checked data quality and combined them into a single Excel file of standardised observations and units of measurement. Finally, we performed additional data quality checks on the standardised observations, correcting processing errors and checking for additional issues (see Figure S1).
We searched for additional species-specific data in literature published between 1974 and 2023 using Google Scholar (https://scholar.google.com) and Web of Science (https://www.webofscience.com). First, we searched published databases. In the search phrases, we combined taxon names (Class or Order) with one of the following keywords: reptil*, squamat*, lizard*, snake*, turtle*, testudin*, tortois*, tuatara*, crocodil*, alligator*. In total, we used 16 published databases and four online databases25,26,27. In addition, we considered all citations in published databases and major reviews and included any additional papers. We have added primary sources from published databases for sources of individual trait values. If primary sources were not listed for a species, we cited only the database. We searched separately for some species of reptiles that had unclear information. In Google Scholar and Web of Science we searched for the species name (e.g., Ablepharus alaicus*”). We focused on Crocodylia, Testudines and Serpentes species because most of the data for Sauria and Amphisbaenia were published in Meiri20. However, if new information for lizards was found during this search, these data also were added to our dataset. We added the higher-level taxonomic classification (family, order) following the Reptile Database28. If a species with data could not be identified automatically, we corrected the entry manually after searching for relevant synonyms in the reptile database28. We contacted the Reptile Database team and received permission for using their data under an open license from the original data generators. We translated sources from languages other than English when possible. Papers that were inaccessible, or written in languages we could not translate, were excluded. Our review involved examining approximately 2000 sources from primary scientific literature, books, public journals, and online resources. Reviews and meta-analyses guided the selection of appropriate papers for extracting original data. After reading the title and abstract of each article, we decided whether to read the entire article and extract data from it, based on whether the paper reported species-specific information on the ecological traits we were trying to find (see Figure S1). We focused on papers that provided species-specific information on the ecological traits under investigation. From 1288 of the 2000 sources we extracted data. Data extraction involved reviewing text, online supplementary materials, or tables within each source. For species represented in multiple rows (e.g., appearing with different names, subspecies, or data sources), we consolidated the information into a single row (see details below). This approach aimed to present a unified representation of consensus data for each species. The phylogenetic tree was drawn using “ggtree” package29,30 for all species of the Class Reptilia31,32,33.
Data Records
We amassed data for a total of 12060 species, belonging to 1255 genera, 92 families, and four orders (included six major groups: the orders Testudines, Crocodilia, and Rhynchocephalia and the three sub-orders of Squamata: Amphisbaenia, Sauria, and Serpentes). We include 1288 data sources in the dataset, with 1284 sources coming from published scientific literature (including books and published databases) and four online databases25,26,27. Missing data were coded as ‘NA’. We created an excel spreadsheet containing both the dataset content and the column descriptions as separate worksheets (Table 1: individual trait values; Table 2: sources of individual trait values.; Table 3, citations; Table 4, trait definitions). The dataset is provided as an Excel file named ReptTraits dataset v1-1.xlsx in the Figshare repository34.
Our dataset includes eight types of taxonomic data and metadata, and 40 traits as follows: Species, Order, Suborder, Family, Genus, Description author/s, Description year, Subspecies, main biogeographic region, microhabitat, habitat type, minimal and maximal elevation, mean annual temperature, temperature seasonality, precipitation seasonality, insular/endemic, venomous, diet, active time, dorsal colour and pattern, foraging mode, pupil shape, fangs (front-fanged, non-fanged or rear-fanged), maximum longevity, maximum body mass and maximum length (TL, SVL and SCL), hatchling/neonate mass, reproductive mode, sex-determining mechanism (GSD or TSD), mean number of offspring per litter or number of eggs per clutch, smallest and largest clutch size, number of litters or clutches produced per year, egg length and width, mean, minimum and maximum body temperature (Tb) (in the field), genetic data (whether these exist on GenBank or not), IUCN red list assessments, and IUCN population trends (e.g., Table S1). We present each type of data in a column, or set of columns, that can be instantly used for analyses. We collected length data separately for males, females, juvenile and unsexed individuals. Most mass data are based on lengths, transformed to masses based on taxon-specific equations (accounting for the degree of limb loss in relevant lineages)15,35. We used only maximum values for longevity, body mass, Total length, snout vent length (SVL), and carapace length (CL; of turtles). For some traits, we average the minimum and maximum reported means values (e.g., clutch size, body size, Tb). We collected “Maximum length SVL”, “Maximum female SVL”, “Maximum male SVL” or “Maximum juvenile SVL” data for Crocodylia, Rhynchocephalia, and Squamata. Testudine measurements of the same columns have different meanings: they are Carapace lengths. If we had more than one mean for a specific trait for a given species, we averaged the smallest and highest reported means. When means were unavailable, we averaged the minimum and maximum reported values20. Unfortunately, due to the lack of data, we could not collect the maximum, minimum and average values for all traits, so some traits have only the maximum or average values Our dataset can easily be reproduced, updated, and expanded to include a wider range of species, other taxa or traits.
Technical Validation
We thoroughly examined the dataset to ensure variable consistency, including accurate species and trait naming. We assessed data integrity by identifying outliers and verifying correct data types and consistent use of units. We updated species binomials according to the latest version of the reptile database28. Data sources that posed challenges in interpretation, lacked extractable raw data, or relied on data imputation, were excluded from the dataset (see Figure S1).
Quality control measures included generating plots and scrutinizing them for outliers. When we identified issues during standardization or quality control, we first verified whether they stemmed from transcription errors by comparing the raw file to the source data. If this was the case, we corrected the data. Otherwise, we investigated whether the problem originated from a standardization step failure, such as a programming error, and rectified the standardization scripts accordingly. In cases where an error persisted, we examined the source paper for potential issues, such as incorrect units or misplaced decimal points. In those few cases when inconsistencies occurred, we decided on a solution based on double-checking the original sources and mutual agreement between the first, second and last authors. If no solution was found, we deleted the datum.
Usage Notes
Compared with previously published databases21,22,23,24 our dataset is bigger (greater number of species and traits), thereby advancing the current state of knowledge in the field. However, we acknowledge certain limitations inherent in our dataset (and others), including taxonomic (Figs. 1, 2) and geographic biases in sampling. The data gaps identified in our study are regrettably common (Figs. 1, 2) and pertain to taxa (e.g., Amphisbaenia, Dibamidae) that are notably rare or challenging to study (‘Linnean Shortfall’) due to biological constraints (e.g., fossoriality). Similarly, the faunas of some regions are difficult to access or study (e.g., in certain war-torn regions and regions with poor transportation infrastructure; ‘Wallacean Shortfall’). Additionally, some gaps are attributed to difficulties in accessing scientific literature, particularly due to language barriers and related citation indexing challenges36.
The dataset can also easily be expanded and corrected if errors are identified. We encourage researchers to let us know if they find any error in our dataset or if they publish new data that should be included in future versions. Users can utilize the supplied data to compile and standardise the dataset with different standardisation parameters or output units. The data descriptor was peer reviewed in 2023 based on the data available on the platform at the time.
References
Westoby, M. & Wright, I. J. Land-plant ecology on the basis of functional traits. Trends Ecol. Evol. 21, 261–268 (2006).
Chown, S. L. & Gaston, K. J. Body size variation in insects: a macroecological perspective. Biol. Rev. Camb. Philos. Soc. 85, 139–169 (2010).
Parr, C. L. et al. Global Ants: a new database on the geography of ant traits (Hymenoptera: Formicidae). Insect Conserv. Divers. 10, 5–20 (2017).
Le Boulch, M., Déhais, P., Combes, S. & Pascal, G. The MACADAM database: a MetAboliC pAthways DAtabase for Microbial taxonomic groups for mining potential metabolic capacities of archaeal and bacterial taxonomic groups. Database 2019, baz049 (2019).
Anderson, S. A. S. & Weir, J. T. The role of divergent ecological adaptation during allopatric speciation in vertebrates. Science 378, 1214–1218 (2022).
Briscoe, N. J. et al. Mechanistic forecasts of species responses to climate change: The promise of biophysical ecology. Glob. Chang. Biol. 29, 1451–1470 (2022).
Crouch, N. M. A. & Tobias, J. A. The causes and ecological context of rapid morphological evolution in birds. Ecol. Lett. 25, 611–623 (2022).
Pilowsky, J. A., Colwell, R. K., Rahbek, C. & Fordham, D. A. Process-explicit models reveal the structure and dynamics of biodiversity patterns. Sci. Adv. 8, 2271 (2022).
Addo-Bediako, A., Chown, S. L. & Gaston, K. J. Thermal tolerance, climatic variability and latitude. Proc. Royal Soc. B 267, 739–745 (2000).
Boher, F., Trefault, N., Estay, S. A. & Bozinovic, F. Ectotherms in variable thermal landscapes: A physiological evaluation of the invasive potential of fruit fly species. Front. Physiol. 7, 624–6 (2016).
Bozinovic, F., Sabat, P., Rezende, E. L. & Canals, M. Temperature variability and thermal performance in ectotherms: Acclimation, behaviour, and experimental considerations. Evol. Ecol. Res. 17, 111–124 (2016).
Folguera, G. et al. An experimental test of the role of environmental temperature variability on ectotherm molecular, physiological and life-history traits: Implications for global warming. Comp. Biochem. Phys. A 159, 242–246 (2011).
Du, W. G. & Ji, X. The effects of incubation thermal environments on size, locomotor performance and early growth of hatchling soft-shelled turtles, Pelodiscus. sinensis. J. Therm. Biol. 28, 279–286 (2003).
Noble, D. W. A., Stenhouse, V. & Schwanz, L. E. Developmental temperatures and phenotypic plasticity in reptiles: a systematic review and meta‐analysis. Biol. Rev. 93, 72–97 (2018).
Feldman, A., Sabath, N., Pyron, R. A., Mayrose, I. & Meiri, S. Body sizes and diversification rates of lizards, snakes, amphisbaenians and the tuatara. Glob. Ecol. Biogeogr. 25, 187–197 (2016).
Noble, D. et al. A comprehensive database of thermal developmental plasticity in reptiles. Sci Data 5, 180138 (2018).
Slavenko, A. et al. Global patterns of body size evolution in squamate reptiles are not driven by climate. Glob. Ecol. Biogeogr. 28(4), 471–483 (2019).
Zimin, A. et al. A global analysis of viviparity in squamates highlights its prevalence in cold climates. Glob. Ecol. Biogeogr. 31, 2437–2452 (2022).
Nemesházi, E. & Bókony, V. HerpSexDet: the herpetological database of sex determination and sex reversal. Sci Data 10, 377 (2023).
Meiri, S. Traits of lizards of the world: Variation around a successful evolutionary design. Glob. Ecol. Biogeogr. 27(10), 1–5 (2018).
Feldman, A. & Meiri, S. Length–mass allometry in snakes. Biol. J. Linn. Soc. 108(1), 161–172 (2013).
Feldman, A. et al. The geography of snake reproductive mode: a global analysis of the evolution of snake viviparity. Glob. Ecol. Biogeogr. 24, 1433–1442 (2015).
Harrington, S. M. et al. Habits and characteristics of arboreal snakes worldwide: arboreality constrains body size but does not affect lineage diversification. Biol. J. Linn. Soc. 125(1), 61–71 (2018).
Stuginski, D. R. et al. Phylogenetic analysis of standard metabolic rate of snakes: a new proposal for the understanding of interspecific variation in feeding behavior. J Comp Physiol B. 188, 315–323 (2018).
NCBI Sequence Read Archive https://www.ncbi.nlm.nih.gov/ (2023).
SnakeDB http://snakedb.org/ (2012).
Fry, B.G. Snakes Venom LD50 – List of the Available Data and Sorted by Route of Injection http://www.venomdoc.com, (2012).
Uetz, P., Freed, P, Aguilar, R., Reyes, F. & Hošek, J. The Reptile Database http://www.reptile-database.org (2023).
Yu, G. Using ggtree to visualize data on tree-like structures. Curr. Protoc. Bioinformatics 69, e96 (2020).
R Core Team. R: A language and environment for statistical computing, version 4.3.2. https://www.R-project.org/ (2023).
Gumbs, R. et al. Global priorities for conservation of reptilian phylogenetic diversity in the face of human impacts. Nat. Commun. 11, 2616 (2020).
Thomson, R. C., Spinks, P. Q. & Shaffer, H. B. A global phylogeny of turtles reveals a burst of climate-associated diversification on continental margins. Proc. Natl. Acad. Sci. USA 118(7), e2012215118 (2021).
Kumar, S. et al. TimeTree 5: An Expanded Resource for Species Divergence Times. Mol Biol Evol. 39(8), msac174 (2022).
Oskyrko, O., Mi, C., Meiri, S. & Du, W. ReptTraits: a comprehensive dataset of ecological traits in reptiles. figshare https://doi.org/10.6084/m9.figshare.24572683 (2024).
Meiri, S. Endothermy, offspring size and evolution of parental provisioning in vertebrates. Biol. J. Linn. Soc. 128(4), 1052–1056 (2019).
Amano, T., González-Varo, J. P. & Sutherland, W. J. Languages are still a major barrier to global science. PLOS Biology 14, e2000933 (2016).
Acknowledgements
This research is funded by the National Natural Science Foundation of China (32300420, 32030013 and 32330067). O.O. was supported by the ANSO Scholarship for Young Talents (№ 2022ANP10120).
Author information
Authors and Affiliations
Contributions
O.O., S.M. collected the data and verified them; C.M. concept the idea; O.O., C.M. performed the analyses; C.M. W.D. supervised this study; O.O., C.M., S.M., W.D. wrote, reviewed, and approved the manuscript.
Corresponding author
Ethics declarations
Competing interests
The authors declare no competing interests.
Additional information
Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Supplementary information
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Oskyrko, O., Mi, C., Meiri, S. et al. ReptTraits: a comprehensive dataset of ecological traits in reptiles. Sci Data 11, 243 (2024). https://doi.org/10.1038/s41597-024-03079-5
Received:
Accepted:
Published:
DOI: https://doi.org/10.1038/s41597-024-03079-5