Mining data from legacy taxonomic literature and application for sampling spiders of the Teutamus group (Araneae; Liocranidae) in Southeast Asia

Rivera-Quiroz, F. Andres; Petcharad, Booppa; Miller, Jeremy A.

doi:10.1038/s41598-020-72549-8

Download PDF

Article
Open access
Published: 25 September 2020

Mining data from legacy taxonomic literature and application for sampling spiders of the Teutamus group (Araneae; Liocranidae) in Southeast Asia

F. Andres Rivera-Quiroz^1,2,
Booppa Petcharad³ &
Jeremy A. Miller^1,4

Scientific Reports volume 10, Article number: 15787 (2020) Cite this article

2685 Accesses
4 Citations
5 Altmetric
Metrics details

Subjects

Abstract

Taxonomic literature contains information about virtually ever known species on Earth. In many cases, all that is known about a taxon is contained in this kind of literature, particularly for the most diverse and understudied groups. Taxonomic publications in the aggregate have documented a vast amount of specimen data. Among other things, these data constitute evidence of the existence of a particular taxon within a spatial and temporal context. When knowledge about a particular taxonomic group is rudimentary, investigators motivated to contribute new knowledge can use legacy records to guide them in their search for new specimens in the field. However, these legacy data are in the form of unstructured text, making it difficult to extract and analyze without a human interpreter. Here, we used a combination of semi-automatic tools to extract and categorize specimen data from taxonomic literature of one family of ground spiders (Liocranidae). We tested the application of these data on fieldwork optimization, using the relative abundance of adult specimens reported in literature as a proxy to find the best times and places for collecting the species (Teutamus politus) and its relatives (Teutamus group, TG) within Southeast Asia. Based on these analyses we decided to collect in three provinces in Thailand during the months of June and August. With our approach, we were able to collect more specimens of T. politus (188 specimens, 95 adults) than all the previous records in literature combined (102 specimens). Our approach was also effective for sampling other representatives of the TG, yielding at least one representative of every TG genus previously reported for Thailand. In total, our samples contributed 231 specimens (134 adults) to the 351 specimens previously reported in the literature for this country. Our results exemplify one application of mined literature data that allows investigators to more efficiently allocate effort and resources for the study of neglected, endangered, or interesting taxa and geographic areas. Furthermore, the integrative workflow demonstrated here shares specimen data with global online resources like Plazi and GBIF, meaning that others can freely reuse these data and contribute to them in the future. The contributions of the present study represent an increase of more than 35% on the taxonomic coverage of the TG in GBIF based on the number of species. Also, our extracted data represents 72% of the occurrences now available through GBIF for the TG and more than 85% of occurrences of T. politus. Taxonomic literature is a key source of undigitized biodiversity data for taxonomic groups that are underrepresented in the current biodiversity data sphere. Mobilizing these data is key to understanding and protecting some of the less well-known domains of biodiversity.

Phylogenomics and the rise of the angiosperms

Article Open access 24 April 2024

Isotopic evidence of high reliance on plant food among Later Stone Age hunter-gatherers at Taforalt, Morocco

Article Open access 29 April 2024

Network of large pedigrees reveals social practices of Avar communities

Article Open access 24 April 2024

Introduction

In the aggregate, traditional taxonomic publications can be thought of as a repository that has accumulated vast amounts of biological data linked to specific taxonomic names. These units of taxonomic knowledge, information linked to a name within a publication, are known as taxonomic treatments^1,2,3. This makes taxonomic literature not only crucial for the exchange and growth of biodiversity knowledge, but also capable of being used to detect and understand larger biodiversity patterns with historical perspective.

In recent years, great efforts have gone into the digitization of legacy taxonomic literature^4,5,6. This combined with digital publications have greatly improved access to taxonomic literature. Nevertheless, although easy to share, PDF publications still have most biodiversity data embedded in strings of text making them less dynamic and difficult or impossible to read and analyze without a human interpreter⁷. This difficulty to access and use core specimen data is what we define as PDF prison⁸. Recently developed tools allow text in PDF documents to be interpreted and categorized in XML format (mark-up) allowing information to be mobilized, aggregated and reanalyzed^9,10,11,12. Plazi Treatment Bank^8,13,14, is a project dedicated to creating a comprehensive compendium of taxonomic and biological data extracted from primary literature¹⁵. This platform permits mined treatment data to be accessed, queried, compared, and reused in a customized way. The strategy for data extraction can be prospective: where journals generate new data in XML format that can be uploaded directly to repositories (as has been implemented by Zookeys² and EJT^8,13). or retrospective: where data is mined from legacy taxonomic literature^3,11,12,13 through a process called semantic enhancement^9,13. This retrospective approach is more complicated and time consuming since the semi-automatic process of text recognition and tagging needs to be checked by a human operator^3,15. However, it can provide useful information by extracting, integrating and using biodiversity data contained in the hundreds of years of accumulated taxonomic literature. Data integration is achieved by linking records from Plazi treatment bank to the Global Biodiversity Information Facility (GBIF)^8,16 where they are aggregated with other type of records, mainly natural history institution specimen collections and observation data based on GBIF’s taxonomic backbone¹⁷.

Here we combined several of these cybertaxonomic tools to test the data extraction process and its potential application on the design and planning of an expedition to collect fresh material in the field. We targeted the ground spider Teutamus politus Thorell 1,890 and its relatives from the so called Teutamus group (TG) (Araneae, Liocranidae)¹⁸. This group of spiders is mostly distributed in Southeast Asia^{19,20,21,22,23} and is composed of seven genera: Jacaena, Koppe, Oedignatha, Sesieutes, Sphingius, Sudharmia and Teutamus¹⁸. These spiders have been cataloged in the family Liocranidae; however, their phylogenetic relationships, biology and evolution are still poorly understood^18,24. Therefore, collection of fresh specimens of the target taxa was necessary for building a molecular phylogeny of the TG. The species T. politus, besides being the type species of the genus Teutamus, is an example of the extremely rare phenomenon of directional genital asymmetry²⁵. For this reason, the collection of live adult specimens was crucial to study, document, and test the behavioral implications of their abnormal genital morphology.

Our study aimed to highlight the importance of making biodiversity data contained within taxonomic treatments accessible and reusable in accordance with the FAIR data principles²⁶. This approach can help bridge gaps and focus efforts in the study of particularly interesting taxa or geographic regions. The usability of taxonomic literature data, potential applications, and its limitations and biases are discussed.

Results

Literature data analysis

Data extracted from 55 analyzed publications represent in total 23 genera and ca. 160 species of the family Liocranidae with ca. 3,000 specimens collected worldwide (Fig. 1a). A visual summary of the data extraction process and data display in Plazi’s Treatment Bank and GBIF can be found in Supplementary Figure 1. These include treatments of all currently valid genera and 90 species of the TG based on 1,309 specimens; out of 137 currently valid species²⁷. The TG was mostly distributed in East and Southeast Asia (Fig. 1b) with the exception of two species of the genus Oedignatha found in the Seychelles. Within SEA, six genera of the TG have a broad distribution being reported from India and the southern region of mainland Asia to the Malay Archipelago (Fig. 1c–e,g–h). Two exceptions are Jacaena that has not been reported south of Thailand (Fig. 1f) and Sudaharmia that has only been reported within Indonesia (Fig. 1i). Indonesia (Six genera, 386 specimens), Thailand (Five, 351) and Malaysia (Four, 212) were the countries with a highest richness and abundance of TG spiders accounting for 72.5% of all the TG records (Fig. 2a). Thailand was the country that combined most occurrences of the TG genera and T. politus having 66% of all the known specimens of this species reported in literature. Within Thailand, the best sampled province is Chiang Mai accounting for 35% of all the TG specimen records for the country. Other relatively well known provinces were Krabi, Nakhon Ratchasima and Phuket, adding up to 30% of the country records (Fig. 2a). Chiang Mai had reports of four TG genera and 11 species, Krabi and Phuket had relatively less representation of the TG; however, these two provinces had 66 of the 68 specimens of T. politus recorded for the country.

The majority of species treatments that we semantically enhanced contained collecting dates that allowed us to plot temporal distribution of the group within Thailand. Most specimens were collected between 1980 and 2009. These dates together with collecting locations allowed us to plot the known temporal and geographic distribution of our target taxon (Fig. 2b). For instance, most collecting is concentrated between May and December, with February and March being the least represented months. Similarly, Indonesia, Malaysia and Thailand are the best sampled countries in Southeast Asia. From an historical perspective, Indonesia was clearly the most sampled area during the 80 s and Malaysia during the 90 s, with more heterogeneous and international records appearing during the 2000s.

Total monthly abundances suggest that adults of the TG are mostly found in between June and July, and October to January (Fig. 3a). A more detailed visualization at genus level shows that most TG genera have similar seasonal variations, with the exception of Teutamus that is most common between June and July (Fig. 3a). The species T. politus has adults reported mostly between June and July, and some specimens from September to December but none have been recorded between January and May (Fig. 3b).

Fieldwork

Our sampling produced 134 adult liocranid specimens from the following genera: Jacaena (3), Oedignatha (32), Sesieutes (3), Sphingius (1), Teutamus (95) (Table 1). Some juvenile specimens of Oedignatha and Teutamus could be matched to adults in the same sample and assigned to the same species adding up to a total of 229 identified specimens of the Liocranidae. We found four species of the TG in Chiang Mai: Jacaena lunulata, Oedignatha barbata, O. jocquei, and Sphingius cf. vivax; three species in Phuket: O. spadix, Sesieutes cf. minuatus, and Teutamus politus; and two species in Krabi: O. sp. and T. politus. Most of them were represented by males and females with the exception of J. lunulata and S. cf. vivax, where only males were found. These two, along with O. barbata and O. sp., were the rarest species having three or fewer individuals in our sample. The most abundant species were O. spadix and T. politus with 21 and 95 adults respectively.

Table 1 Records of Teutaumus group (TG) species from three Thai provinces.

Full size table

Discussion

Literature data analysis

Detecting and understanding biodiversity patterns require large amounts of high quality data. In recent years global databased like GBIF and Plazi have set standards for collection, curation and dissemination of these biological data. GBIF, the largest biodiversity data repository, has aggregated digitized specimen records from many of the world’s most important biodiversity collections institutions. In addition, records from observation networks such as iNaturalist are aggregated on GBIF. However, legacy taxonomic literature as a source of biodiversity data has remained relatively unexplored until recent years. Taxonomic literature holds a vast amount of high-quality biodiversity data^12,49,50. Like data from institutional collections and unlike data from observations networks, these data typically point to specimen objects archived in a natural history institution. Such records have the potential to be re-evaluated in a way that records from observation networks cannot be. It is worth noting that many specimens cited in the taxonomic literature, although archived in a natural history collection, are not necessarily among the institutional collections data shared with GBIF.

Data extraction from taxonomic literature can proceed along two major pathways: (1) prospective, where data is mobilized and shared with GBIF as part of the routine publication process, as has been implemented some journals like EJT¹³ and ZooKeys^2,8 and some revisionary studies⁵¹; and (2) retrospective, where data is mined from legacy taxonomic data^11,12. This retrospective approach was tested in our study by semantically enhancing records from more than 50 legacy taxonomic documents. From these sources, ca. 3,000 specimens of the family Liocranidae were structured and mobilized, including more than 1,300 records from about 100 treatments of TG taxa (Supplementary Table 1). These data included relevant biodiversity information, such as geographical distribution, date of collection, sex, and number of specimens.

Although the data contained in taxonomical treatments has been curated by specialists and is highly dependable, it is not free from error and methodological bias. Meyer, Weigelt, and Kreft⁵², in their study of land plant data available on GBIF, documented data biases in two major groups: coverage (geographical and temporal documentation gaps) and uncertainty (accuracy or credibility). Another bias observed in GBIF, as well as biodiversity studies and funding in general, is related to the taxonomic coverage and over representation of some groups like birds and plants and under representation of megadiverse groups like insects and arachnids^53,54,55,56 (Supplementary Table 2; see also Data Aggregation, below).

In our analysis we did not find clear cases of uncertainty bias with the exception of the absence of geographical coordinates that made some of the occurrences spatially ambiguous. However, geographical and temporal coverage bias was observed. Scientists do not sample randomly or evenly from the whole world; therefore, it should be expected that some areas and times are studied more than others. This makes it difficult to distinguish seasonal changes in abundance from uneven sampling effort at different times of the year. Nevertheless, existing records at least indicate the time of year when specimens have been found in the past, and might therefore be found again. Overall, records of TG taxa were not evenly spread throughout the year. For example, zero specimens of T. politus are recorded for the month of August, suggesting that this might not be best time of year to search for this species in Thailand (Figs. 2, 3). Although we had planned our sampling during the highest abundance peak (June–July; Fig. 3b), logistic constrains forced us to carry our sampling one month later. Nevertheless, we found a total of 188 specimens of this species during our collection, of which 95 were adults. Our results give evidence of the presence of these taxa during this time of the year, suggesting that the variation observed in legacy records is most probably due to temporal coverage bias and must be interpreted with care.

Another temporal coverage bias was observed when assessing specimen contributions per collector (Fig. 2b). We found P.J. Schwendinger to be the collector with most specimens contributed to the TG^{19,20,21,22,23}; between 1983 and 2009 he collected 231 TG specimens in Thailand. However, most of his specimens, presumably, due to logistics, were reported around June and July, and December. Therefore, temporal distribution patterns, as observed in literature-extracted data (Figs. 2 and 3), could be an artifact of sampling bias and not necessarily reflect real seasonal variation of the taxa.

Even taking into account these methodological biases, we consider specimen records in taxonomic literature to be among the best curated evidence of presence and, to some extent, relative abundances; and for many understudied and megadiverse taxa, this is the only source of specimen records available. Identifying and understanding data biases can help to identify temporal and spatial gaps were further sampling effort is needed.

Fieldwork

Data extracted from taxonomic literature on the family Liocranidae were used to create detailed profiles for the TG. These helped us to plan a collection that specifically targeted the re-collection of these taxa. Our analysis showed that within Southeast Asia, three provinces in Thailand, Chiang Mai, Phuket and Krabi were the best choice for targeting T. politus and its relatives.

This selection of times and places, in combination with specific methods for collecting ground spiders showed a high efficiency for sampling the TG. Our one-month expedition captured 134 adult spiders of the TG (Table 1) representing all TG genera previously reported for Thailand and six out of seven liocranid genera reported for this country (only missing Paratus Simon, 1898). In total, 351 adults of the TG had been reported from Thailand^{19,20,21,22,23,41}; from these, ca. 200 had been reported in the same provinces we sampled (Chiang Mai, Krabi and Phuket) (Table 1). When comparing only the collections reported for the same months where we sample, we can observe that our approach was much more efficient, collecting 134 adults vs. 48 in literature. We collected a total of nine TG species vs. 14 reported from the same provinces and six reported from the same provinces and times. From these, Teutamus politus was the most abundant species in both literature and our study with 66 and 95 adults respectively (Fig. 3b).We collect more specimens of this species (188) than all the previous records in literature combined (102 specimens)^19,21. Oedignatha spadix was the second most abundant in our study with 21 adult specimens; Oedignatha spadix is previously known only from Indonesia¹⁹.

Data aggregation

The interoperable network of Plazi allows the extracted data to be automatically shared with other biodiversity databases like GBIF. This allows taxonomic literature data to be analyzed together with data from Natural History collections and observation networks. Many studies have explored the limits and capabilities of GBIF data for setting conservation priorities^57,58,59,60, modeling^57,61,62, aggregation of different kinds of data and its biases^{52,56,59,60,63,64}, among others. The major GBIF data domains (institutional collections databases, observation networks, taxonomic literature, and, in some cases, DNA sequence databases), each have their particular biases, but taken together are complementary enough to serve as a basis for building more complete biodiversity knowledge. In the case of the Teutamus group, virtually all records in GBIF were originated from digitized collection data with only five records contributed through human observation and one through iBOL⁶⁵. Even in groups where other sources of data are not available, digitized collection data can give important insights on aspects like the group taxonomy and distributions. Two studies in the Amazonia highlight the importance of collection-based data, by aggregating museum specimen data of several unrelated taxa collected in Amazonia comparing their richness, distribution and endemism^66,67. This approach allowed them to identify undersampling bias taxonomically and spatially, and map priority areas for conservation based on biodiversity data. They also observed that even when individual datasets might be imperfect, the aggregation of different approaches and sources can help to better assess and allocate conservation efforts.

In our study, the addition of records from the taxonomic literature, aggregated with complementary data from other sources available on GBIF, improved the taxonomic, geographic, and seasonal coverage of TG taxa (Table 2), giving us an improved picture of their overall biodiversity pattern. Semantic enhancement of taxonomic literature cannot compete in volume against the millions of records sourced from natural history collections databases and especially observation networks. But records from taxonomic literature may be the only source of data available for the vast portion of biodiversity about which we know very little. In other words, observation network records tend to be copious but dominated by few species, while specimen records from natural history collections and especially taxonomic literature tend to be fewer in number, but are often the only source of data on rare species. The Plazi approach gives free and persistent access to high quality data curated by taxonomic experts that might potentially help to identify and close knowledge gaps for some underrepresented groups.

Table 2 Teutamus group in GBIF per collection/database comparing number of occurrences, total of specimens, geographical distribution and taxonomic coverage.

Full size table

Observation networks are some of the largest contributors to GBIF in terms of total records, but these tend to be quite limited in taxonomic focus and rarely include any but the most conspicuous and recognizable representatives of small bodied, high diversity groups like spiders. Here we emphasize the usefulness of the Plazi retrospective approach to close those gaps. Comparing a list of the currently valid species of the TG from the world spider catalog²⁷, the Plazi approach contributed with records on 89 out of 137 species. By contrast, only 41 species of the TG were present in GBIF before our study. Our contributions to the knowledge of these spiders can be also observed in the number of occurrences in GBIF. Literature extracted data on the TG currently represents 470 occurrences in GBIF versus the 180 occurrences that were available from collection-based data, observation and iBOL combined. Our marked-up documents account for 72% of the occurrences of the TG and the genus Teutamus, and 85% of records of our target species, Teutamus politus (Fig. 4). This gives evidence of the complementarity of these data sources and the importance of mobilizing and making publicly available all the specimen data contained in taxonomic literature.

It is worth noting that this complementarity can also mean that some records from literature and digitized collection data could be overlapping. However, ruling out these cases demands unambiguous collection numbers or specimen identifiers; or, in case this number is absent, comparing probable matches by collection date, locality, specimen count, and other data. For the Teutamus group, some records available in GBIF do have a unique collection number (e.g. Teutamus politus RMNH.ARA.15194). However, these identifiers are not always available (either in GBIF, on literature or on both) making difficult to reconcile data from different sources. Therefore setting unique identifiers and strengthening publication standards must be a top priority for the future^{12,69,70,71,72}. This will help to generate usable and reliable datasets that can help to observe, study, and ultimately preserve biodiversity.

Structured, digitized specimen data extracted from taxonomic literature remains a small portion of the overall biodiversity data sphere, but it complements more mainstream data sources in important ways and has the potential to grow into a major source of data in its own right. Our study shows the importance of taxonomic literature records that, in combination with data from other sources, contributes to the most complete available assessment of spatial and temporal biodiversity pattern. Using this data for field work planning is but one possible application, but conservation risk assessment and species distribution modeling could be important in this context as well. The Plazi approach makes these data permanently available for others to re-use and add to in ways that we may or may not be able to currently imagine. Despite decades of ambitious and largely successful digitization efforts, much of the knowledge that biologists have accumulated about global biodiversity remains undigitized and unstructured, unqueryable, and difficult to access. The challenges presented by the global biodiversity crisis are daunting, and our best hope for addressing it begins with building a data infrastructure that faithfully represents the knowledge that generations of scientists have accumulated; specimen records from taxonomic literature are a key element in such an infrastructure.

Methods

Literature data extraction

We accessed all taxonomic literature of the family Liocranidae available in the World Spider Catalog²⁷. We selected 55 publications that contained taxonomic treatments of the family Liocranidae^{19,20,21,22,23,31,32,33,34,35,36,37,38,39,40,41,42,43,44,45,46,47,48,73,74,75,76,77,78,79,80,81,82,83,84,85,86,87,88,89,90,91,92,93,94,95,96,97,98,99,100,101,102,103,104,105,106,107} (for full list, see Supplementary Table 1). We selected and processed all publications that provided taxonomic treatments with specimen data and usable geographical references. Publications written in a language other than English were not processed since OCR parsing, as implemented by the programs used here, has mostly been developed in this language. From the marked-up documents, 21 contained information on members of the TG and two on the species T. politus. We used the program GoldenGATE Imagine V.3 (GGI; https://plazi.org/resources/treatmentbank/goldengate-editor/) to semantically enhance PDF documents, allowing atomization and categorization of data. In some cases, ABBYY FineReader V. 11 was used first to extract and correct text from the PDF document using optical character recognition (ORC) and text editing functions. Once the PDF documents were marked and revised, we used GoldenGATE to upload the files to Plazi’s TreatmentBank¹⁴.

Data analysis

We used Plazi Treatment Collection Statistics tool (https://tb.plazi.org/GgServer/srsStats) to download all the information relevant to our study in an excel spreadsheet to facilitate fine-grained management and analysis, largely following the approach described by Miller et al. (2015). We used these specimen based data to create profiles of the TG species allowing us to visualize where and when these taxa had been collected. Also, we used the GBIF occurrence search tool (https://www.gbif.org/occurrence/search) to look for records on our relevant TG taxa. The specific datasets we used can be found in the Data Accessibility section.

Site selection

Literature data were used to design our field collection in a way that allowed us to optimize the collection of adult specimens of our target taxa in Southeast Asia (SEA). We explored the number of specimens of the TG reported per country, province and location whenever possible. We favored those locations with a higher representation of genera from the TG but also those where T. politus had been reported. Finally, we analyzed the total number of adult specimens collected per month for both the TG species and T. politus in order to increase the chances of finding adult spiders. Based on this, we decided to sample in three provinces in Thailand between July 16 and August 12, 2018.

Sampling

Following the results of our literature analysis, we prioritized collections in national parks and protected areas. Precise geographical coordinates and specific habitat information was scarce or missing altogether in most taxonomic treatments. Therefore, we further divided each site in four different vegetation types (collecting sites details in the Supplementary Table 2) allowing us to cover a wide range of available habitats. We combined pitfall traps, Winkler extractors (for soil arthropods; www.entowinkler.at), and direct collecting targeting ground spiders. A mixture of propylene-glycol and ethanol was used in the pitfalls to avoid excessive evaporation and help with DNA preservation¹⁰⁸; all specimens were collected and stored in 96% ethanol. All liocranid spiders were identified to species level. Juvenile spiders were assigned to a species only when they were at a pre-adult or late juvenile instar and adults were present in the same sample minimizing ambiguities.

Data availability

Extracted data is available from Plazi¹⁴ tb.plazi.org/GgServer/srsStats (refining search as needed) and GBIF^65,109,110. A list of all the Plazi document UUID used in this study can be found in the Supplementary Table 1.

References

Catapano, T. NoTaxPub: An Extension of the NLM/NCBI Journal Publishing DTD for Taxonomic Descriptions. in Journal Article Tag Suite Conference (JATS-Con) Proceedings 2010 (National Center for Biotechnology Information, 2010). https://doi.org/10.5281/zenodo.3484285
Penev, L. et al. Semantic tagging of and semantic enhancements to systematics papers: Zookeys working examples. Zookeys 50, 1–16 (2010).
Google Scholar
Penev, L. et al. XML schemas and mark-up practices of taxonomic literature. Zookeys 150, 89–116 (2011).
Article Google Scholar
Dikow, T. & Agosti, D. Utilizing online resources for taxonomy: a cybercatalog of Afrotropical apiocerid flies (Insecta: Diptera: Apioceridae). Biodivers. Data J. 3, e5707 (2006).
Article Google Scholar
Creech, J. Biodiversity heritage library. Coll. Res. Libr. News 73, 626–627 (2012).
Article Google Scholar
Gwinn, N. E. & Rinaldo, C. The Biodiversity Heritage Library: Sharing biodiversity literature with the world. IFLA J. 35, 25–34 (2009).
Article Google Scholar
Page, R. D. M. Enhanced display of scientific articles using extended metadata. J. Web Semant. 8, 190–195 (2010).
Article Google Scholar
Agosti, D., Catapano, T., Sautter, G. & Egloff, W. The Plazi Workflow: The PDF prison break for biodiversity data. Biodivers. Inf. Sci. Stand. https://doi.org/10.3897/biss.3.37046 (2019).
Article Google Scholar
Cui, H. Converting taxonomic descriptions to new digital formats. Biodivers. Inform. 5, 20–40 (2008).
Article Google Scholar
Thessen, A. E. & Patterson, D. Data issues in the life sciences. Zookeys 150, 15–51 (2011).
Article Google Scholar
Miller, J. A. et al. From taxonomic literature to cybertaxonomic content. BMC Biol. 10, 87 (2012).
Article PubMed PubMed Central Google Scholar
Miller, J. A. et al. Integrating and visualizing primary data from prospective and legacy taxonomic literature. Biodivers. data J. 3, e5063 (2015).
Article Google Scholar
Chester, C. et al. EJT editorial standard for the semantic enhancement of specimen data in taxonomy literature. Eur. J. Taxon. https://doi.org/10.5852/ejt.2019.586 (2019).
Article Google Scholar
Plazi. PLAZI Home Page. Available from https://plazi.org/.org [20th June 2020]. (2020).
Agosti, D. & Egloff, W. Taxonomic information exchange and copyright: the Plazi approach. BMC Res. Notes 2, 53 (2009).
Article PubMed PubMed Central Google Scholar
GBIF. Global Biodiversity Informaton Facility Home Page. Available from: https://www.gbif.org [4th April 2019]. (2019).
GBIF Secretariat. GBIF Backbone Taxonomy. Checklist dataset https://doi.org/10.15468/39omei accessed via GBIF. (2019).
Ramírez, M. J. The morphology and phylogeny of dionychan spiders (Araneae, Araneomorphae). Bull. Am. Museum Nat. Hist. 390, 1–374 (2014).
Article Google Scholar
Deeleman-Reinhold, C. Forest spiders of South East Asia: with a revision of the sac and ground spiders (Araneae: Clubionidae, Corinnidae, Liocranidae, Gnaphosidae, Prodidomidae and Trochanterriidae) (Leiden, Brill, 2001).
Google Scholar
Dankittipakul, P., Tavano, M. & Singtripop, T. Neotype designation for Sphingius thecatus Thorell 1890 synonymies new records and descriptions of six new species from Southeast Asia(Araneae Liocranidae). Zootaxa 2, 1–20 (2011).
Article Google Scholar
Dankittipakul, P., Tavano, M. & Singtripop, T. Seventeen new species of the spider genus Teutamus Thorell, 1890 from Southeast Asia (Araneae: Liocranidae). J. Nat. Hist. 46, 1689–1730 (2012).
Article Google Scholar
Dankittipakul, P., Tavano, M. & Singtripop, T. Revision of the spider genus Jacaena Thorell, 1897, with descriptions of four new species from Thailand (Araneae: Corinnidae). J. Nat. Hist. 47, 1539–1567 (2013).
Article Google Scholar
Dankittipakul, P. & Deeleman-Reinhold, C. Delimitation of the spider genus Sesieutes Simon, 1897, with descriptions of five new species from South East Asia (Araneae: Corinnidae). J. Nat. Hist. 47, 167–195 (2013).
Article Google Scholar
Wheeler, W. C. et al. The spider tree of life: phylogeny of Araneae based on target-gene analyses from an extensive taxon sampling. Cladistics 33, 574–616 (2017).
Article PubMed Google Scholar
Rivera-Quiroz, F. A., Schilthuizen, M., Petcharad, B. & Miller, J. A. Imperfect and askew: A review of asymmetric genitalia in araneomorph spiders (Araneae: Araneomorphae). PLoS ONE https://doi.org/10.1371/journal.pone.0220354 (2020).
Article PubMed PubMed Central Google Scholar
Wilkinson, M. D. et al. The FAIR Guiding Principles for scientific data management and stewardship. Sci. Data 3, 2 (2016).
Article Google Scholar
WSC. World Spider Catalog Version 21.0. Natural History Museum Bern, online at https://wsc.nmbe.ch, accessed on {17-June-2020}. (2020). https://doi.org/10.24436/2
Rstudio, T. RStudio: Integrated Development for R. Rstudio Team, PBC, Boston, MA URL https://www.rstudio.com/ (2020). https://doi.org/10.1145/3132847.3132886
Becker, R. A., Wilks, A. R., Brownrigg, R., Minka, T. P. & Deckmyn, A. CRAN—Package maps. CRAN R-Project (2017).
Wickham, H. ggplot2 elegant graphics for data analysis (Use R!). Springer https://doi.org/10.1007/978-0-387-98141-3 (2016).
Article MATH Google Scholar
Biswas, V. & Roy, R. Description of six new species of spiders of the genera Lathys (Family: Dictynidae), Marpissa (Family: Salticidae), Misumenoides (Family: Thomisidae), Agroeca (Family: Clubionidae), Gnaphosa (Family: Gnaphosidae) and Flanona (Family: Lycosidae) - F. Rec. Zool. Surv. India 108, 43–57 (2008).
Google Scholar
Biswas, B. & Biswas, K. Araneae: Spiders. in State Fauna series 3: Fauna of West Bengal 3 357–500 (1992).
Biswas, B. & Majumder, S. C. Araneae: Spider. in Fauna of Meghalaya, State Fauna Series. Zoological Survey of India Kolkata 93–128 (1995).
Chen, S. H. & Huang, W. J. A newly recorded spider Oedignatha platnicki Song et Zhu 1998 from Taiwan, with description of the female (Araneae, Corinnidae). BioFormosa 44, 31–36 (2009).
Google Scholar
Dankittipakul, P. & Deeleman-Reinhold, C. A new spider species of the genus Sudharmia from Sumatra, Indonesia (Araneae, Liocranidae). Dongwuxue Yanjiu 33, 187–190 (2012).
PubMed Google Scholar
Jäger, P. Spiders from Laos with descriptions of new species (Arachnida: Araneae). Acta Arachnol. 56, 29–58 (2007).
Article Google Scholar
Ono, H. Three new spiders of the family Clubionidae, Liocranidae and Gnaphosidae (Arachnida, Araneae) from Vietnam. Bull. Natl. Museum Nat. Sci. Tokyo 35, 1–8 (2009).
Google Scholar
Reddy, T. S. & Patel, B. H. Two new species of the genus Oedignatha Thorell (Araneae: Clubionidae) from Coastal Andhra Pradesh. India. Entomon 18, 47–51 (1993).
Google Scholar
Saaristo, M. I. New species and interesting new records of spiders from Seychelles (Arachnida, Araneaea). Phelsuma 10, 1–32 (2002).
Google Scholar
Tso, I., Zhu, M. S., Zhang, J. & Zhang, F. Two new and one newly recorded species of Corinnidae and Liocranidae from Taiwan (Arachnida: Araneae). Acta Arachnol. 54, 45–49 (2005).
Article Google Scholar
Zhang, F. & Fu, J. Y. First Report of the Genus Sesieutes Simon (Araneae: Liocranidae) from China, with Description of One New Species. Entomol. News 121, 69–74 (2010).
Article Google Scholar
Zhang, F., Fu, J. Y. & Zhu, M. S. Spiders of the genus Sphingius (Araneae: Liocranidae) from China, with description of two new species. Zootaxa 2, 31–44 (2009).
Article Google Scholar
Zhao, Y. & Peng, X. J. Three new species of spiders of the family Liocranidae (Arachnida: Araneae) from China. Orient. Insects 47, 176–183 (2013).
Article Google Scholar
Barrion, A. T. & Litsinger, J. A. Family Clubionidae Wagner- Genera Alaeho, Castianeira, Agroeca, Phrurolithus & Scotinella. in Riceland spiders of South and Southeast Asia 170–180 (1995). https://doi.org/10.5281/zenodo.897849
Bastawade, D. B. Replacement name for Amaurobius indicus Bastawade and its transfer to family Corinnidae (Arachnida: Araneae). Zoo’s Print J. 21, 2307 (2006).
Article Google Scholar
Bastawade, D. B. Three new species from the spider families Amaurobiidae, Thomisidae and Salticidae (Araneae: Arachnida) from India. J. Bombay Nat. Hist. Soc. 99, 274–281 (2002).
Google Scholar
Bennett, R., Copley, C. & Copley, D. Apostenus ducati (Araneae: Liocranidae) sp. nov.: A second Nearctic species in the genus. Zootaxa 3647, 63–74 (2013).
Article PubMed Google Scholar
Biswas, V. & Raychaudhuri, D. Sac spiders of Bangladesh-II: Genera Castianeira Keyserling, Sphingius Thorell and Trachelas Koch (Araneae: Clubionidae). Rec. Zool. Surv. India 98, 131–139 (2000).
Google Scholar
Meier, R. & Dikow, T. Significance of specimen databases from taxonomic revisions for estimating and mapping the global species diversity of invertebrates and repatriating reliable specimen data. Conserv. Biol. 18, 478–488 (2004).
Article Google Scholar
Dikow, T., Meier, R., Vaidya, G. G. & Londt, J. G. H. Biodiversity research based on taxonomic revisions—A Tale of Unrealized Opportunities. in Diptera Diversity: Status, Challenges, and Tools 323–345 (2009).
Markee, A. & Dikow, T. Taxonomic revision of the assassin-fly genus Microphontes Londt, 1994 (Insecta, diptera, asilidae). African Invertebr. https://doi.org/10.3897/afrinvertebr.59.30684 (2018).
Article Google Scholar
Meyer, C., Weigelt, P. & Kreft, H. Multidimensional biases, gaps and uncertainties in global plant occurrence information. Ecol. Lett. 19, 992–1006 (2016).
Article PubMed Google Scholar
Leather, S. R. Taxonomic chauvinism threatens the future of entomology. Biologist (2009).
Cardoso, P., Erwin, T. L., Borges, P. A. V. & New, T. R. The seven impediments in invertebrate conservation and how to overcome them. Biol. Conserv. https://doi.org/10.1016/j.biocon.2011.07.024 (2011).
Article Google Scholar
Titley, M. A., Snaddon, J. L. & Turner, E. C. Scientific research on animal biodiversity is systematically biased towards vertebrates and temperate regions. PLoS ONE https://doi.org/10.1371/journal.pone.0189577 (2017).
Article PubMed PubMed Central Google Scholar
Troudet, J., Grandcolas, P., Blin, A., Vignes-Lebbe, R. & Legendre, F. Taxonomic bias in biodiversity data and societal preferences. Sci. Rep. https://doi.org/10.1038/s41598-017-09084-6 (2017).
Article PubMed PubMed Central Google Scholar
Bartomeus, I., Stavert, J. R., Ward, D. & Aguado, O. Historical collections as a tool for assessing the global pollination crisis. Philos. Trans. R. Soc. B. Biol. Sci. https://doi.org/10.1098/rstb.2017.0389 (2019).
Article Google Scholar
Shirey, V., Seppälä, S., Branco, V. V. & Cardoso, P. Current GBIF occurrence data demonstrates both promise and limitations for potential red listing of spiders. Biodivers. Data J. https://doi.org/10.3897/BDJ.7.E47369 (2019).
Article PubMed PubMed Central Google Scholar
Bayraktarov, E. et al. Do big unstructured biodiversity data mean more knowledge?. Front. Ecol. Evol. https://doi.org/10.3389/fevo.2018.00239 (2019).
Article Google Scholar
Iannella, M., D’Alessandro, P. & Biondi, M. Entomological knowledge in Madagascar by GBIF datasets: Estimates on the coverage and possible biases (Insecta). Fragm. Entomol. https://doi.org/10.4081/fe.2019.329 (2019).
Article Google Scholar
Beck, J., Böller, M., Erhardt, A. & Schwanghart, W. Spatial bias in the GBIF database and its effect on modeling species’ geographic distributions. Ecol. Inform. https://doi.org/10.1016/j.ecoinf.2013.11.002 (2014).
Article Google Scholar
Smith, J. A., Benson, A. L., Chen, Y., Yamada, S. A. & Mims, M. C. The power, potential, and pitfalls of open access biodiversity data in range size assessments: Lessons from the fishes. Ecol. Indic. https://doi.org/10.1016/j.ecolind.2019.105896 (2020).
Article Google Scholar
Meyer, C., Jetz, W., Guralnick, R. P., Fritz, S. A. & Kreft, H. Range geometry and socio-economics dominate species-level biases in occurrence information. Glob. Ecol. Biogeogr. https://doi.org/10.1111/geb.12483 (2016).
Article Google Scholar
Hochmair, H. H., Scheffrahn, R. H., Basille, M. & Boone, M. Evaluating the data quality of iNaturalist termite records. PLoS ONE https://doi.org/10.1371/journal.pone.0226534 (2020).
Article PubMed PubMed Central Google Scholar
GBIF.org (11 August 2020). GBIF Occurrence Download (Liocranidae) https://doi.org/10.15468/dl.jfy7sp.
Kress, W. J. et al. Amazonian biodiversity: Assessing conservation priorities with taxonomic data. Biodivers. Conserv. 7, 1577–1587 (1998).
Article Google Scholar
Heyer, W. R. et al. Amazonian biotic data and conservation decisions. J. Braz. Assoc. Adv. Sci. 51, 372–385 (1999).
Google Scholar
Gu, Z., Gu, L., Eils, R., Schlesner, M. & Brors, B. Circlize implements and enhances circular visualization in R. Bioinformatics https://doi.org/10.1093/bioinformatics/btu393 (2014).
Article PubMed PubMed Central Google Scholar
Page, R. D. M. Biodiversity informatics: The challenge of linking data and the role of shared identifiers. Brief. Bioinform. https://doi.org/10.1093/bib/bbn022 (2008).
Article PubMed Google Scholar
Page, R. D. M. BioGUID: Resolving, discovering, and minting identifiers for biodiversity informatics. BMC Bioinform. https://doi.org/10.1186/1471-2105-10-S14-S5 (2009).
Article Google Scholar
Guralnick, R. P. et al. Community next steps for making globally unique identifiers work for biocollections data. Zookeys https://doi.org/10.3897/zookeys.494.9352 (2015).
Article PubMed PubMed Central Google Scholar
Nelson, G., Sweeney, P. & Gilbert, E. Use of globally unique identifiers (GUIDs) to link herbarium specimen records to physical specimens. Appl. Plant Sci. https://doi.org/10.1002/aps3.1027 (2018).
Article PubMed PubMed Central Google Scholar
Bosmans, R. The genera Agroeca, Agraecina, Apostenus and Scotina in the Maghreb countries (Araneae: Liocranidae). Bull Inst. R. Sci. Nat. Belgique 69, 25–34 (1999).
Google Scholar
Bosmans, R. On some new or rare spider species from Lesbos, Greece (Araneae: Agelenidae, Amaurobiidae, Corinnidae, Gnaphosidae, Liocranidae). Arachnol. Mitteilungen 2, 15–22. https://doi.org/10.5431/aramit4003 (2011).
Article Google Scholar
Bosmans, R. & van Keer, J. On the spider species described by L. Koch in 1882 from the Balearic Islands (Araneae). Arachnol. Mitteilungen 43, 5–16 (2012).
Article Google Scholar
Bosselaers, J. Studies in Liocranidae (Araneae): Redescriptions and transfers in Apostenus Westring and Brachyanillus Simon, as well as description of a new genus. Zootaxa 2, 37–55 (2009).
Article Google Scholar
Bosselaers, J. Two interesting new ground spiders (Araneae) from the Canary Islands and Greece. Serket 13, 83–90 (2012).
Google Scholar
Bosselaers, J. et al. High-resolution X-ray computed tomography of an extant new Donuea (Araneae: Liocranidae) species in Madagascan copal. Zootaxa 2, 25–35 (2010).
Article Google Scholar
Bosselaers, J. & Jocqué, R. Studies in Liocranidae (Araneae): A new afrotropical genus featuring a synapomorphy for the Cybaeodinae. Eur. J. Taxon. 40, 1–49 (2013).
Google Scholar
Candek, K. et al. Targeting a portion of central European spider diversity for permanent preservation. Biodivers. Data J. 1, e980 (2013).
Article Google Scholar
Crespo, L. C. et al. A DNA barcode-assisted annotated checklist of the spider (Arachnida, Araneae) communities associated to white oak woodlands in Spanish National Parks. Biodivers. Data J. 6, e29443 (2018).
Article Google Scholar
Danilov, S. N. The spider family Liocranidae in Siberia and Far East (Aranei). Arthropoda Sel. 7, 313–317 (1998).
Google Scholar
Deltshev, C. et al. Faunistic diversity of spiders (Araneae) in Galichitsa mountain (FYR Macedonia). Biodivers. Data J. 1, e977 (2013).
Article Google Scholar
Deltshev, C. & Wang, C. A new Agraecina spider species from the Balkan Peninsula (FYR Macedonia) (Araneae: Liocranidae). Zootaxa 4117, 135–140 (2016).
Article PubMed Google Scholar
Elverici, M., Özkütük, R. S. & Kunt, K. B. Two new liocranid species records from Turkey (Araneae: Liocranidae). Munis Entomol. Zool. 1, 305–308 (2013).
Google Scholar
Esyunin, S. L. & Kazantsev, D. K. On the spider (Aranei) fauna of the Pechoro-Ilychskiy reserve (north Urals), with the description of a new Agroeca species (Liocranidae). Arthropoda Sel. 16, 245–250 (2007).
Google Scholar
Felton, C., Judd, S. & Merrett, P. Agroeca dentigera Kulczynski, 1913, a liocranid spider new to Britain (Araneae, Liocranidae). Bull. Br. Arachnol. Soc. 13, 90–92 (2004).
Google Scholar
Fu, J. Y., Zhang, F. & Zhu, M. S. Redescription of a little-known spider species, Mesiotelus lubricus (Simon, 1880) (Aranei: Liocranidae) from China. Arthropoda Sel. 17, 169–173 (2009).
Google Scholar
Hayashi, T. Three species of the genus Agroeca (Araneae: Clubionidae) from Japan, including a new species. Acta Arachnol. 41, 133–137 (1992).
Article Google Scholar
Jonsson, L. J. Agroeca dentigera and Entelecara omissa (Araneae: Liocranidae, Linyphiidae), found in Sweden. Arachnol. Mitteilungen 2, 49–52 (2005).
Article Google Scholar
Marusik, Y. M. & Koponen, S. New data on spiders (Aranei) from the Maritime Province, Russian Far East. Arthropoda Sel. 9, 55–68 (2000).
Google Scholar
Marusik, Y. M., Omelko, M. M. & Koponen, S. Rare and new for the fauna of the Russian Far East spiders (Aranei). Far East. Entomol. 317, 1–15 (2016).
Google Scholar
Marusik, Y. M., Zheng, G. & Li, S. A review of the genus Paratus Simon (Araneae, Dionycha). Zootaxa 1965, 50–60 (2008).
Article Google Scholar
Namkung, J. A new species of the genus Agroeca (Araneae: Clubionidae) from Korea. Korean Arachnol. 5, 23–27 (1989).
Google Scholar
Platnick, N. I. & Di Franco, F. On the relationship of the spider genus Cybaeodes (Araneae, Dionycha). Am. Museum Novit. 9, 2 (1992).
Google Scholar
Reboleira, A. S. et al. Catalogue of the type material in the entomological collection of the University of La Laguna (Canary Islands, Spain). I. Arachnida. Zootaxa 3556, 61–79 (2012).
Article Google Scholar
Ribera, C. & de Mas, E. Description of three new troglobiontic species of Cybaeodes (Araneae, Liocranidae) endemic to the Iberian Peninsula. Zootaxa 3957, 313–323 (2015).
Article PubMed Google Scholar
Sankaran, P. M., Malamel, J. J., Joseph, M. M. & Sebastian, P. A. A new species of Paratus Simon, 1898 (Araneae: Liocranidae, Paratinae) from India. Zootaxa 4286, 139–144 (2017).
Article Google Scholar
Seo, B. K. Description of three liocranid spider species from Korea (Araneae: Liocranidae). Entomol. Res. 41, 98–102 (2011).
Article Google Scholar
Seyyar, O., Oba, A., Demir, H. & Turkes, T. Arabelia Bosselaers, 2009 and Arabelia pheidoleicomes Bosselaers, 2009 (Araneae: Liocranidae) are new records for the Turkish Spider Fauna. Serket 15, 30–32 (2016).
Google Scholar
Ubick, D. & Platnick, N. I. On Hesperocranum, A New Spider Genus from Western North America (Araneae, Liocranidae). Am. Museum Novit. 2, 1–12 (1991).
Google Scholar
Ubick, D. & Vetter, R. S. A new species of Apostenus from California, with notes on the genus (Araneae, Liocranidae). J. Arachnol. 33, 63–75 (2005).
Article Google Scholar
Vetter, R. S. Revision of the spider genus Neoanagraphis (Araneae, Liocranidae). J. Arachnol. 29, 1–10 (2001).
Article Google Scholar
Warui, C. & Jocqué, R. The first Gallieniellidae (Araneae) from Eastern Africa. J. Arachnol. 30, 307–315 (2002).
Article Google Scholar
Wunderlich, J. On European spiders of the nominal families Liocranidae, Miturgidae and Zoridae (Araneae), with descriptions of new taxa. Beiträge zur Araneologie 6, 108–120 (2011).
Google Scholar
Zapata, L. V. & Ramírez, M. J. A new species of the genus Paratus Simon (araneae: liocranidae) from Thailand. Zootaxa 2, 65–68 (2010).
Article Google Scholar
Zonstein, S. L., Marusik, Y. M. & Omelko, M. A survey of spider taxa new to Israel (Arachnida: Araneae). Zool. Middle East 61, 372–385 (2015).
Article Google Scholar
Vink, C. J., Thomas, S. M., Paquin, P., Hayashi, C. Y. & Hedin, M. The effects of preservatives and temperatures on arachnid DNA. Invertebr. Syst. 19, 99–104 (2005).
Article CAS Google Scholar
GBIF. org (12 July 2019). GBIF Occurrence Download (Liocranidae) https://doi.org/10.15468/dl.fcpcw9.
GBIF.org (20 March 2020). GBIF Occurrence Download (Sudharmia, Sesieutes, Jacaena, Koppe, Sphingius, Oedignatha, Teutamus) https://doi.org/10.15468/dl.3eh0rl.

Download references

Acknowledgements

Thanks to Joe Dulyapat and Choojai Petcharad for their great assistance and participation during our fieldwork in Thailand. Thanks to editor Uwe Fritz, reviewer Torsten Dikow, and three anonymous reviewers for their valuable comments and suggestions. Funding for the first author was provided by CONACyT Becas al extranjero 294543/440613, Mexico. All specimens collected by us in Thailand were authorized under permit 5830802 emitted by the Department of National Parks, Wildlife and Plant Conservation, Thailand.

Author information

Authors and Affiliations

Department of Terrestrial Zoology, Understanding Evolution group, Naturalis Biodiversity Center, Darwinweg 2, 2333CR, Leiden, The Netherlands
F. Andres Rivera-Quiroz & Jeremy A. Miller
Institute of Biology Leiden (IBL), Leiden University, Sylviusweg 72, 2333BE, Leiden, The Netherlands
F. Andres Rivera-Quiroz
Faculty of Science and Technology, Thammasat University, Rangsit, 12121, Pathum Thani, Thailand
Booppa Petcharad
Plazi, Zinggstrasse 16, CH 3007, Bern, Switzerland
Jeremy A. Miller

Authors

F. Andres Rivera-Quiroz
View author publications
You can also search for this author in PubMed Google Scholar
Booppa Petcharad
View author publications
You can also search for this author in PubMed Google Scholar
Jeremy A. Miller
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

A.R. and J.M. conceived the ideas and designed methodology; A.R., J.M. and B.P. collected the data; A.R. analysed the data; A.R. led the writing of the manuscript. All authors contributed critically to the drafts and gave final approval for publication.

Corresponding author

Correspondence to F. Andres Rivera-Quiroz.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary file1

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Rivera-Quiroz, F.A., Petcharad, B. & Miller, J.A. Mining data from legacy taxonomic literature and application for sampling spiders of the Teutamus group (Araneae; Liocranidae) in Southeast Asia. Sci Rep 10, 15787 (2020). https://doi.org/10.1038/s41598-020-72549-8

Download citation

Received: 18 May 2020
Accepted: 02 September 2020
Published: 25 September 2020
DOI: https://doi.org/10.1038/s41598-020-72549-8

This article is cited by

A Local Discrete Text Data Mining Method in High-Dimensional Data Space
- Juan Li
- Aiping Chen
International Journal of Computational Intelligence Systems (2022)

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.

Subjects

Abstract

Similar content being viewed by others

Phylogenomics and the rise of the angiosperms

Isotopic evidence of high reliance on plant food among Later Stone Age hunter-gatherers at Taforalt, Morocco

Network of large pedigrees reveals social practices of Avar communities

Introduction

Results

Literature data analysis

Fieldwork

Discussion

Literature data analysis

Fieldwork

Data aggregation

Methods

Literature data extraction

Data analysis

Site selection

Sampling

Data availability

References

Acknowledgements

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Competing interests

Additional information

Publisher's note

Supplementary information

Supplementary file1

Rights and permissions

About this article

Cite this article

Share this article

This article is cited by

A Local Discrete Text Data Mining Method in High-Dimensional Data Space

Comments

Search

Quick links