United States wildlife and wildlife product imports from 2000–2014

Eskew, Evan A.; White, Allison M.; Ross, Noam; Smith, Kristine M.; Smith, Katherine F.; Rodríguez, Jon Paul; Zambrana-Torrelio, Carlos; Karesh, William B.; Daszak, Peter

doi:10.1038/s41597-020-0354-5

Data Descriptor
Open access
Published: 16 January 2020

Evan A. Eskew ORCID: orcid.org/0000-0002-1153-5356¹,
Allison M. White¹,
Noam Ross ORCID: orcid.org/0000-0002-2136-0000¹,
Kristine M. Smith¹,
Katherine F. Smith²,
Jon Paul Rodríguez^3,4,5,
Carlos Zambrana-Torrelio¹,
William B. Karesh¹ &
Peter Daszak¹

Scientific Data volume 7, Article number: 22 (2020)

Abstract

The global wildlife trade network is a massive system that has been shown to threaten biodiversity, introduce non-native species and pathogens, and cause chronic animal welfare concerns. Despite its scale and impact, comprehensive characterization of the global wildlife trade is hampered by data that are limited in their temporal or taxonomic scope and detail. To help fill this gap, we present data on 15 years of the importation of wildlife and their derived products into the United States (2000–2014), originally collected by the United States Fish and Wildlife Service. We curated and cleaned the data and added taxonomic information to improve data usability. These data include >2 million wildlife or wildlife product shipments, representing >60 biological classes and >3.2 billion live organisms. Further, the majority of species in the dataset are not currently reported on by CITES parties. These data will be broadly useful to both scientists and policymakers seeking to better understand the volume, sources, biological composition, and potential risks of the global wildlife trade.

Measurement(s)	Import • wildlife • wildlife product
Technology Type(s)	digital curation
Sample Characteristic - Environment	wildlife trade network
Sample Characteristic - Location	United States of America

Machine-accessible metadata file describing the reported data: https://doi.org/10.6084/m9.figshare.11439471

Background & Summary

The wildlife trade represents a major threat to the conservation of many species due to the harvest and depletion of wild populations for the purpose of trade in animals and/or their derived products^{1,2,3,4,5,6,7}. Consequently, understanding trade patterns and drivers is essential to mitigating the negative effects of trade on ecosystems, including those on which humanity depends⁸. Characterization of the direct harvest and subsequent trade in wildlife is conceptually straightforward and should be aided by existing governmental monitoring programs. Currently, however, data on biological resource use are particularly scarce relative to information on other conservation threats, and the utility of existing datasets is often limited by a narrow taxonomic focus⁹. Furthermore, comprehensive evaluation of the wildlife trade at domestic and international scales is complicated by the existence of both legal trade pathways, which are subject to differing regulations and monitoring effort in different nations, and illegal trade pathways, which are under-detected and under-reported due to their illicit nature^10,11. Finally, multi-country wildlife trade data sources, like the CITES Trade Database, can have reporting discrepancies and complex data structures that challenge analysis and interpretation^{12,13,14,15,16,17,18}. Despite these difficulties, efforts to describe and quantify the wildlife trade have scientific value, given the trade’s demonstrated impact on wildlife conservation status^2,3,4,6, animal welfare¹⁹, the introduction of non-native species^20,21,22, and the spread of non-native pathogens, including zoonoses that may threaten human health^10,11,23,24.

The United States Fish and Wildlife Service’s (USFWS) Law Enforcement Management Information System (LEMIS) data have been used as a resource for research on the legal wildlife trade. These data, derived from legally mandated reports submitted to USFWS¹¹, contain information on US imports/exports of both live organisms and wildlife products. Previous studies, having obtained LEMIS records through Freedom of Information Act (FOIA) requests, have used the data to address broad temporal and taxonomic patterns in the US wildlife trade^8,11 and trends in the trade of specific focal taxa^18,25,26,27. However, the LEMIS trade data underlying analyses have either not been shared as part of the publication process, or the data that have been released focus on relatively limited time periods and study taxa. In addition, to the best of our knowledge, LEMIS data are not permanently archived¹¹, and independent parties acquiring LEMIS data may obtain subtly different datasets depending upon the date and specifics of their data requests. These factors, combined with the time investment and domain-specific knowledge required to request, process, and interpret LEMIS records, are likely barriers to the wider use of LEMIS data and may muddle comparability among studies.

Here, we collate and share 15 years of USFWS LEMIS wildlife trade importation data. While we have previously summarized different portions of these data^8,11,25, the cleaned dataset resulting from our data compilation efforts has not been released until now. Furthermore, we provide an R package interface for the dataset, aiming to streamline data access and ease the key initial analytical steps of data manipulation and visualization. This dataset will be of broad interest to researchers investigating the conservation implications of overexploitation through trade, the introduction of alien species, and the potential health impacts on humans, native wildlife, and domesticated species of the widespread transport of wildlife that may harbor pathogens of concern. Critically, it represents a single data resource that is relevant to researchers working across diverse taxonomic groups, allowing for greater comparability across wildlife trade work in the future.

Methods

On a consistent basis since the mid-2000s, we have filed FOIA requests to USFWS for LEMIS data concerning importation of wildlife and wildlife products from all countries, noting that we were interested in both legal and illegal products that were documented and/or seized by US authorities. Specifically, we requested: taxonomic information (i.e., species identity or lowest-level taxonomic identification available), value of the product (reported in US dollars), wildlife description (i.e., type of wildlife product such as “live” or “skin”), quantity, unit (of the quantity metric), country of origin, country of shipment, action taken by USFWS on import, final disposition decision, date of disposition, date of shipment, the US port where the product was received, the US importer, and the foreign exporter (Table 1). In response to these requests, we received data on the wildlife trade broadly defined, composed mostly of information on vertebrates and invertebrates but also including some records of plants and microorganisms. At the time of writing, these requests have generated 15 years of US wildlife importation data spanning from 2000 through 2014²⁸. We acknowledge this is a subset of the full LEMIS database, but as we continue to file requests for more recent LEMIS data, the version-controlled Zenodo data repository and R package will be updated accordingly.

Table 1 LEMIS metadata showing data fields and field descriptions for all variables appearing in the cleaned dataset.

Data processing is described here only in broad outline both for brevity and because the entire data cleaning workflow is publicly available for inspection (see “Code availability” section). Raw LEMIS data were provided by the USFWS as Microsoft Excel files, and file structure varied slightly across request responses. We aggregated these data into a single database, and performed a variety of quality assurance and data cleaning operations to improve data integrity and usability. All data processing and cleaning took place within the R statistical programming environment²⁹.

First, we harmonized data indicating missingness and other uninterpretable field values (i.e., “***”) to the standard missing data value in R (i.e., NA values). Although our data requests specified our interest in imported wildlife or wildlife products, a small proportion of the data we received (<5%) did not contain values of “I” (indicating “import”) in the ‘import_export’ data field. Because we could not confidently assess whether these records represented imported products, we removed them from the dataset. We also discovered a set of near-duplicate records from one shipment year (2013). These rows were exact duplicates of one another except for the ‘value’ field; one portion of the data for these near-duplicate matches recorded missing data for the ‘value’ field, while the other portion recorded numeric values. Given that all of the records containing missing ‘value’ data in this near-duplicate set were from the same raw data file, we deduced that we received duplicated information for this set of records, with one version of the records containing the ‘value’ data that was missing in the other. We removed the near-duplicate records that contained missing ‘value’ data, retaining the near-duplicates with valid ‘value’ data.
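The three steps in this paragraph (harmonizing missing values, keeping only confirmed imports, and dropping near-duplicates that lack a ‘value’) can be sketched on a few made-up rows. This is a minimal Python illustration, not the authors' R workflow: the helper names, the sample rows, and the non-import code "E" are hypothetical, and only the field names, the “***” marker, and the “I” import code come from the text. The sketch also does not model the authors' check that the near-duplicates came from the same raw data file.

```python
# Toy sketch of the three cleaning steps; not the authors' R code.
MISSING_MARKERS = {"***"}  # marker named in the text


def harmonize_missing(row):
    """Replace uninterpretable markers with None (Python's analogue of R's NA)."""
    return {k: (None if v in MISSING_MARKERS else v) for k, v in row.items()}


def key_without_value(row):
    """Hashable identity of a row that ignores the 'value' field."""
    return tuple((k, row[k]) for k in sorted(row) if k != "value")


def drop_near_duplicates(rows):
    """Drop a row with missing 'value' if an otherwise identical row has one."""
    has_value = {key_without_value(r) for r in rows if r["value"] is not None}
    return [
        r for r in rows
        if r["value"] is not None or key_without_value(r) not in has_value
    ]


def clean_records(rows):
    rows = [harmonize_missing(r) for r in rows]
    rows = [r for r in rows if r["import_export"] == "I"]  # keep confirmed imports
    return drop_near_duplicates(rows)


# Hypothetical rows; "E" stands in for any non-import code.
raw = [
    {"control_number": "A1", "description": "MEA", "import_export": "I", "value": "***"},
    {"control_number": "A1", "description": "MEA", "import_export": "I", "value": "120"},
    {"control_number": "B2", "description": "MEA", "import_export": "E", "value": "50"},
    {"control_number": "C3", "description": "MEA", "import_export": "I", "value": "***"},
]
cleaned = clean_records(raw)
for row in cleaned:
    print(row["control_number"], row["value"])
```

Note that record C3 survives even though its ‘value’ is missing, because no otherwise identical row supplies the value; only the redundant copy of A1 is dropped.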

We then cleaned data fields that should have been restricted to specific, coded values, comparing the values observed in the raw data with valid codes as indicated by USFWS code key documentation (available in our Zenodo and GitHub repositories). We converted irregular code entries to valid codes where it was possible to do so with reasonable confidence given the data context. In some cases, irregular code entries were apparent typographic errors. For example, in the ‘description’ field, “MEA” is the code used to indicate a meat product. We therefore assumed that records with a ‘description’ entry of “MAE” and a declared unit of kilograms were likely erroneous entries of the valid code “MEA”. In other cases, irregular codes seemed to be data entry errors resulting from subtle differences between commonly used abbreviations and the actual, valid codes for LEMIS data. For example, valid codes for the ‘unit’ field are two characters long; we thus assumed any ‘unit’ entries of “L” were meant to indicate a unit of liters, which should be expressed with the valid code “LT”. When we were unable to reasonably infer a particular data entry error, we converted irregular codes to a value of “non-standard value”. We also generated a ‘cleaning_notes’ field in the final dataset which preserves the original values that were converted to “non-standard value” for users who wish to attempt interpretation of the raw data. The following fields were cleaned in this manner: ‘description’, ‘unit’, ‘country_origin’, ‘country_imp_exp’, ‘purpose’, ‘source’, ‘action’, ‘disposition’, and ‘port’ (Table 1).
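The code-cleaning logic described above can be illustrated with a small Python sketch. It is an approximation under stated assumptions: the unit code "KG" for kilograms, the sets of valid codes, the helper name `clean_row`, and the wording written into ‘cleaning_notes’ are all hypothetical, while “MEA”, “MAE”, “L”, “LT”, and “non-standard value” are taken from the text.

```python
# Hypothetical, heavily truncated code keys; the real keys are in the USFWS documentation.
VALID_DESCRIPTIONS = {"MEA"}   # code named in the text
VALID_UNITS = {"LT", "KG"}     # "LT" is from the text; "KG" is an assumed code
UNIT_FIXES = {"L": "LT"}       # correction described in the text
NON_STANDARD = "non-standard value"


def clean_row(row):
    """Return a cleaned copy of a row, recording unrecoverable originals in notes."""
    row = dict(row)
    notes = []
    # "MAE" declared in kilograms is treated as a typo for the meat code "MEA".
    if row["description"] == "MAE" and row["unit"] == "KG":
        row["description"] = "MEA"
    # Common abbreviations are mapped onto the valid two-character codes.
    row["unit"] = UNIT_FIXES.get(row["unit"], row["unit"])
    # Anything still invalid is replaced, with the original value preserved.
    for field, valid in (("description", VALID_DESCRIPTIONS), ("unit", VALID_UNITS)):
        if row[field] is not None and row[field] not in valid:
            notes.append(f"Original value in '{field}' column: {row[field]}")
            row[field] = NON_STANDARD
    row["cleaning_notes"] = "; ".join(notes) or None
    return row


typo = clean_row({"description": "MAE", "unit": "KG"})
abbreviation = clean_row({"description": "MEA", "unit": "L"})
unknown = clean_row({"description": "MEA", "unit": "XQ"})
print(typo, abbreviation, unknown, sep="\n")
```

Recoverable entries are corrected silently, whereas the unrecognized unit "XQ" becomes “non-standard value” and its original form is kept in ‘cleaning_notes’ for users who want to attempt their own interpretation.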

Next, we attempted to clean disposition date data. The ‘shipment_date’ field indicates the date of shipment arrival, and ‘disposition_date’ records the date on which a customs decision (i.e., to clear, seize, abandon, or re-export) for the shipment was reached. While the shipment dates in the raw data we received were strictly within the bounds of the years requested (i.e., 2000–2014), likely because this field was used by the USFWS to pull the data, the disposition date field was more varied. Some disposition date entries were obviously erroneous (e.g., those listing dates in the future) while others were likely artifacts resulting from data storage and sharing processes (e.g., when using Microsoft Excel files, blank values in date-formatted fields can sometimes be converted to unintended default date values). The vast majority of raw records in the dataset (>95%) list a disposition date identical to or later than the shipment date. Because logically a disposition decision should occur after a product is received, where there were obvious conflicts between the shipment date and disposition date, we assumed disposition dates should refer to a date on or after the shipment date. Thus, we cleaned all obviously problematic disposition dates, particularly those lying outside the time period 2000–2014. Note, however, that disposition dates in 2015 may be sensible and valid for shipments received late in 2014.
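As a rough illustration of the date checks described above, the following Python sketch labels a disposition date relative to its shipment date. The function name, the category labels, and the choice of the end of 2015 as the latest plausible disposition date are assumptions made for this example; the text does not state the exact rule the authors applied or what value replaced problematic dates.

```python
from datetime import date

STUDY_START = date(2000, 1, 1)


def classify_disposition(shipment_date, disposition_date,
                         latest_plausible=date(2015, 12, 31)):
    """Label a disposition date; the cut-off dates are illustrative assumptions."""
    if disposition_date is None:
        return "missing"
    if disposition_date < STUDY_START or disposition_date > latest_plausible:
        return "outside study window"
    if disposition_date < shipment_date:
        return "precedes shipment"
    return "plausible"


examples = [
    (date(2014, 12, 20), date(2015, 1, 5)),   # late-2014 shipment cleared in 2015
    (date(2005, 3, 1), date(1900, 1, 1)),     # likely spreadsheet default date
    (date(2005, 3, 1), date(2030, 1, 1)),     # date in the future
    (date(2005, 3, 1), date(2005, 2, 1)),     # decision before arrival
]
labels = [classify_disposition(s, d) for s, d in examples]
print(labels)
```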

Finally, we cleaned and supplemented taxonomic information in the LEMIS data. Using the provided ‘species_code’ field and USFWS keys, we were able to derive a ‘taxa’ field for the vast majority (>99%) of records (Table 1). However, this USFWS-defined ‘taxa’ categorization, while useful for general data inspection, does not correspond to a consistent taxonomic concept. Therefore, we sought to designate a taxonomic class for all LEMIS data where possible. We used the R package taxadb to automatically gather class information³⁰, drawing primarily from the taxonomic classification provided by the Catalogue of Life (COL) database. Where the COL data did not allow for automated class-level taxonomic calls, we drew from the Integrated Taxonomic Information System (ITIS), harmonizing data with the COL class categorization. Furthermore, the lack of automatic class-level taxonomic assignment for some taxonomic entries alerted us to raw values potentially in need of correction, initiating an iterative data cleaning process. First, as part of this cleaning, vague or missing taxonomic information in the ‘species’ and ‘subspecies’ fields was converted to “sp.” values for consistency. Next, we manually inspected and corrected unique combinations of the ‘genus’, ‘species’, ‘subspecies’, ‘specific_name’, and ‘generic_name’ fields (Table 1). In many cases, errors represented minor misspellings (e.g., Philetarius socius instead of Philetairus socius) or inversions of the genus and species names. Finally, where we were still unable to recover automated class-level information, we manually assigned class when data specificity and context from other fields allowed. Many of these data represented cases where the LEMIS data use alternate taxonomy that is not recognized by either the COL or the ITIS. Nonetheless, the data provided often enabled unambiguous class-level assignment.

Data Records

We present >5.5 million USFWS LEMIS wildlife or wildlife product records spanning 15 years and 28 data fields²⁸. These records, made available in a Zenodo data repository, were derived from >2 million unique shipments processed by USFWS during the time period and represent >3.2 billion live organisms (Fig. 1). We provide the final cleaned data as a single comma-separated value file. Original raw data as provided by the USFWS are also available in the Zenodo data repository. Although relatively large (~1 gigabyte), the cleaned data file can be imported into a software environment of choice for data analysis. Alternatively, our R package provides access to a release of the same cleaned dataset but with a data download and manipulation framework that is designed to work well with this large dataset (see “Code availability” section). Finally, both the Zenodo data repository and the R package contain a metadata file describing each of the data fields (presented here as Table 1) as well as a lookup table to retrieve full values for the abbreviated codes used throughout the dataset.

Twenty-three of the final data fields are cleaned versions of the original data provided by the USFWS: ‘control_number’, ‘species_code’, ‘genus’, ‘species’, ‘subspecies’, ‘specific_name’, ‘generic_name’, ‘description’, ‘quantity’, ‘unit’, ‘value’, ‘country_origin’, ‘country_imp_exp’, ‘purpose’, ‘source’, ‘action’, ‘disposition’, ‘disposition_date’, ‘shipment_date’, ‘import_export’, ‘port’, ‘us_co’, and ‘foreign_co’ (Table 1). To these original data fields, we added five: ‘taxa’, ‘class’, and ‘cleaning_notes’ (all as previously described), as well as ‘disposition_year’ and ‘shipment_year’ (derived from ‘disposition_date’ and ‘shipment_date’, respectively). To briefly describe the LEMIS data fields, we consider ‘control_number’ to represent a unique individual shipment processed by the USFWS (Fig. 1). Different wildlife products contained within the same shipment may be represented in the LEMIS data by multiple data rows, all of which share a common ‘control_number’. Consistent with this interpretation, all rows of data sharing the same ‘control_number’ share the same country of shipment and shipment date. Different products within the same shipment may differ in other ways, however. For example, they may have been originally derived from different countries and may have different disposition histories. Next, the ‘species_code’, ‘taxa’, ‘class’, ‘genus’, ‘species’, ‘subspecies’, ‘specific_name’, and ‘generic_name’ columns all provide information serving to identify the wildlife or wildlife product (Table 1). While the ‘genus’ column largely corresponds to taxonomic genus, sometimes higher-level categorizations were provided in this field, apparently when the genus was unknown. As a result, there are 17,211 unique species names in the dataset (i.e., distinct combinations of ‘genus’ and ‘species’), and when generic identifiers are excluded (e.g., removal of records where the ‘genus’ was reported as “Tropical fish”, the ‘species’ value was given only as “sp.”, etc.), 12,924 unique species names remain (Table 2). Of the species names in this restricted set of standardized binomial nomenclature, only 3,168 (24.5%) are currently subject to reporting by CITES parties. However, we acknowledge the novelty of the LEMIS dataset may be slightly overestimated to the degree that synonymous taxa appear in the data. Using our automated taxonomic calling workflow, we were able to assign ‘class’ information to >92% of LEMIS records, which represent 63 biological classes (Table 2). All further data fields besides ‘cleaning_notes’ serve to detail the wildlife product, as outlined in Table 1. Although we consistently requested product ‘value’ information from the USFWS, it was not provided for four years of LEMIS data (2008–2010 and 2014). Finally, note that the ‘us_co’ and ‘foreign_co’ fields indicate the US importing and foreign exporting party of the shipment, respectively. Where USFWS redacted this information due to privacy concerns, values are listed as “EXEMPTIONS 6 AND 7(C)”, referring to privacy exemptions under FOIA³¹. Overall, 2.2% of records have the importing party redacted, and 0.5% have the exporting party redacted; 17.7% and 6.9% of records are missing importer and exporter values, respectively.
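The distinction between records, shipments, and species names drawn above can be made concrete with a short Python sketch on invented rows. The filter used to exclude generic identifiers (a single-word ‘genus’ and a ‘species’ other than “sp.”) is an assumed heuristic for illustration and is not necessarily the rule the authors used.

```python
# Invented rows using field names from Table 1.
rows = [
    {"control_number": "1", "genus": "Python", "species": "reticulatus"},
    {"control_number": "1", "genus": "Python", "species": "reticulatus"},
    {"control_number": "2", "genus": "Tropical fish", "species": "sp."},
    {"control_number": "3", "genus": "Philetairus", "species": "socius"},
    {"control_number": "3", "genus": "Python", "species": "sp."},
]

n_records = len(rows)
# Rows sharing a 'control_number' belong to one shipment.
n_shipments = len({r["control_number"] for r in rows})
# A "species name" is a distinct combination of 'genus' and 'species'.
all_names = {(r["genus"], r["species"]) for r in rows}


def is_binomial(genus, species):
    """Assumed heuristic: one-word genus and a species that is not 'sp.'."""
    return " " not in genus and species != "sp."


binomials = {name for name in all_names if is_binomial(*name)}
print(n_records, n_shipments, len(all_names), len(binomials))
```

In this toy example five records collapse to three shipments, and four distinct names reduce to two once the generic identifiers are excluded, mirroring the step from 17,211 to 12,924 names in the full dataset.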

Table 2 Number of unique LEMIS species names and records, disaggregated by biological class.

Technical Validation

Following data cleaning, which primarily aimed to ensure that all relevant data fields contained valid USFWS-defined codes, we validated our final dataset by plotting the distribution of unique values and value string lengths across all data fields. These checks serve to verify that fields contain only expected values/codes and that the string lengths of entries in free text fields (e.g., ‘genus’, ‘species’) are not abnormally short or long, which could indicate problematic entries.
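A tabular analogue of these validation checks can be sketched in Python. The authors plotted the distributions, whereas this example simply counts them; the helper name `field_profile`, the sample rows, the unit code "KG", and the three-character threshold for a suspiciously short genus are all assumptions for illustration.

```python
from collections import Counter


def field_profile(rows, field):
    """Count unique values and string lengths for one field, ignoring missing data."""
    values = [r[field] for r in rows if r[field] is not None]
    return Counter(values), Counter(len(v) for v in values)


# Invented rows; "P" mimics a truncated free-text entry.
rows = [
    {"unit": "KG", "genus": "Python"},
    {"unit": "LT", "genus": "Philetairus"},
    {"unit": "KG", "genus": "P"},
    {"unit": None, "genus": "Abrocoma"},
]

unit_values, unit_lengths = field_profile(rows, "unit")
genus_values, genus_lengths = field_profile(rows, "genus")

# Coded fields should show a single string length; free text should have no extremes.
suspicious_genera = [g for g in genus_values if len(g) < 3]
print(unit_values, unit_lengths, suspicious_genera)
```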

Usage Notes

While we did remove what we believe to be erroneous near-duplicate records in the dataset (as described in the Methods), end users should note that exact duplicate records remain. This is because even exact duplicate records may represent accurate data, especially in cases where the recorded ‘quantity’ value is 1. For example, in the final dataset, ‘control_number’ 2000732392 records the importation of a shipment of garments from France which were themselves derived from reticulated pythons (Python reticulatus) originating in Malaysia. Within this ‘control_number’ value (representing one shipment), a single data record, reporting a ‘quantity’ of 1 and a ‘value’ of $1,458, is duplicated 25 times. Our assumption is that these garments, and similar duplicate products, were individually packaged but shipped together such that officers at the port of entry recorded exact duplicate data entries to capture the total product volume within the shipment. In other cases, similar information may have been aggregated during data entry (e.g., recording the identical product data as a single record with a quantity of 25). We verified that all duplicate records that remain in the data originated from the same raw data file. This indicates that these records were provided as such by USFWS and ensures they were not artifacts generated through our data processing pipeline (e.g., by combining data across multiple raw data files that contained overlapping information). Thus, we believe we have made the most conservative data processing decision by preserving the original form of the data unless we had good reason to perform data cleaning. Nevertheless, users should be aware of the potential presence of duplicate records in any data subset of interest, and these records should be scrutinized for inclusion in analyses given the specific study objectives.
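To see why the treatment of exact duplicates matters, consider a toy Python version of the garment example. The rows below are a simplified stand-in with only three fields and 25 identical copies; the second shipment and the helper name `duplicate_groups` are hypothetical.

```python
from collections import Counter


def duplicate_groups(rows):
    """Return each distinct row that appears more than once, with its count."""
    counts = Counter(tuple(sorted(r.items())) for r in rows)
    return {row: n for row, n in counts.items() if n > 1}


garment = {"control_number": "2000732392", "quantity": 1, "value": 1458}
rows = [dict(garment) for _ in range(25)]
rows.append({"control_number": "X", "quantity": 3, "value": 10})  # invented row

groups = duplicate_groups(rows)

# Total quantity for the garment shipment under the two possible treatments.
shipment = [r for r in rows if r["control_number"] == "2000732392"]
total_keeping_duplicates = sum(r["quantity"] for r in shipment)
total_after_deduplication = sum(dict(t)["quantity"]
                                for t in {tuple(sorted(r.items())) for r in shipment})
print(len(groups), total_keeping_duplicates, total_after_deduplication)
```

If the duplicates reflect individually recorded items, removing them would understate this shipment's quantity by a factor of 25, which is why such records deserve scrutiny rather than automatic deletion.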

The dataset provides multiple, complementary data fields reporting taxonomic identity that deserve special attention. Generally, users will want to consider the ‘taxa’ and ‘class’ fields in conjunction to analyze trade data for large taxonomic groups. While ‘class’ is typically a more specific taxonomic designation, ‘taxa’ has fewer missing values in the final dataset (‘class’ information available for >92% of LEMIS records; ‘taxa’ information available for >99% of LEMIS records). Which field deserves greater focus will depend on the analytical goals, recognizing that ‘taxa’ does not represent a consistent biological classification scheme but rather a general heuristic for categorizing groups of organisms in the trade. For example, the ‘taxa’ category “fish” encompasses LEMIS records representing six distinct ‘class’ values: Actinopterygii, Cephalaspidomorphi, Elasmobranchii, Holocephali, Myxini, and Sarcopterygii. Clearly, ‘class’ is biologically meaningful and may help users rapidly narrow their analytical focus, but users should keep in mind that there are records within the ‘taxa’ category of “fish” for which ‘class’ could not be unambiguously assigned. For some research questions, these data may also be of interest. Similarly, the ‘taxa’ categories of “coral”, “crustacean”, “plant”, and “shell” all map onto multiple distinct ‘class’ values yet are also useful for the broad categorization of records when ‘class’ could not be identified.
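The practical consequence of using ‘taxa’ and ‘class’ together can be shown with a brief Python sketch. The rows are invented, and the label "unassigned" for records lacking a ‘class’ is an assumption of this example; the six fish classes are those listed in the text.

```python
from collections import Counter

FISH_CLASSES = {
    "Actinopterygii", "Cephalaspidomorphi", "Elasmobranchii",
    "Holocephali", "Myxini", "Sarcopterygii",
}

# Invented records; None marks a 'class' that could not be assigned.
rows = [
    {"taxa": "fish", "class": "Actinopterygii"},
    {"taxa": "fish", "class": "Elasmobranchii"},
    {"taxa": "fish", "class": None},
    {"taxa": "coral", "class": None},
]

# Cross-tabulate the two fields so that unassigned records stay visible.
counts = Counter((r["taxa"], r["class"] or "unassigned") for r in rows)

fish_by_taxa = sum(n for (taxa, _), n in counts.items() if taxa == "fish")
fish_by_class_only = sum(1 for r in rows if r["class"] in FISH_CLASSES)
print(counts, fish_by_taxa, fish_by_class_only)
```

Filtering on ‘class’ alone finds two fish records here, while ‘taxa’ finds three, because one fish record has no class assignment.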

In addition, users must be cognizant of the fact that taxa may be represented by multiple taxonomic synonyms. While we sought to provide high-level taxonomic information (e.g., class assignments) that would help users in generating a relevant data subset for analysis, we did not attempt to synonymize species-level names given the large number of taxa present in the LEMIS data and the constantly shifting (and contentious) landscape of preferred taxonomic nomenclature. End users will need to apply their expertise on taxa of interest in order to generate sound taxonomic delineations where synonymies exist in the data.

Furthermore, data users should be cautious about their interpretation of the ‘shipment_date’ and ‘disposition_date’ fields. As previously mentioned, while ‘shipment_date’ entries within the raw data we received fell completely within the time period of 2000–2014, ‘disposition_date’ ranged more widely. Even following data cleaning to harmonize ‘disposition_date’ entries that were obviously problematic, significant discrepancies between ‘shipment_date’ and ‘disposition_date’ still exist for some records in the final dataset. We have chosen to preserve these data as there is no clear cut-off at which differences between disposition date and shipment date become invalid. For example, dispositions that occur months after the declared shipment date could reflect the reality of product processing even though a large majority of records (>70%) indicate that disposition typically occurs within a week of the shipment date. Certainly, users should be wary of any disposition date values that precede the associated shipment date, as we are unaware how this could represent an accurate accounting of the product disposition process. However, for many potential analyses, differences in the date fields may not be a significant cause for concern because ‘shipment_date’ alone provides a sound index for those interested in temporal trends in wildlife trade.
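Users who wish to inspect these discrepancies themselves could compute the lag between the two date fields, as in this minimal Python sketch. The date pairs are invented, and the one-week window simply echoes the figure quoted above.

```python
from datetime import date

# Invented (shipment_date, disposition_date) pairs.
pairs = [
    (date(2010, 1, 1), date(2010, 1, 3)),
    (date(2010, 1, 1), date(2010, 1, 1)),
    (date(2010, 1, 1), date(2010, 6, 1)),    # processed months later
    (date(2010, 1, 10), date(2010, 1, 5)),   # disposition precedes shipment
]

# Lag in days; negative values warrant particular caution.
lags = [(disposition - shipment).days for shipment, disposition in pairs]
share_within_week = sum(0 <= lag <= 7 for lag in lags) / len(lags)
negative_lags = [lag for lag in lags if lag < 0]
print(lags, share_within_week, negative_lags)
```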

Finally, data users should be careful about interpreting the ‘country_imp_exp’ and ‘country_origin’ data fields. These fields are meant to represent the most recent location (‘country_imp_exp’) and point of origin (‘country_origin’) for the wildlife or wildlife products, but data in these fields are derived from import documents completed by the importer and are therefore not verifiable. Complex import/export histories can result in surprising entries for these fields²⁴. For example, rodents of the genus Abrocoma are native to South America. Interestingly then, our data describe a shipment of garments derived from Abrocoma sp. (‘control_number’ 2008273877) with a ‘country_imp_exp’ of Switzerland and a ‘country_origin’ of Hungary. The apparent contradiction in this case is resolved by recognizing that the ‘source’ column indicates these animals were derived from a domestic ranching operation rather than being taken directly from the wild. However, for those interested in the true origins of wildlife and wildlife products that are sourced from the wild (~78% of our data records), the ‘country_origin’ field deserves special scrutiny to ensure the recorded country is in fact a biologically-realistic point of origin for the species in question. Users seeking distribution information on focal organisms may wish to consult the IUCN Red List of Threatened Species (https://www.iucnredlist.org/) and Species+ (https://speciesplus.net/) resources.
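One way to apply this scrutiny is to compare ‘country_origin’ against a user-supplied native range for wild-sourced records, as sketched below in Python. Everything specific here is an assumption for illustration: the source codes "W" (wild) and "R" (ranched), the two-letter country codes, the helper name `needs_scrutiny`, and the native-range lookup, which a user would need to build from resources such as those named above.

```python
# Hypothetical lookup; a real one would be compiled from range data.
NATIVE_RANGE = {"Abrocoma": {"AR", "BO", "CL", "PE"}}
WILD = "W"  # assumed code for wild-sourced specimens


def needs_scrutiny(row):
    """Flag wild-sourced records whose origin lies outside the known native range."""
    native = NATIVE_RANGE.get(row["genus"])
    if native is None or row["source"] != WILD or row["country_origin"] is None:
        return False  # nothing to compare against, or not wild-sourced
    return row["country_origin"] not in native


rows = [
    {"genus": "Abrocoma", "source": "R", "country_origin": "HU"},  # ranched: not flagged
    {"genus": "Abrocoma", "source": "W", "country_origin": "HU"},  # invented wild record
    {"genus": "Abrocoma", "source": "W", "country_origin": "CL"},
    {"genus": "Python", "source": "W", "country_origin": "MY"},    # no range data supplied
]
flags = [needs_scrutiny(r) for r in rows]
print(flags)
```

A flag is only a prompt for closer inspection, since complex import/export histories can produce valid but surprising entries.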

Understanding the appropriate interpretation of the ‘country_imp_exp’ and ‘country_origin’ fields also illuminates how seemingly incongruous records listing the US as the ‘country_origin’ for a US import can in fact be valid data. For example, ‘control_number’ 2005537093 represents a shipment of shoe products derived from white-tailed deer (Odocoileus virginianus). The ‘country_origin’ is recorded as the US, where the wildlife was presumably originally harvested, while Italy is recorded as the ‘country_imp_exp’ since this was the proximate source of the shoe products. Hence, for wildlife products where some part of the manufacturing process takes place abroad, it is indeed expected that raw materials derived from US wildlife are shipped internationally, thereby resulting in LEMIS data that indicate the US importation of a wildlife product that was originally sourced from the US.

Code availability

Our custom R package, which provides access to the data described here, is publicly available at https://github.com/ecohealthalliance/lemis. Installation of the package and subsequent download of the data enables efficient, on-disk manipulation of the entire cleaned dataset^32,33. Basic package usage is outlined in the main package README file on the GitHub site. The code implementation of the data cleaning process is also available in the package codebase (via the ‘data-raw’ directory) and is outlined in the associated developer README file. These scripts span the entirety of our data processing and cleaning workflow, from importation and collation of the raw USFWS LEMIS data files through to generation of the single, cleaned data file as discussed in this manuscript. Thus, the scripts serve as transparent, reproducible documentation of our data processing in full.

References

Bennett, E. L. et al. Hunting the world’s wildlife to extinction. Oryx 36, 328–329, https://doi.org/10.1017/S0030605302000637 (2002).
Article Google Scholar
Rosser, A. M. & Mainka, S. A. Overexploitation and species extinctions. Conserv. Biol. 16, 584–586, https://doi.org/10.1046/j.1523-1739.2002.01635.x (2002).
Article Google Scholar
Hoffmann, M. et al. The impact of conservation on the status of the world’s vertebrates. Science 330, 1503–1509, https://doi.org/10.1126/science.1194442 (2010).
Article ADS CAS PubMed Google Scholar
Maxwell, S. L., Fuller, R. A., Brooks, T. M. & Watson, J. E. M. Biodiversity: The ravages of guns, nets and bulldozers. Nature 536, 143–145, https://doi.org/10.1038/536143a (2016).
Article ADS CAS PubMed Google Scholar
Ripple, W. J. et al. Bushmeat hunting and extinction risk to the world’s mammals. Roy. Soc. Open Sci. 3, 160498, https://doi.org/10.1098/rsos.160498 (2016).
Article ADS Google Scholar
Tingley, M. W., Harris, J. B. C., Hua, F., Wilcove, D. S. & Yong, D. L. The pet trade’s role in defaunation. Science 356, 916, https://doi.org/10.1126/science.aan5158 (2017).
Article ADS CAS PubMed Google Scholar
Scheffers, B. R., Oliveira, B. F., Lamb, I. & Edwards, D. P. Global wildlife trade across the tree of life. Science 366, 71–76, https://doi.org/10.1126/science.aav5327 (2019).
Article ADS CAS PubMed Google Scholar
Smith, K. F. et al. Reducing the risks of the wildlife trade. Science 324, 594–595, https://doi.org/10.1126/science.1174460 (2009).
Article CAS PubMed Google Scholar
Joppa, L. N. et al. Filling in biodiversity threat gaps. Science 352, 416–418, https://doi.org/10.1126/science.aaf3565 (2016).
Article ADS CAS PubMed Google Scholar
Rosen, G. E. & Smith, K. F. Summarizing the evidence on the international trade in illegal wildlife. EcoHealth 7, 24–32, https://doi.org/10.1007/s10393-010-0317-y (2010).
Article PubMed PubMed Central Google Scholar
Smith, K. M. et al. Summarizing US wildlife trade with an eye toward assessing the risk of infectious disease introduction. EcoHealth 14, 29–39, https://doi.org/10.1007/s10393-017-1211-7 (2017).
Article CAS PubMed PubMed Central Google Scholar
Blundell, A. G. & Mascia, M. B. Discrepancies in reported levels of international wildlife trade. Conserv. Biol. 19, 2020–2025, https://doi.org/10.1111/j.1523-1739.2005.00253.x (2005).
Article Google Scholar
Berec, M., Vršecká, L. & Šetlíková, I. What is the reality of wildlife trade volume? CITES Trade Database limitations. Biol. Conserv. 224, 111–116, https://doi.org/10.1016/j.biocon.2018.05.025 (2018).
Article Google Scholar
Pavitt, A. et al. What is the reality of wildlife trade volume? Understanding CITES trade data — A response to Berec et al. Biol. Conserv. 230, 195–196, https://doi.org/10.1016/j.biocon.2018.12.006 (2019).
Article Google Scholar
Berec, M. & Šetlíková, I. Important step to understanding the CITES Trade Database: A reply to Pavitt et al. Biol. Conserv. 230, 197–198, https://doi.org/10.1016/j.biocon.2018.12.018 (2019).
Article Google Scholar
Robinson, J. E. & Sinovas, P. Challenges of analyzing the global trade in CITES-listed wildlife. Conserv. Biol. 32, 1203–1206, https://doi.org/10.1111/cobi.13095 (2018).
Article PubMed Google Scholar
Eskew, E. A., Ross, N., Zambrana-Torrelio, C. & Karesh, W. B. The CITES Trade Database is not a “global snapshot” of legal wildlife trade: Response to Can et al., 2019. Glob. Ecol. Conserv. 18, e00631, https://doi.org/10.1016/j.gecco.2019.e00631 (2019).
Article Google Scholar
Janssen, J. & Leupen, B. T. C. Traded under the radar: poor documentation of trade in nationally-protected non-CITES species can cause fraudulent trade to go undetected. Biodivers. Conserv. 28, 2797–2804, https://doi.org/10.1007/s10531-019-01796-7 (2019).
Baker, S. E. et al. Rough trade: animal welfare in the global wildlife trade. BioScience 63, 928–938, https://doi.org/10.1525/bio.2013.63.12.6 (2013).
Hulme, P. E. Trade, transport and trouble: managing invasive species pathways in an era of globalization. J. Appl. Ecol. 46, 10–18, https://doi.org/10.1111/j.1365-2664.2008.01600.x (2009).
Chapman, D., Purse, B. V., Roy, H. E. & Bullock, J. M. Global trade networks determine the distribution of invasive non-native species. Glob. Ecol. Biogeogr. 26, 907–917, https://doi.org/10.1111/geb.12599 (2017).
García-Díaz, P., Ross, J. V., Woolnough, A. P. & Cassey, P. The illegal wildlife trade is a likely source of alien species. Conserv. Lett. 10, 690–698, https://doi.org/10.1111/conl.12301 (2017).
Karesh, W. B., Cook, R. A., Bennett, E. L. & Newcomb, J. Wildlife trade and global disease emergence. Emerg. Infect. Dis. 11, 1000–1002, https://doi.org/10.3201/eid1107.050194 (2005).
Pavlin, B. I., Schloegel, L. M. & Daszak, P. Risk of importing zoonotic diseases through wildlife trade, United States. Emerg. Infect. Dis. 15, 1721–1726, https://doi.org/10.3201/eid1511.090467 (2009).
Schloegel, L. M. et al. Magnitude of the US trade in amphibians and presence of Batrachochytrium dendrobatidis and ranavirus infection in imported North American bullfrogs (Rana catesbeiana). Biol. Conserv. 142, 1420–1426, https://doi.org/10.1016/j.biocon.2009.02.007 (2009).
Herrel, A. & van der Meijden, A. An analysis of the live reptile and amphibian trade in the USA compared to the global trade in endangered species. Herpetol. J. 24, 103–110 (2014).
Gray, M. J. et al. Batrachochytrium salamandrivorans: the North American response and a call for action. PLoS Pathog. 11, e1005251, https://doi.org/10.1371/journal.ppat.1005251 (2015).
Eskew, E. A. et al. United States LEMIS wildlife trade data curated by EcoHealth Alliance (Version 1.1.0). Zenodo, https://doi.org/10.5281/zenodo.3565869 (2019).
R Core Team. R: a language and environment for statistical computing, https://www.r-project.org/ (R Foundation for Statistical Computing, 2019).
Boettiger, C., Norman, K., Poelen, J. & Chamberlain, S. Taxadb: a high-performance local taxonomic database interface, https://github.com/cboettig/taxadb (2019).
Office of Information Policy (OIP), United States Department of Justice. Freedom of Information Act Frequently Asked Questions (FAQ), https://www.foia.gov/faq.html (2019).
Klik, M. fst: lightning fast serialization of data frames for R, https://cran.r-project.org/package=fst (2019).
Müller, K. fstplyr: a ‘dplyr’ interface to ‘fst’, https://github.com/krlmlr/fstplyr (2018).

Acknowledgements

The authors wish to express their sincere thanks to the United States Fish and Wildlife Service and the numerous employees whose prompt, professional service over the years has helped make these data more widely available to the scientific community. The work in this paper was supported by: a National Science Foundation Human and Social Dynamics ‘Agents of Change’ award (SES-HSD-AOC “Human-Related Factors Affecting Emerging Infectious Diseases”, BCS-0826779 and BCS-0826840), a National Institutes of Health NIGMS grant (1R01GM100471-01, “MASpread”), a Joint NSF-NIH-USDA/BBSRC Ecology and Evolution of Infectious Diseases award (NSF DEB 1414374, BBSRC BB/M008894/1, “US-UK Collab: Risks of Animal and Plant Infectious Diseases through Trade (RAPID Trade)”), the United States Agency for International Development (USAID) Emerging Pandemic Threats PREDICT project, and core funding from EcoHealth Alliance.

Author information

Authors and Affiliations

EcoHealth Alliance, 460 West 34th Street – Suite 1701, New York, New York, 10001, USA
Evan A. Eskew, Allison M. White, Noam Ross, Kristine M. Smith, Carlos Zambrana-Torrelio, William B. Karesh & Peter Daszak
Department of Ecology and Evolutionary Biology, Division of Biology and Medicine, Brown University, Providence, Rhode Island, 02912, USA
Katherine F. Smith
IUCN Species Survival Commission, Rue Mauverney 28, 1196, Gland, Switzerland
Jon Paul Rodríguez
Centro de Ecología, Instituto Venezolano de Investigaciones Científicas, Apartado 20632, Caracas, 1020-A, Venezuela
Jon Paul Rodríguez
Provita, Apartado 47552, Caracas 1041-A, Venezuela
Jon Paul Rodríguez

Authors

Evan A. Eskew
Allison M. White
Noam Ross
Kristine M. Smith
Katherine F. Smith
Jon Paul Rodríguez
Carlos Zambrana-Torrelio
William B. Karesh
Peter Daszak

Contributions

K.M.S., A.M.W., K.F.S., J.P.R., C.Z.-T., W.B.K. and P.D. designed, drafted, and filed Freedom of Information Act requests. E.A.E., A.M.W. and C.Z.-T. made key contributions to the LEMIS data processing and cleaning workflow. N.R. developed and maintains the R package for data access. E.A.E. drafted the manuscript, and all authors were involved in editing and approving the final manuscript.

Corresponding authors

Correspondence to Evan A. Eskew or Peter Daszak.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

The Creative Commons Public Domain Dedication waiver http://creativecommons.org/publicdomain/zero/1.0/ applies to the metadata files associated with this article.

About this article

Cite this article

Eskew, E.A., White, A.M., Ross, N. et al. United States wildlife and wildlife product imports from 2000–2014. Sci Data 7, 22 (2020). https://doi.org/10.1038/s41597-020-0354-5

Received: 24 September 2019
Accepted: 13 December 2019
Published: 16 January 2020
DOI: https://doi.org/10.1038/s41597-020-0354-5
