The PAD-US-AR dataset: Measuring accessible and recreational parks in the contiguous United States

Browning, Matthew H. E. M.; Rigolon, Alessandro; Ogletree, Scott; Wang, Ruoyu; Klompmaker, Jochem O.; Bailey, Christopher; Gagnon, Ryan; James, Peter

doi:10.1038/s41597-022-01857-7

Download PDF

Data Descriptor
Open access
Published: 16 December 2022

The PAD-US-AR dataset: Measuring accessible and recreational parks in the contiguous United States

Scientific Data volume 9, Article number: 773 (2022) Cite this article

3256 Accesses
3 Citations
20 Altmetric
Metrics details

Subjects

Abstract

Most spatial epidemiological studies of nature-health relationships use generalized greenspace measures. For instance, coarse-resolution spatial data containing normalized difference vegetative index (NDVI) values are prominent despite criticisms, such as the inability to restrain exposure estimates to public and private land. Non-threatening natural landscapes can improve health by building capacities for health-promoting behaviors. Recreational and accessible parks may best activate such behaviors. We curated the Parks and Protected Areas Database of the U.S. (PAD-US) to identify parks that are accessible for outdoor recreation. Our title adds “AR” to “PAD-US” where A = Accessible and R = Recreational. We validated the PAD-US-AR by comparisons with greenspace datasets and sociodemographics, which demonstrated its uniqueness from other commonly employed metrics of nature exposure. The PAD-US-AR presents reliable estimates of parks in the contiguous U.S. that are accessible for outdoor recreation. It has strong associations with home prices, shares of female residents, and shares of older residents. This dataset can accompany other nature exposure metrics in environmental epidemiology and allied research fields.

Measurement(s)	park
Technology Type(s)	Geographic Information System
Sample Characteristic - Environment	County • Tract
Sample Characteristic - Location	United States

Ghost roads and the destruction of Asia-Pacific tropical forests

Article Open access 10 April 2024

FSC-certified forest management benefits large mammals compared to non-FSC

Article Open access 10 April 2024

Expert review of the science underlying nature-based climate solutions

Article Open access 21 March 2024

Background & Summary

Exposure science has historically measured the toxic elements that negatively impact human health¹. However, nature-rich environments that are perceived as non-threatening can positively influence human health through multiple pathways, including mitigation of harmful exposures (i.e., traffic emissions, heat, and noise), restoring attention and reducing stress, and promoting healthy behaviors (i.e., physical activity, sleep, and social interaction)^2,3.

Research on the health benefits of nature has grown since the 1990s^4,5. Hundreds of health outcomes/endpoints have been studied, and at least 40 systematic reviews and meta-analyses have been conducted^6,7. Collectively, these studies suggest plant-rich environments (“greenspaces”) are associated with lower rates of all-cause and stroke-specific mortality, cardiovascular disease, poor mental health, low birth weight, lower levels of physical activity, and poor sleep quality⁶. Liquid-water environments (“bluespaces”) are associated with lower rates of all-cause mortality, obesity, low levels of physical activity, and poor mental health^8,9. Finally, solid-water environments (i.e., polar regions) and rock/mineral-dominated landscapes may have emotional and mental benefits and serve as medical treatments for respiratory conditions and allergies, although evidence for these landscapes being therapeutic is minimal¹⁰.

Despite the growing interest in nature and health, ongoing research would benefit from more sophisticated and precise exposure estimates^11,12,13. One simple and imprecise exposure measure of leafy green vegetative cover is the normalized difference vegetation index (NDVI) from moderate resolution (i.e., 30m² or 250m²) satellite imagery^2,14,15. The calculation of NDVI involves determining the ratio between near-infrared and red bands of light¹⁶. NDVI measures hold some value but are limited in several respects. In defense of NDVI, values have been ground-truthed by environmental psychologists and found to correspond to ratings of “greenness“¹⁷. Values can also be easily obtained from Google Earth Engine (GEE) at different global spatial and temporal scales. Values are assigned to cells laid out in a grid that overlap land cover types and ownership lines, allowing accurate availability estimates (i.e., magnitude of greenness around the home, work, school, and activity spaces) when available at fine spatial resolutions and coupled with parcel-level ownership data^15,18. Finally, many vegetation types can activate health-promoting pathways linking nature exposure with health². In critique of NDVI, values cannot indicate the type of, quality of, access to, and experience with vegetation or other forms of nature, such as liquid water, solid water (i.e., ice, snow), or rocks and minerals (i.e., deserts)^2,10,14. These limitations should not be surprising; after all, the calculation of NDVI is restricted to plants and emerged from agricultural science to estimate crop productivity and expected yield rather than environmental epidemiology¹⁶. Also limiting NDVI is its inability to identify design characteristics that activate instorative effects of nature-based recreation, such as physical activity along greenways and social interaction at picnic shelters^19,20. NDVI values are affected by complex interactions between other environmental factors with less relevance to nature exposure, such as season, slope, and precipitation^21,22 in addition to sensor type and the spatial unit size^23,24.

Another measure of green vegetation is remotely sensed tree canopy cover. Versions of these data at coarse or moderate resolutions can be easily retrieved (i.e., from the Multi-Resolution Land Characteristics [MRLC] National Land Cover Database [NLCD], see www.mrlc.gov). Higher-resolution data are becoming available from agencies, academic institutions, and commercial providers (i.e., www.earthdefine.com/treemap/, https://insights.sustainability.google/labs/treecanopy) through object-based image analysis and related processes^25,26,27. These data can measure this specific type of greenery by classifying vegetation over a certain height (e.g., >2 m) as a tree. Canopy cover is an appropriate nature exposure metric given its opportunities for health promotion through shade, reductions in urban heat island effects, and psychological restoration^28,29. However, like NDVI, tree canopy cover data do not provide information on public access and recreational opportunities. Such information must be available at high resolution and coupled with parcel-level data or spatial algorithms that differentiate visibility along public rights-of-ways (i.e., sideways in front yards)^30,31 to identify where trees might be available to the public for recreational opportunities under canopies.

Other advances in the calculation of nature exposure have been made. For instance, machine learning algorithms have been increasingly applied to 360-degree images along streets (e.g., Google Street View [GSV] or Baidu) or photographs looking out windows to calculate the percentage of visible greenery^{32,33,34,35,36}. Still, most nature exposure metrics remain limited to greenery or open water cover rather than quantification of recreational facilities (i.e., trails and lightning) that also promote health³⁷. The need for alternative datasets remains.

Nationwide data on the location of accessible natural areas managed for outdoor recreation (i.e., parks and protected areas) would be particularly useful. While the composition and facilities in parks vary, many are managed explicitly for the mechanisms explaining the health benefits of nature, including social interaction and physical activity^38,39^,cf.⁴⁰. For instance, natural landscapes in rural areas may be used for resource extraction or conservation with few opportunities for recreation⁴¹. Meanwhile, greenery in urban areas may be intended primarily for ecosystem services such as stormwater runoff, cooling, and noise/air pollution mitigation⁴². Parks across the urban-rural spectrum are important to consider alongside other nature exposure estimates.

Researchers are beginning to use some spatial nationwide datasets for measuring park cover in the U.S. (Table 1). USA Parks was developed by the Environmental Systems Research Institute (Esri) using proprietary data from that company and TomTom⁴³. Open Street Map (OSM) includes crowdsourced data tagged by keys (topic/category) and values (features). These can be selected to identify possible public natural areas⁴⁴. The accuracy and consistency of tags vary geographically and are often imprecise, making the identification of public natural areas difficult⁴⁵. ParkServe contains data on local parks in nearly 14,000 cities, towns, and communities in the USA and was curated by the Trust for Public Land (TPL)⁴⁶. Finally, the Parks and Protected Areas Database United States (PAD-US) is an initiative of the U.S. Geological Survey (USGS) with federal, state, and local partners⁴⁷. It hopes to inventory all protected areas, including public lands, and voluntarily provide private protected areas.

Table 1 Description of park cover datasets for the contiguous U.S.

Full size table

These currently available park datasets are limited in identifying where accessible and recreational parks exist. Most lack metadata on whether each land parcel is open to the public. OSM provides some data on public access but without clear assignments. For example, our retrieval of polygons with the “leisure:park” tag returned 17 types of access from “community” and “discouraged” to “permissive,” “yes,” “restricted,” and “unknown.” Further, OSM data are crowdsourced and not validated by the agencies who manage these spaces. ParkServe also has public access metadata, but its coverage is focused on municipalities. Park cover in rural areas where many important recreational parks (i.e., National Parks) are located is limited in ParkServe.

In response to the value of park data and limitations with extant datasets, we present a new exposure indicator – the dataset for accessible and recreational parks in the contiguous United States (PAD-US-AR). We validate this dataset by comparing it to its source (the original PAD-US V2.1), other nature exposure metrics, including NDVI, tree canopy cover, alternative park datasets, and sociodemographic characteristics in counties and states.

Methods

We curated the PAD-US-AR⁴⁸ dataset from the USGS Protected Areas Database of the U.S. V2.1 (PAD-US V2.1)⁴⁷. The PAD-US is published by U.S. Geological Service in collaboration with Boise State University and through coordination with Federal, State, and non-governmental organizations that provide and verify the data. Its original release was in April 2009. Updates were made in 2010, 2011, 2012, 2016, 2018, 2020, and July 2022. Data on the completeness of the V2.0 dataset, which occurred before the V2.1 dataset used here, are available at https://www.protectedlands.net/frequently-asked-questions-about-pad-us/. In brief, 14 states had over 95% coverage of parks and protected areas, 26 states had 80–95% coverage, and the remaining 8 states in the contiguous U.S. had <80% coverage. Updated coverage statistics for V2.1 are currently unavailable.

The PAD-US is a regularly updated geographic information system (GIS) spatial dataset that compiles the best available data provided by U.S.-based land management agencies and organizations. It strives to be a complete inventory of public land and other protected areas in the U.S. Included areas are those preserved for biological diversity and other natural, recreation, historical, or cultural uses and managed for these purposes through legal or other effective means⁴⁷. Some areas consist of small land parcels with building footprints that occupy most of the area. These are not readily identified with the PAD-US V2.1 metadata. The location designation field (Loc_Ds) offers some clues with values such as “cultural arts center” and “National Register of Historic Places.” The number of unique values (N = 1,675) in the designation, easement, and fee areas of the PAD-US V2.1 limits precise identifications and removal of such areas.

The PAD-US V2.1 release became available in September 2020 and included notable updates from previous versions. These included integration of the TPL ParkServe dataset, Census American Indian/Alaskan Native Areas, Ducks Unlimited protected areas, and federal land ownership updates, among others. The PAD-US V3.0 was released in early July 2022 and contained minor updates that we expected to influence the curation process of the PAD-US-AR very little. For a complete description of version updates, see https://www.usgs.gov/programs/gap-analysis-project/pad-us-data-history.

The PAD-US has been used for conservation mapping^{49,50,51,52,53,54,55} and noise research^56,57. These studies have identified that Western U.S. National Monuments provided jobs and economic growth after establishment⁵², counties with greater coverage of protected areas with strict conservation status (i.e., Wilderness Areas and National Parks) are associated with higher average noise levels⁵⁶, and anthropogenic noise is common in many U.S. parks and protected areas⁵⁷. We are also aware of a few nature-health studies that have utilized the complete PAD-US dataset^58,59. In studies by Tsai and colleagues, the authors identified park locations and ground-truthed results with Google Maps and county/municipal data to identify park entrances. The PAD-US was used to calculate descriptive sample characteristics or covariates in models with other measures of nature exposure (i.e., tree cover and greenery), so associations between health and the PAD-US were not reported.

The opportunities and lack of precedent for curations of the PAD-US prompted us to define which types of parks and protected areas in the dataset were both accessible and recreation-oriented. Based on discussions among three authors (M.B., A.R., S.O.) and three outdoor recreation specialists in the western United States, we reached a consensus on including the following categories:

1.
Parks open for public access or restricted access (i.e., seasonally open, fees required, or permits required), including but not limited to lands managed by the National Park Service, U.S. Forest Service, Bureau of Land Management, U.S. Fish & Wildlife, Army Corps of Engineers, State Parks, State Departments of Conservation, State Departments of Natural Resources, State Departments of Land, State Fish and Wildlife Departments, State Forest Service, State Park and Recreation Departments, Tennessee Valley Authority, and city and county park and recreation departments.
2.
Publicly accessible conservation easements.

We excluded the following designations (see the paragraphs below for rationales):

1.
Department of Energy, Department of Defense, and Bureau of Reclamation lands
2.
Marine areas managed as Marine Protected Areas by the National Oceanic and Atmospheric Administration, or Bureau of Ocean Energy Management, among others
3.
Proclamation areas, which are boundaries of national lands used for administrative purposes that overlap with large areas of public lands that are not all available to the public
4.
Fish hatcheries and other lands used for water rights with regulated hunting
5.
National Park easements (i.e., lands paralleling but not including the Appalachian Trail and not used by the public)
6.
Joint management areas (i.e., university research stations)
7.
Non-governmental organization lands (aside from conservation easements)
8.
State trust/land survey lands
9.
American Indian Lands
10.
Other areas with unknown access or closed public access (i.e., limited to coordinated programs and research)

Restricting the PAD-US to these categories was a sequential process starting with the four terrestrial PAD-US domains (Fig. 1). These domains included designations (policy-designated areas such as National Parks and State Parks), easements (conservation and open space easements provided by the National Conservation Easement Database⁶⁰), fee lands (open space owned by Federal, State, or local agencies, nonprofits, or private individuals), and proclamations (boundaries of administrative areas). For further information on these domains, see http://www.protectedlands.net/pad-us-technical-how-tos/.

Our first step was to exclude all proclamation lands in the PAD-US. These administrative boundaries are not ownership lines but are instead the outer boundaries of areas used by land managers for planning regardless of internal ownership. They could but will not necessarily be publicly managed in the future. Some commercial mapping providers (i.e., Google Maps, Esri USA Parks) incorrectly use these boundaries to show protected areas and, in doing so, often show large areas of private lands as part of public lands.

Next, we excluded lands described as closed to public access in the PAD-US. Alternative classifications include open to public access, restricted, which denotes a permit may be needed, or unknown. We temporarily retained unknown access areas for further consideration since large areas of the intermountain west are designated as such. For example, the Great Salt Lake, UT, is the state’s largest water body and a recreation destination for boating, swimming, and sunbathing.

The subsequent step was refining lands labeled as unknown access in the PAD-US. Decisions were made based on the assigned land manager. City lands (Code = CITY) were included since many greenways were under this classification. County lands (CNTY), which described nearly 250 polygons run by the City of New York for parks and recreation in the city and upstate, were included. Similarly, regional agency land (REG) covered over 400 polygons concentrated in Chicago and Los Angeles suburbs used for parks and recreation; these lands were retained. State Department of Conservation (SDC) and State Department of Natural Resource (SDNR) lands were also retained. These included over 5,000 polygons across the country, including the Great Swamp Management Area, RI, an important area for birding and open to the public, and the Great Salt Lake. State Department of Land (SDOL) areas were retained, as they included approximately 30 polygons used by the public for hiking in Northwestern states. State Fish and Wildlife (SFW) lands included urban areas with trails along waterways and were retained. State Parks and Recreation (SPR) lands were retained and covered public recreational areas in Maine. Tennessee Valley Authority (TVA) and Army Corps of Engineers (USACE) areas covered large reservoirs with important water-based recreation resources and were retained. The presence of such waterbodies, which provide public recreation to millions of visitors annually⁶¹, required us to retain the entire census geographies despite evidence that removing areas covered by water can lead to more precise and realistic sociodemographic analyses⁶². Last, U.S. Forest Service (USFS) lands were retained as they included several recreational areas in Virginia.

All other areas with unknown public access in the PAD-US were deemed not accessible to the public or used for public recreation and therefore excluded. This conservative approach reduced the chances of misclassifying large tracts of land that were likely inaccessible. For example, Department of Defense (DOD) lands included ammunition plants, Department of Energy (DOE) lands included nuclear test sites, and National Oceanic and Atmospheric Administration (NOAA) lands included estuarine research reserves. Non-governmental organization (NGO) lands included nearly 17,500 polygons in the Rocky Mountains but covered too many conservation types to determine whether these were open to the public. American Indian Lands (TRIB) were on reservations and could not be assumed to be accessible and used by the public.

The final step in curating the PAD-US-AR dataset was determining how to approach the polygons in the Western and Midwestern states that were left over from the Public Land Survey System (designation = SRMA). Most of these lands follow a grid pattern and are not used for outdoor recreation. However, some state trust lands include important parks, such as DuPont State Forest, NC, a popular destination for mountain biking, hiking, swimming, and visiting waterfalls. Three of the authors conducted online searches of the uses of these lands using online resources (i.e., State Department of Natural Resource portals) for each state and selected which to include or exclude. The corresponding author also discussed these decisions with three outdoor recreation professionals living in the western U.S. Based on this examination, we removed state trust lands from Arizona, Colorado, Idaho, Louisiana, Mississippi, Montana, Nevada, New Mexico, North Dakota, Oklahoma, Oregon, South Dakota, Texas, Utah, Washington, and Wyoming.

To obtain census tract and county exposure estimates, we calculated the percentage of the PAD-US-AR covering each geographic unit. Tract-level estimates included a 0.5-mile buffer around each tract to acknowledge the opportunities for park access for residents living near the tract boundaries. Similar thresholds have been used in past research^63,64,65 and are recommended as U.S. park access standards by several nonprofits (e.g., Trust for Public Land, www.10minutewalk.org). This threshold is primarily used in urban areas and may be most relevant to those areas where most people live and where tract sizes are smaller.

No buffering was conducted around counties. Counties are >300% larger than tracts, on average. In our dataset, the median county area was 1,614 km², while the median tract area was 5 km². Counties are also jurisdictions of local governments, whereas tracts do not represent any administrative boundaries. For these reasons, we avoided buffering counties, which often have parks and recreation departments managing parks within their borders.

Data Records

PAD-US-AR⁴⁸ data are released under the Creative Commons Attribution 4.0 International (CC BY 4.0) license and publicly available on an Open Science Framework (OSF) repository (https://doi.org/10.17605/OSF.IO/PWDSG). Several files are available:

1.
Geopackage and shapefile of the PAD-US-AR⁴⁸ in a standard format (separate polygons for different parks) and dissolved format (a single polygonal layer)
2.
Spreadsheets of PAD-US-AR⁴⁸ cover in 2019 U.S. counties
3.
Spreadsheets of PAD-US-AR⁴⁸ cover in 2019 U.S. ZIP code tabulation areas
4.
Spreadsheets of PAD-US-AR⁴⁸ cover in 2019 U.S. tracts with 0.5-mile buffers around each tract

Geopackage and shapefiles include vector polygons with the original metadata from the PAD-US V2.1. For a complete listing of variables, please visit https://www.usgs.gov/programs/gap-analysis-project/pad-us-data-manual. In brief, the data include the name of the parcel; feature class (in the PAD-US-AR, the options are designation, easement, or fee); type and name of management agency (i.e., federal, state, American Indian Lands, or local government); designation (i.e., conversation easement vs. National Park); conservation protection level as designated by the International Union for the Conservation of Nature (IUCN); state name; and geographic size.

Spreadsheets include geographic identifiers (i.e., FIPS codes or GEOID) and percent park cover. These are provided as Microsoft Excel files (.xlsx) and text files (.txt) to maintain leading zeros in the geographic identifiers. Park cover ranges from 0 (no parks) to 100 (complete park cover). Tract estimates are provided for park cover within the boundaries of each tract and the 0.5-mile buffered tract boundaries.

Technical Validation

The PAD-US-AR⁴⁸ dataset presents park cover from nearly 250,000 spatial units and 1,900,000 km² in area across the contiguous U.S (Table 2). Histograms of the data within counties and tracts and by census region are presented in Figure S2. Distributions were right skewed in all regions except Northeastern and Western counties. Northeastern counties showed a flat distribution until approximately 20% cover. Higher levels of cover were present in few counties. Western counties showed a roughly flat distribution until around 80% cover, after which the number of counties with higher cover levels was small.

Table 2 Number of units and cover of datasets for park cover in the contiguous U.S.

Full size table

Comparisons with the source dataset (PAD-US V2.1) are available for each census region in Figs. 2–5. Large areas of Maine, southeast Pennsylvania, central/western Massachusetts, and northern New Hampshire were excluded from the PAD-US-AR because they were private conservation easements, watersheds with closed access as listed in the PAD-US V2.1, or otherwise unknown public access. Swaths of the Dakotas were removed as conservation easements used for wildlife management with uncertain public access. Lands in Oklahoma arranged on a gridwork were removed as state school lands typically leased out for agriculture and mineral resource purposes. A gridwork of land parcels in Montana, Wyoming, Colorado, Arizona, and New Mexico was removed as state trust lands managed for timber, surface, and mineral resource extraction. Similarly, larger parcels of state trust lands in Western Texas were excluded. Other large parcels of lands excluded were over 560,000 acres in central Idaho, 860,00 acres in southern Nevada, and nearly 200,000 acres in southern South Carolina managed by the Department of Energy; approximately 550,000 acres at Vermejo Park Ranch managed by Ted Turner Reserves, Inc., and 133,000 areas of the Stronghold District of Badlands National Park in western South Dakota owned by the Oglala Sioux Tribe under agreement by the National Park Service.

Next, we compare the PAD-US-AR dataset with other park datasets, nature exposure metrics, and sociodemographic characteristics. The value of comparing the PAD-US-AR with other park datasets is to determine whether the PAD-US-AR differs from already available datasets. Park dataset comparisons were made by tallying the number of geographic polygon units and calculating the total cover after dissolving all polygon units (to account for some polygons overlapping each other) in census regions.

The value of comparing the PAD-US-AR to nature exposure metrics is to determine whether park cover differs from other standard exposure estimates. We employed two measures of NDVI (annual averages and summertime highs) and tree canopy cover, which were derived from raster images and averaged across geographic units (tracts or counties). NDVI values were retrieved and processed in Google Earth Engine (GEE) using cumulative annuals or summertime highs (June-August) from 250 × 250 m 16-day MODIS images averaged over five years (2015–2020) after extracting cloud cover and water pixels. Tree canopy data were retrieved from the 2019 National Land Cover Database (NLCD) release⁶⁶, which provides cover estimates ranging from 0 to 100% for each 30 × 30 m pixel in 2016. This release was the most recent available during data retrieval (September 2022). To identify whether the PAD-US-AR was unique from these other estimates of nature exposure, we examined bivariate correlations between each metric and the PAD-US-AR.

Last, we examined sociodemographic correlates of park cover measured through the PAD-US-AR to inform what confounding factors should be considered when modeling associations between park cover and human health. Sociodemographic characteristics were retrieved from 2015–2019 American Community Survey (ACS) estimates from the U.S. Census at the county and tract level⁶⁷. We selected 14 variables (Table S2) based on existing literature examining correlates of greenspace, especially in studies focusing on socioeconomic and racial disparities in access to these spaces^{68,69,70,71,72}. Attempts at incorporating median household income alongside other measures resulted in multicollinearity, so this variable was excluded from the primary analyses but considered in a sensitivity analysis. We examined the results of generalized linear mixed models (GLMMs) with gamma distributions and U.S. states as random effects to account for the non-normal distribution of the outcome variable and the hierarchical nature of the data (counties and tracts within states). Models were run with complete data for 100% of counties (N = 3,108) and 97.3% of tracts (N = 70,580) in the contiguous U.S. circa 2019.

Stratified analyses using more urbanized counties (≥50 people/km²) and tracts (≥1,000 people/km²) were conducted to compare results with past research and inform future scholarship with the PAD-US-AR. There is no consensus on differentiating more vs. less urban areas in nature-health research¹⁹. Between 1,000 and 1,999 people/km² is a common cut point¹⁹. We attempted to apply that cut point to both units of analysis (tracts and counties), which split the number of tracts roughly in half (n = 32,929 as more urban). In contrast, this cut point resulted in too few counties to conduct sufficiently powered analyses (N = 45 as more urban). We attempted the 300 people/km² cut point recommended by the European Union⁷³ and used in a recent U.S. study on the association between park cover, park use, and mental health⁷⁴. This continued to produce small sample sizes: N = 43 for the Northeast, 30 for the Midwest, 93 for the South, and 16 for the West. A cut point of 50 people/km² produced reasonable sample sizes for most regions (N = 121 for the Northeast, 178 for the Midwest, 386 for the South, and 58 for the West). Applying this 50 people/km² cut point to counties also produced maps that approximated the location of the Census classification of urbanized areas (https://www.census.gov/programs-surveys/geography/guidance/geo-areas/urban-rural.html; Figure S2). This urbanized area classification scheme has been used to create other datasets on environmental exposure estimates, such as urban heat island vulnerability⁷⁵. GLMMs were used in these stratified analyses except in the Midwest, where standard linear regression models were run to avoid singularity resulting from few urban counties per state in the random effect term.

Comparison of the PAD-US-AR percent park cover dataset to other park datasets

Descriptive statistics for each park dataset are provided in Table 2, and maps of park cover are provided in Figure S3. The PAD-US-AR⁴⁸ covers 51.6% of the acreage in the PAD-US V2.1 dataset. The PAD-US-AR acreage is larger than the acreage of USA Parks and ParkServe but smaller than the OSM datasets when leisure and boundary tags are combined. Bureau of Land Management (BLM) lands are mainly absent from the USA Parks and ParkServe datasets but are partially included in the OSM datasets and prominent in the PAD-US-AR. This is particularly noticeable in Nevada, western Utah, and Wyoming. These areas include such popular recreation attractions as the Grand Staircase-Escalante National Monument, UT, and the Grand Canyon Parashant National Monument, AZ. These collectively encompass nearly 3,000,000 acres (around twice the size of Delaware), attract more than 150,000 visitors annually for hiking, backpacking, and camping, and have received thousands of 5-star reviews on Google Maps. This high number of reviews shows their popularity and visibility in the public sphere. Other notable areas include off-highway vehicle (OHV) trails, such as the Little Sahara OHV Area, UT, which offers driving/riding on a 700-foot drivable sand dune, 30,000 annual visitors, four campgrounds, and approximately 62,000 acres. Most popular mountain biking and OHV riding trails around Moab, UT (except for the Slick Rock Trail System) are also BLM lands excluded or with limited coverage in datasets other than the PAD-US and PAD-US-AR. These results demonstrate that the PAD-US-AR presents a selected sample of the PAD-US dataset with differing coverage from pre-existing park cover datasets.

Comparison of the PAD-US-AR to other nature exposure measures

Descriptive statistics for park cover and other nature exposure metrics are presented in Table 3. Maps of each metric are provided in Figure S4. Distributions of nature exposure metrics are available in Figures S5-S7.

Table 3 Descriptive statistics for the PAD-US-AR and other nature exposure metrics.

Full size table

Associations between the PAD-US-AR⁴⁸ and NDVI varied across geographies and seasons (Fig. 6). Park cover was negatively associated with NDVI at the county level. Pearson correlation coefficients (r[95% confidence interval]) were as follows: r_annual = −0.21[−0.24, −0.17]; r_summer = −0.33[−0.36, −0.30]. Park cover was not correlated with NDVI at the tract level (r_annual = 0.03[0.02, 0.04]; r_summer = 0.01[0.00, 0.02]). Associations between the PAD-US-AR and NDVI within census regions were consistently positive, except in Western counties (r_annual = −0.12[−0.21, −0.02]; r_summer = −0.03 [−0.12, 0.07]) or with NDVI summertime maximums in Midwestern counties (r = −0.02[−0.08, 0.04]). Such results are likely due to climatic and land use differences, such as arid climates in the West and high concentrations of agricultural land that only produces chlorophyll in the summer in the Midwest. Meanwhile, associations between park cover and NDVI annual averages in Midwestern counties were the strongest observed among any pairing (r = 0.28[0.22, 0.33]). This may be explained by parkland in the upper Midwest having higher concentrations of vegetation that produce chlorophyll year-round (i.e., evergreen trees, herbaceous wetland cover) than in the South and fewer urban parks with less greenery than in the Northeast. Associations at the tract level ranged from 0.03[0.02, 0.05] for NDVI summertime maximums in Midwestern tracts, where agricultural lands may only be green in the summer, to 0.23[0.22, 0.25] for NDVI summertime maximums in Western tracts.

Park cover was positively associated with tree canopy cover in every pairing. The strongest correlations were among Midwestern counties (r = 0.65[0.61, 0.68]), and the weakest correlations were in nationwide county-level models (r = 0.10[0.07, 0.14]). The consistent correlation between canopy cover and parks may be explained by people’s innate preference for open-growth trees with large amounts of canopy cover^29,76,77,78 and historical guidelines to retain such trees in park design⁷⁹.

These findings demonstrate that the PAD-US-AR⁴⁸ presents a unique exposure estimate from metrics of nature exposure. Plant-rich landscapes, or “greenspaces,” do not capture all aspects of open recreational spaces and nature-rich landscapes¹⁰. Correlations between nature exposure metrics vary in size and direction based on the unit of analysis (counties vs. tracts) and geography (regions of the country and nationwide analyses).

Comparison of the PAD-US-AR to sociodemographic characteristics

A listing of the sociodemographic characteristics considered in analyses is provided in Table S1. Descriptive statistics for each variable are presented in Tables S2–S6. Maps of the distribution of these variables are provided in Figure S8. Multivariate associations between the PAD-US-AR⁴⁸ and sociodemographic characteristics are shown in Fig. 7 and Table S7. These results were derived from GLMMs with gamma distributions and U.S. states as random effects to account for the non-normal distribution of the outcome variable with minimal multicollinearity (Table S8).

Park cover was more strongly associated with sociodemographic characteristics at the county level than at the tract level. Around 30% of the variance in countywide park cover was explained in U.S. regions after accounting for state random effects (conditional R²_Northeast = 0.29, R²_Midwest = 0.31, R²_South = 0.23, R²_West = 0.38). Variance explained within counties across the country was over 60% (R²_Nationwide = 0.63). Variance explained at the tract level was closer to 10%–20% (R²_Nationwide = 0.19, R²_Northeast = 0.09, R²_Midwest = 0.08, R²_South = 0.12, R²_West = 0.18).

Three sociodemographic characteristics showed fairly consistent associations with park cover. On average, areas with greater shares of older adults (≥65 yrs) had more park cover. Areas with higher median home values also had more park cover, except in the Northeast. Last, areas with greater shares of female residents had less park cover on average, except in Northeastern and Southern counties. Two other sociodemographic characteristics showed consistent associations within either county or tract samples. First, counties with lower Gini index values (lower inequality) had more park cover on average. Secondly, tracts with higher unemployment rates had more park cover on average.

Associations between the PAD-US-AR and other sociodemographic characteristics varied by region. Park cover in Northeastern counties was concentrated in areas with lower rates of income inequality, high school graduation, and natural resource employment. Park cover in Midwestern counties was greater in areas with higher poverty rates, income inequality, and unemployment. Park cover in Southern counties was higher in areas with greater population densities or higher rates of poverty and lower rates of income inequality, natural resource employment, or non-Hispanic Black residents. Western counties showed greater park cover in areas with more poverty, higher shares of college degree holders, less income inequality, and lower shares of non-Hispanic Asian residents. Within tracts, park cover was higher in densely populated Northeastern areas but lower in densely populated areas throughout the rest of the country. Tract-level park cover was higher in areas with greater shares of residents employed in natural resource professions in the West and Northeast, while the opposite was found in the South and Midwest; in these areas, park cover was lower in areas where greater shares of people worked in natural resources professions. Park cover was higher in Midwestern and Southern tracts with greater shares of non-Hispanic Asians, whereas park cover was lower in Western and Northeastern tracts with greater shares of non-Hispanic Asians. In summary, park cover was associated with many sociodemographic characteristics, but the strength and direction varied by geography and unit of analysis.

Multivariate associations between the PAD-US-AR and sociodemographic characteristics in urban areas are presented in Table S10. In most cases, median home value continued to show strong positive associations with park cover. One exception was observed in Midwestern tracts, where median home value was negatively associated with park cover. Percent female no longer predicted park cover except in Southern tracts. Shares of older adults also predicted park cover in only a few urban cases; significant positive associations were observed only in nationwide and Northeastern tracts. Percent Non-Hispanic Asian residents emerged as a predictor in several models, but the direction of the associations differed. Nationwide models showed negative associations, while Midwestern counties and tracts and Southern tracts showed positive associations. County-level models of urban areas continued to predict the variance explained by park cover better than tract-level models of urban areas. Alternative models substituting median household income for other socio-economic indicators found mixed relationships between this variable and park cover (Table S9, Figure S9).

Usage Notes

We present a new potential indicator of outdoor nature exposure for the contiguous U.S: the location of parks intended to be accessible for recreation. This dataset allows researchers to examine the number of outdoor recreation areas meant for public use around geographic units of interest (i.e., homes, neighborhoods, and transit routes). Other commonly-used metrics – like moderate/coarse resolution NDVI and tree canopy cover datasets – cannot identify whether the areas are managed for public recreational use. The PAD-US-AR⁴⁸ is unique from these other metrics, as determined by the correlations presented above (Fig. 6).

The PAD-US-AR also differs in coverage from pre-existing park cover datasets. These differences were observed when tallying the geographic polygon units and calculating the total cover after dissolving all polygon units to account for some overlapping units. The reasons to utilize the PAD-US-AR dataset rather than these other options include the PAD-US-AR source data (PAD-US V2.1) being validated by the agencies managing the land, our systematic examination of what is accessible for recreation, and the clarity and transparency in its curation. The potential for park cover to not match park access for all residents in a county or tract remains high, as in any area-level exposure estimate^80,81,82. Individual-level estimates should be calculated from the boundaries or centroids of park polygons along road or pedestrian networks when geolocated data for homes, schools, workplaces, or activity spaces are available.

The chances for residual confounding in area-level studies with the PAD-US-AR dataset exist if multivariate models do not control for sociodemographic characteristics of the areas encompassing parks. The PAD-US-AR has the most robust associations with home prices, shares of female residents, and shares of older presents. These should be statistically controlled in models using the PAD-US-AR as an independent variable or covariate. Other measures of socioeconomic status (i.e., median household income) might be insufficient to avoid residual confounding in ecological studies with PAD-US-AR data.

Since the PAD-US-AR was curated nationwide, it is most appropriate for use at larger geographic scales (i.e., regional and national). Studies focusing on smaller geographic contexts, such as within individual cities or states, should partner with local land management agencies and recreation departments to ensure PAD-US-AR data accurately represent all parks and protected areas managed for public outdoor recreation. Since ownership boundaries and land acquisitions can change annually, local land management agencies might also be able to identify new parks that aren’t present in the PAD-US-AR. Smaller-scale analyses may allow manual selection of land parcels with building footprints that occupy most of the area.

The PAD-US-AR may be best conceived as the minimum park coverage level. We excluded the approximately 35,000 areas covering over 42,000 km² with unknown public access in the PAD-US. Some private parks, such as golf courses or community parks restricted to residents who pay homeowner association fees, can provide opportunities for outdoor recreation that activate the same health-promoting pathways as public parks. People living in the counties and tracts presented in the datasets may have more access to outdoor recreational opportunities than suggested by the PAD-US-AR.

As the nature-health literature expands, exposure estimates are expected to develop and be refined. The PAD-US-AR presents a significant advancement in this body of literature by offering researchers an assessment of where parks are available for outdoor recreation.

Code availability

R (4.1.2) was used to generate the PAD-US-AR⁴⁸ dataset and results. QGIS (3.18.3) was used to create the maps. Scripts and source data to reproduce results are available on OSF (https://osf.io/pwdsg/).

References

Silva, R. A., Rogers, K. & Buckley, T. J. Advancing Environmental Epidemiology to Assess the Beneficial Influence of the Natural Environment on Human Health and Well- Being. Environmental Science & Technology 52, 9545–9555 (2018).
Article ADS CAS Google Scholar
Markevych, I. et al. Exploring pathways linking greenspace to health: Theoretical and methodological guidance. Environmental Research 158, 301–317 (2017).
Article ADS CAS Google Scholar
Hartig, T., Mitchell, R. J., Vries, Sde & Frumkin, H. Nature and health. Annual Reviews in Public Health 35, 207–228 (2014).
Article Google Scholar
Zhang, J., Yu, Z., Zhao, B., Sun, R. & Vejre, H. Links between green space and public health: A bibliometric review of global research trends and future prospects from 1901 to 2019. Environmental Research Letters 15, 063001–37 (2020).
Article ADS Google Scholar
Taylor, L. & Hochuli, D. F. Defining greenspace: Multiple uses across multiple disciplines. Landscape and Urban Planning 158, 25–28 (2017).
Article Google Scholar
Yang, B.-Y. et al. Greenspace and human health: An umbrella review. Innovation 2, 100164 (2021).
Google Scholar
Twohig-Bennett, C. & Jones, A. The health benefits of the great outdoors: A systematic review and meta-analysis of greenspace exposure and health outcomes. Environmental Research 166, 628–637 (2018).
Article ADS CAS Google Scholar
Smith, N. et al. Urban blue spaces and human health: A systematic review and meta-analysis of quantitative studies. Cities 119, 103413 (2021).
Article Google Scholar
Gascon, M., Zijlema, W. L., Vert, C., White, M. P. & Nieuwenhuijsen, M. J. Outdoor blue spaces, human health and well-being: A systematic review of quantitative studies. International Journal of Hygiene and Environmental Health 220, 1207–1221 (2017).
Article Google Scholar
Li, H. et al. Beyond “bluespace” and “greenspace”: A narrative review of possible health benefits from exposure to other natural landscapes. Sci Total Environ 159292, https://doi.org/10.1016/j.scitotenv.2022.159292 (2022).
Rugel, E. J., Henderson, S. B., Carpiano, R. M. & Brauer, M. Beyond the Normalized Difference Vegetation Index (NDVI): Developing a Natural Space Index for population-level health research. Environmental Research 159, 474–483 (2017).
Article ADS CAS Google Scholar
Wheeler, B. W. et al. Beyond greenspace: an ecological study of population general health and indicators of natural environment type and quality. International Journal of Health Geographics 14, 11–17 (2015).
Article Google Scholar
Helbich, M., Poppe, R., Oberski, D., van Emmichoven, M. Z. & Schram, R. Can’t see the wood for the trees? An assessment of street view- and satellite-derived greenness measures in relation to mental health. Landscape Urban Plan 214, 104181 (2021).
Article Google Scholar
Holland, I. et al. Measuring nature contact: A narrative review. Int J Environ Res Pu 18, 4092 (2021).
Article Google Scholar
Labib, S. M., Lindley, S. & Huck, J. J. Spatial dimensions of the influence of urban green-blue spaces on human health: A systematic review. Environmental Research 180, 108869 (2020).
Article ADS CAS Google Scholar
Jackson, R. D. & Huete, A. R. Interpreting vegetation indices. Preventive Veterinary Medicine (1991).
Rhew, I. C., Stoep, A. V., Kearney, A., Smith, N. L. & Dunbar, M. D. Validation of the Normalized Difference Vegetation Index as a Measure of Neighborhood Greenness. Annals of Epidemiology 21, 946–952 (2011).
Article Google Scholar
Labib, S. M., Lindley, S. & Huck, J. J. Scale effects in remotely sensed greenspace metrics and how to mitigate them for environmental health exposure assessment. Comput Environ Urban Syst 82, 101501 (2020).
Article Google Scholar
Browning, M. H. E. M., Rigolon, A., McAnirlin, O. & Yoon, H. (Violet). Where greenspace matters most: A systematic review of urbanicity, greenspace, and physical health. Landscape Urban Plan 217, 104233 (2022).
Article Google Scholar
Ekkel, E. D. & Vries, Sde Nearby green space and human health: Evaluating accessibility metrics. Landscape and Urban Planning 157, 214–220 (2017).
Article Google Scholar
Dzhambov, A. M., Lercher, P., Rüdisser, J., Browning, M. H. E. M. & Markevych, I. Allergic symptoms in association with naturalness, greenness, and greyness: A cross-sectional study in schoolchildren in the Alps. Environ Res 198, 110456 (2020).
Article Google Scholar
Kumari, N. et al. The Grass Is Not Always Greener on the Other Side: Seasonal Reversal of Vegetation Greenness in Aspect‐Driven Semiarid Ecosystems. Geophys Res Lett 47 (2020).
Helbich, M. Spatiotemporal Contextual Uncertainties in Green Space Exposure Measures: Exploring a Time Series of the Normalized Difference Vegetation Indices. Int J Environ Res Pu 16, 852 (2019).
Article Google Scholar
Su, J. G., Dadvand, P., Nieuwenhuijsen, M. J., Bartoll, X. & Jerrett, M. Associations of green space metrics with health and behavior outcomes at different buffer sizes and remote sensing sensor resolutions. Environment International 126, 162–170 (2019).
Article Google Scholar
O’Neil-Dunne, J., MacFaden, S. & Royar, A. A Versatile, Production-Oriented Approach to High-Resolution Tree-Canopy Mapping in Urban and Suburban Landscapes Using GEOBIA and Data. Fusion. Remote Sens-basel 6, 12837–12865 (2014).
Article ADS Google Scholar
O’Neil-Dunne, J. P. M., MacFaden, S. W., Royar, A. R. & Pelletier, K. C. An object-based system for LiDAR data fusion and feature extraction. Geocarto International 28, 227–242 (2012).
Article Google Scholar
MacFaden, S. W., O’Neil-Dunne, J. P. M., Royar, A. R., Lu, J. W. T. & Rundle, A. G. High-resolution tree canopy mapping for New York City using LIDAR and object-based image analysis. Journal of Applied Remote Sensing 6, 063567-1–063567–23 (2012).
Article ADS Google Scholar
Orians, G. H. Habitat selection: General theory and applications for human behavior. In (ed. Lockard, J. S.) The evolution of human social behavior.49–66 (Elsevier Science Limited, 1980).
Townsend, J. B. & Barton, S. The Impact of Ancient Tree Form on Modern Landscape Preferences. Urban For Urban Gree 34, 205–216 (2018).
Article Google Scholar
Locke, D. H., Ossola, A., Minor, E. & Lin, B. B. Spatial contagion structures urban vegetation from parcel to landscape. People Nat, https://doi.org/10.1002/pan3.10254 (2021).
Ossola, A., Locke, D. H., Lin, B. & Minor, E. Greening in style: Urban form, architecture and the structure of front and backyard vegetation. Landscape and Urban Planning 185, 141–157 (2019).
Article Google Scholar
Shahtahmassebi, A. et al. Remote sensing of urban green spaces: a review. Urban Forestry & Urban Greening 126946, https://doi.org/10.1016/j.ufug.2020.126946 (2020).
He, N. & Li, G. Urban neighbourhood environment assessment based on street view image processing: A review of research trends. Environ Challenges 100090 (2021).
Labib, S. M. Estimating multiple greenspace exposure types and their associations with neighbourhood premature mortality: A socioecological study. Sci Total Environ 147919, https://doi.org/10.1016/j.scitotenv.2021.147919 (2021).
Wang, R. et al. Visible green space predicts emotion: Evidence from social media and street view data. Appl Geogr 148, 102803 (2022).
Article Google Scholar
Zhang, J. et al. Is indoor and outdoor greenery associated with fewer depressive symptoms during COVID-19 lockdowns? A mechanistic study in Shanghai, China. Build Environ 109799, https://doi.org/10.1016/j.buildenv.2022.109799 (2022).
Zhang, R., Wulff, H., Duan, Y. & Wagner, P. Associations between the physical environment and park-based physical activity: A systematic review. J Sport Health Sci 8, 412–421 (2019).
Article Google Scholar
McCormack, G. R., Rock, M., Toohey, A. M. & Hignell, D. Characteristics of urban parks associated with park use and physical activity: A review of qualitative research. Health Place 16, 712–726 (2010).
Article Google Scholar
Cohen, D. A. et al. The First National Study of Neighborhood Parks. American Journal of Preventive Medicine 51, 419–426 (2016).
Article Google Scholar
Nieuwenhuijsen, M. J., Khreis, H., Triguero-Mas, M., Gascon, M. & Dadvand, P. Fifty shades of green. Epidemiology 28, 63–71 (2017).
Article Google Scholar
Becker, D. A., Browning, M. H., McAnirlin, O., Yuan, S. & Helbich, M. Is Green Space Associated with Opioid-Related Mortality? An Ecological Study at the U.S. County Level. Urban For Urban Gree 127529, https://doi.org/10.1016/j.ufug.2022.127529 (2022).
Nieuwenhuijsen, M. J. Green infrastructure and health. Annual Review of Public Health 42, 317-328 (2021).
Environmental Systems Research Institute. USA Parks. https://www.arcgis.com/home/item.html?id=578968f975774d3fab79fe56c8c90941 (2022).
Open Street Map Foundation. OpenStreetMap About. https://www.openstreetmap.org/about (2022).
Ludwig, C., Fendrich, S. & Zipf, A. Regional variations of context‐based association rules in OpenStreetMap. T Gis 25, 602–621 (2021).
Article Google Scholar
Trust for Public Land. ParkServe® Data Downloads. https://www.tpl.org/parkserve/downloads (2022).
U.S. Geological Survey. Protected Areas Database of the United States (PAD-US) 2.1.U.S. Geological Survey.https://doi.org/10.5066/P955KPLE (2020).
Browning, M. H. E. M. et al. PAD-US-AR. OSF, https://doi.org/10.17605/osf.io/pwdsg (2022).
Ogletree, S. S., Powell, R. B., Baldwin, R. F. & Leonard, P. B. A framework for mapping cultural resources in landscape conservation planning. Conservation Science and Practice 1, e41–11 (2019).
Article Google Scholar
Theobald, D. M. Development and Applications of a Comprehensive Land Use Classification and Map for the US. Plos One 9, e94628 (2014).
Article ADS Google Scholar
Belote, R. T. et al. Identifying Corridors among Large Protected Areas in the United States. PLOS ONE 11, e0154223–16 (2016).
Article Google Scholar
Walls, M., Lee, P. & Ashenfarb, M. National monuments and economic growth in the American West. Science Advances 6, eaay8523 (2020).
Article ADS Google Scholar
Martinuzzi, S. et al. Scenarios of future land use change around United States’ protected areas. Biol Conserv 184, 446–455 (2015).
Article Google Scholar
Sohl, T. L. et al. Spatially explicit modeling of 1992–2100 land cover and forest stand age for the conterminous United States. Ecol Appl 24, 1015–1036 (2014).
Article Google Scholar
Dietz, M. S., Belote, R. T., Aplet, G. H. & Aycrigg, J. L. The world’s largest wilderness protection network after 50 years: An assessment of ecological system representation in the U.S. National Wilderness Preservation System. Biol Conserv 184, 431–438 (2015).
Article Google Scholar
Rice, W. L., Newman, P., Miller, Z. D. & Taff, B. D. Protected areas and noise abatement: A spatial approach. Landscape and Urban Planning 194, 103701–9 (2020).
Article Google Scholar
Buxton, R. T. et al. Noise pollution is pervasive in U.S. protected areas. Science 356, 531–533 (2017).
Article ADS CAS Google Scholar
Tsai, W.-L. et al. Types and spatial contexts of neighborhood greenery matter in associations with weight status in women across 28 U.S. communities. Environ Res 111327 (2021).
Tsai, W.-L., Davis, A. J. S. & Jackson, L. E. Associations between types of greenery along neighborhood roads and weight status in different climates. Urban Forestry & Urban Greening 1–44 (2019).
National Conservation Easement Database. https://www.conservationeasement.us (2022).
US Army Corp of Engineers. Value to the Nation Value to the Nation. Recreation https://www.iwr.usace.army.mil/Missions/Value-to-the-Nation/Recreation/ (2022).
Locke, D. H. United States census block groups converted to singlepart polygons and with water removed, 2013-2017. https://www.fs.usda.gov/rds/archive/catalog/RDS-2022-0054 (2022).
Browning, M. H. E. M. & Rigolon, A. Could nature help children rise out of poverty? Green space and future earnings from a cohort in ten U.S. cities. Environmental Research 176, 1–13 (2019).
Article Google Scholar
Wolch, J., Wilson, J. P. & Fehrenbach, J. Parks and Park Funding in Los Angeles: An Equity-Mapping Analysis. Urban Geogr 26, 4–35 (2013).
Article Google Scholar
Rigolon, A. Parks and young people: An environmental justice study of park proximity, acreage, and quality in Denver, Colorado. Landscape Urban Plan 165, 73–83 (2017).
Article Google Scholar
Dewitz, J. National Land Cover Database (NLCD) 2016 Products. U.S. Geological Survey. https://doi.org/10.5066/P96HHBIE (2019).
Census, U. S. Bureau. American Community Survey Data. https://www.census.gov/programs-surveys/acs/data.html (2020).
Williams, T. G., Logan, T. M. & Zuo, C. T. Parks and safety: A comparative study of green space access and inequity in five US cities. Landscape and Urban Planning (2020).
Pham, T.-T.-H., Apparicio, P., Séguin, A.-M., Landry, S. & Gagnon, M. Spatial distribution of vegetation in Montreal: An uneven distribution or environmental inequity? Landscape Urban Plan 107, 214–224 (2012).
Article Google Scholar
Li, X., Zhang, C., Li, W. & Kuzovkina, Y. A. Environmental inequities in terms of different types of urban greenery in Hartford, Connecticut. Urban For Urban Gree 18, 163–172 (2016).
Article Google Scholar
Rigolon, A. A complex landscape of inequity in access to urban parks: A literature review. Landscape and Urban Planning 153, 160–169 (2016).
Article Google Scholar
Nesbitt, L., Meitner, M. J., Girling, C., Sheppard, S. R. J. & Lu, Y. Who has access to urban vegetation? A spatial analysis of distributional green equity in 10 US cities. Landscape and Urban Planning 181, 51–79 (2019).
Article Google Scholar
Commission, E. A recommendation on the method to delineate cities, urban and rural areas for international statistical comparisons. https://unstats.un.org/unsd/statcom/51st-session/documents/BG-Item3j-Recommendation-E.pdf (2020).
Larson, L. R. et al. Greenspace and park use associated with less emotional distress among college students in the United States during the COVID-19 pandemic. Environ Res 112367, https://doi.org/10.1016/j.envres.2021.112367 (2021).
Chakraborty, T., Hsu, A., Manya, D. & Sheriff, G. A spatially explicit surface urban heat island database for the United States: Characterization, uncertainties, and possible applications. ISPRS Journal of Photogrammetry and Remote Sensing 168, 74–88 (2020).
Article ADS Google Scholar
Hull, R. B. How the public values urban forests. Journal of Arboriculture (1992).
Suchocka, M. et al. Old trees are perceived as a valuable element of the municipal forest landscape. Peerj 10, e12700 (2022).
Article Google Scholar
Hofmann, M., Gerstenberg, T. & Gillner, S. Predicting tree preferences from visible tree characteristics. Eur J Forest Res 136, 421–432 (2017).
Article Google Scholar
Olmsted, F. L. Trees in streets and in parks. In The highest value of a park (1882).
Robinson, W. S. Ecological correlations and the behavior of individuals. American Sociological Review 15, 351–357 (1950).
Article Google Scholar
Fowler, C. S., Frey, N., Folch, D. C., Nagle, N. & Spielman, S. Who are the People in my Neighborhood?: The “Contextual Fallacy” of Measuring Individual Context with Census Geographies. Geographical Analysis 31, 461–14 (2019).
Google Scholar
Talen, E. Neighborhoods as Service Providers: A Methodology for Evaluating Pedestrian Access. Environ Plan B Plan Des 30, 181–200 (2001).
Article ADS Google Scholar
Ludwig, C., Hecht, R., Lautenbach, S., Schorcht, M. & Zipf, A. Mapping Public Urban Green Spaces Based on OpenStreetMap and Sentinel-2 Imagery Using Belief Functions. Isprs Int Geo-inf 10, 251 (2021).
Article Google Scholar
Venter, Z. S. et al. Is green space associated with reduced crime? A national-scale study from the Global South. Sci Total Environ 825, 154005 (2022).
Article ADS CAS Google Scholar
Kraemer, R. & Kabisch, N. Parks in context: advancing citywide spatial quality assessments of urban green spaces using fine-scaled indicators. Ecol Soc 26 (2021).
Zhou, Y., Lengerke, Tvon & Dreier, M. Comparing different data sources by examining the associations between surrounding greenspace and children’s weight status. Int J Health Geogr 20, 24 (2021).
Article Google Scholar

Download references

Acknowledgements

The authors are incredibly grateful for the time and attention Kuiran Zhang and Fu Li (Clemson University Virtual Reality & Nature Lab, Clemson, SC) paid to the quality assurance quality control check of the R scripts and results. The authors also thank the following individuals for consulting on the curation of state trust lands: Jordan Smith, Associate Professor in the Environment and Society Department and Director of the Institute for Outdoor Recreation and Tourism, Utah State University, Logan; Ben Lawhon, Senior Director of Research and Consulting, Leave No Trace, Boulder, CO; and Liz “Snorkel” Thomas, professional hiker, author, and speaker, www.eathomas.com. Last, the authors thank Clemson University for its allotment of computing time on the Palmetto Cluster.

Author information

Authors and Affiliations

Department of Parks, Recreation and Tourism Management, Clemson University, Clemson, South Carolina, 29634, USA
Matthew H. E. M. Browning & Ryan Gagnon
NatureQuant LLC, 907 E. 20th Ave., Eugene, OR, 97405, USA
Matthew H. E. M. Browning & Christopher Bailey
Department of City and Metropolitan Planning, The University of Utah, 375 South 1530 East, Salt Lake City, Utah, 84112, USA
Alessandro Rigolon
OPENspace Research Centre, School of Architecture and Landscape Architecture, University of Edinburgh, 74 Lauriston Place, Edinburgh, EH3 9DF, UK
Scott Ogletree
Centre for Public Health, Block A, Royal Victoria Hospital, Queen’s University Belfast, Belfast, Northern Ireland, United Kingdom
Ruoyu Wang
Department of Environmental Health, Harvard T. H. Chan School of Public Health, 655 Huntington Avenue, Boston, Massachusetts, 02115, USA
Jochem O. Klompmaker & Peter James
Channing Division of Network Medicine, Department of Medicine, Brigham and Women’s Hospital, 181 Longwood Avenue, Boston, Massachusetts, 02115, USA
Jochem O. Klompmaker
Department of Population Medicine, Harvard Medical School and Harvard Pilgrim Health Care Institute, 401 Park Drive, Boston, Massachusetts, 02215, USA
Peter James

Authors

Matthew H. E. M. Browning
View author publications
You can also search for this author in PubMed Google Scholar
Alessandro Rigolon
View author publications
You can also search for this author in PubMed Google Scholar
Scott Ogletree
View author publications
You can also search for this author in PubMed Google Scholar
Ruoyu Wang
View author publications
You can also search for this author in PubMed Google Scholar
Jochem O. Klompmaker
View author publications
You can also search for this author in PubMed Google Scholar
Christopher Bailey
View author publications
You can also search for this author in PubMed Google Scholar
Ryan Gagnon
View author publications
You can also search for this author in PubMed Google Scholar
Peter James
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

M.B., A.R. and S.O. conceived of the research. M.B., S.O. and C.B. processed the data. M.B. wrote the manuscript. All authors contributed to and revised the manuscript. All authors read and agreed to the published version of the manuscript.

Corresponding author

Correspondence to Matthew H. E. M. Browning.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary Information

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Browning, M.H.E.M., Rigolon, A., Ogletree, S. et al. The PAD-US-AR dataset: Measuring accessible and recreational parks in the contiguous United States. Sci Data 9, 773 (2022). https://doi.org/10.1038/s41597-022-01857-7

Download citation

Received: 08 August 2022
Accepted: 21 November 2022
Published: 16 December 2022
DOI: https://doi.org/10.1038/s41597-022-01857-7