INTRODUCTION

In recent years, there have been a number of studies on intra-urban variation in pollution.1, 2, 3, 4, 5, 6, 7 Relatively few of these studies, however, have examined multiple pollutants, and none, to our knowledge, have captured spatial variation during selected hours of the day — such as during rush hours or under temperature inversion conditions. This spatial variation in “peak” exposures may have strong bearing for both refined source identification and epidemiology, as daily maximum exposures may differ substantially across an urban area, and may be particularly important for epidemiological studies of acute cardiovascular and respiratory events.

This gap in the research has been due, in large part, to limitations in sampling technology, as few land use regression (LUR) campaigns have been able to employ a fleet of monitors with the ability for temporally-controlled sampling. Because these LUR models of fine-scale variability are commonly used to derive exposure estimates for epidemiology, this sampling methodology should be well developed, including clear attention to the selection of sampling intervals (e.g., morning vs afternoon; 24-h integrated; rush hour vs full day).

Intra-urban air pollution concentrations can vary due to proximity to industrial sources, traffic density, and other site characteristics, such as population and land use, and may be modified by elevation and meteorological factors.3 During temperature inversions, atmospheric convection and pollutant dispersion may be limited, intensifying concentrations near sources. Thus, peak (or maximum) spatial contrasts within urban areas may be expected during hours of limited atmospheric mixing and significant source activity. Better understanding these peak exposure contrasts can lead to better characterization of intra-urban air pollution gradients, source identification, and assessment of meteorological impacts on urban concentrations. More recent LURs have used active sampling and multi-pollutant approaches, often reducing the number of samples which can be taken concurrently.

Pittsburgh, PA is characterized by complex terrain, periods of heavy traffic, large industrial sources, and frequent inversion events.8 This combination of topography, meteorology, and emission sources results in significant spatial variability in many pollutant concentrations, including fine particulate matter (PM2.5) and black carbon (BC). Although regional industrial air pollution has decreased over recent decades, local emissions inventories remain dominated by a few large legacy steel mills and coke works southeast of Pittsburgh (Edgar Thomson Steel Works and Clairton Coke Works).9, 10, 11 Accordingly, Pittsburgh and Allegheny County remain in federal PM2.5 non-attainment,12, 13 as local regulatory monitors exceed both the average annual (>12 μg/m3) and daily (>35 μg/m3) National Ambient Air Quality Standards for PM2.5.12 The combination of hills and river valleys, episodes of traffic congestion, and large legacy industrial sources means that temperature inversion events can trap pollutants along the river valleys, where local industry is concentrated and traffic emissions can accumulate,8, 14 intensifying spatial contrasts in pollutant concentrations.15, 16

An improved understanding of pollution variability across Pittsburgh, with attention to this complex interplay among topography, industrial and vehicular sources, and frequent morning inversions could provide insights and reproducible models useful in other urban areas.17 In a previous study, we identified frequent morning temperature inversion events in the Pittsburgh industrial suburb of Braddock from 0700 to 1000 hours.8 US Environmental Protection Agency (EPA) regulatory data from multiple stations in Allegheny County suggested increased PM2.5 concentrations between 0600 and 0700 hours, coinciding with the start of morning rush-hour traffic.

Here, we employed a series of saturation monitoring campaigns to characterize intra-urban variability under two different temporal regimes (inversion-focused and 24-h integrated), using a suite of programmable monitors to capture spatial variation during hypothesized peak concentration hours.14 A citywide sampling design allowed for exploration of mean variation in PM2.5 and BC concentrations based on spatial, seasonal, and temporal differences. Seasonal pollutant-specific LUR modeling was applied to characterize intra-urban variability in air pollution concentrations, and smooth surface pollutant-specific concentration maps were created.2, 3, 18 We hypothesized that inversion-focused sampling (0600–1100 hours, Monday through Friday) would reveal stronger spatial contrasts, and stronger impacts of local emissions sources, than would the 24-h week-long sampling scheme. We anticipated that PM2.5 and BC concentrations would vary by sampling hours, elevation, proximity to industry, traffic density, and frequency of inversion events.

METHODS

Study Design

Sampling site allocation and classification is detailed in Shmool et al.14 Briefly, GIS-based methods were used to quantify spatial distributions of local pollution sources (i.e., proximity to industry, traffic density) and potential modifiers (i.e., elevation). These three factors hypothesized to impact upon local concentrations were dichotomized and cross-stratified to represent eight different combinations (Figure 1). Regular 100 m2 lattice cells were characterized according to these cross-strata, and sampling locations (n=37/year) were systematically allocated by stratified random sample to capture spatial and source variability across the Pittsburgh metropolitan area (~388 km2). The reference site, used throughout the sampling season to assess regional background pollution, was located at Settlers Cabin Park in Carnegie, PA, approximately 9 miles upwind of the study area. For comparison, an urban reference site in the community of Braddock, PA was also sampled every session.

Figure 1
figure 1

Domain within Allegheny County stratified according to classification system, with monitoring sites. Sites were classified by traffic density (high vs low), elevation (valley vs non-valley), and proximity to industry (near vs far).

For each year 1 (inversion-focused monitoring), 5-day (Monday through Friday) sampling session, samplers operated during 0600–1100 hours at six randomly-selected sites (25 h of sampling) to capture hours of frequent atmospheric inversions and heavy traffic, limit of detection for multiple pollutants. Six sampling sessions were performed during summer 2011 (July 25 to September 9) with 1 week skipped for logistical reasons. Sessions were repeated for winter 2012 (January 16 to February 24).

Year 2 (24-h integrated monitoring) sampling methods were similar to year 1, although samplers operated for an integrated 24-h 7-day sample of 15 min per hour (42 h of sampling). One-third of the sites (n=13) from inversion-focused sampling were repeated during 24-h sampling, for direct comparison.14 Year 2 sampling was performed in summer 2012 (June 5 to July 26) and repeated in winter 2013 (January 8 to March 10), with a total of eight sites sampled per session. Due to limited equipment availability, each sampling scheme was performed during a separate year.

Monitoring Instrumentation and Quality Control

Using a temperature (20°C) and relative humidity (35%) controlled glove box (PlasLabs Model 890 THC), 37 mm Teflon filters (Pall Life Sciences) were equilibrated for 48 h and then pre-weighed and post-weighed using an ultramicrobalance (Mettler Toledo Model XP2U). Sampling units were custom-designed to capture integrated street-level samples of PM2.5; Harvard Impactors (Air Diagnostics and Engineering) with 37-mm Teflon filters and a HOBO data logger (Onset Computer Corporation) were contained in waterproof Pelican cases.19, 20 Instruments were programmed for specific hours of sampling using a chrontroller (ChronTrol Corporation). A tetraCal volumetric air flow calibrator (BGI Instruments) was used to calibrate the flow to 4.0 liters/min. A HOBO data logger recorded temperature and relative humidity at 15-min intervals. All samplers were deployed on utility poles at a height of 3 m. PM2.5 concentrations were calculated using presampling and postsampling Teflon net filter weights and sampling volume. BC was measured (in absorbance units) using an EEL43M Smokestain Reflectometer (Diffusion Systems) and standard protocols.21

GIS-based Source Density Indicators

GIS-based covariates were calculated across a range of source indicator categories (Table 1). All analyses were conducted using ESRI ArcInfo Version 10 (Redlands, CA, USA), and all covariates summarized within concentric radial buffers surrounding each monitoring location (50–1000 m). Roadway shapefiles for Allegheny County were obtained from Pennsylvania Department of Transportation’s (PennDOT) publicly available annualized average daily vehicle-count data for primary roadways. Traffic covariates included: mean kernel vehicle density, sum of traffic signaled intersections, mean density of bus and truck traffic, summed length of roadway (in feet), and total traffic kernel density (vehicle count). Point elevation at each monitoring location, and the average elevation within multiple radial buffers, was assessed using the National Oceanographic and Atmospheric Digital Elevation Model.22, 23 For industrial emissions, PM2.5 (filterable plus condensable), nitrogen oxides (NOx), sulfur dioxide (SO2), and volatile organic chemicals (VOCs) were aggregated from the USEPA’s 2011 National Emissions Inventory (NEI) over a six-county region, and estimated emissions (tons) were weighted using inverse distance weighting.24 Total areas of industrial and commercial parcels within varying buffers were calculated using 2012 assessment data from the Allegheny County Office of Property Assessments. The locations and areas of commercial and industrial areas were validated using land use/land cover data derived from 2000 to 2001 orthophotography. The distance from, and summed line length of, active railroad was assessed. Census data from 2011 were obtained at the block group level, to calculate population density.25

Table 1 GIS-based source density indicators used for LUR modeling.

Meteorology

Atmospheric sounding data (i.e., Skew-T diagrams) were used to identify the presence of temperature inversions during the 0600–1100 sampling hours (year 1) and the 24-h integrated hours (year 2).26 A binary inversion metric was created in which inversion presence on each sampling day was determined. Hourly meteorological data (e.g., wind speed, wind direction, temperature, precipitation, ceiling height) were downloaded from the National Climate Data Center in TD-3505 (ISHD — full archival) format. Radiosonde upper air data was collected at the Pittsburgh National Weather Service station located in Moon Township, PA, approximately 15 miles west (upwind by predominant wind direction) of Pittsburgh and was obtained from the National Oceanic and Atmospheric Administration.27

Temporal Adjustment

PM2.5 and BC concentrations at distributed monitoring sites were temporally adjusted using the background reference concentration, which ran concurrently with all sampling sessions. Because all monitoring sites could not be sampled in the same week, temporal adjustment was performed using a reference site, to allow for comparisons between sites. This adjustment also allows us to assess the temporal contribution in the LUR models. Temporal adjustment methods and sensitivity tests are detailed elsewhere14 PM2.5 concentration at a particular monitoring location was divided by the specific weekly PM2.5 concentration from the background site then multiplied by the seasonal average PM2.5 concentration from the background site.

Quality Assurance/Quality Control

A seventh monitoring session was performed each season to co-locate monitors at four randomly-selected sites. Field blanks were deployed during each session, and all concentrations were blank-corrected. Hourly PM2.5 concentrations were obtained from Allegheny County Health Department (ACHD) for the Liberty, Lawrenceville and Avalon EPA Air Quality System (AQS) monitoring locations. ACHD data were aggregated to match sampling hours each session, and used as additional validation sites, to corroborate temporal trends observed at our background and urban reference sites. BC data was not available from ACHD monitors.

Statistical Analysis

Descriptive statistics, scatterplots, and histograms were used to characterize distributions of PM2.5 concentrations and BC absorbance, as well as spatial covariates (e.g., traffic density, elevation, industrial emissions) and temporal covariates (temperature, relative humidity, wind speed, wind direction, frequency of inversions) (Table 2). Before modeling, bivariate source–pollutant correlation analyses were performed. Data analysis and model building were performed separately for PM2.5 and BC and for summer and winter seasons. Statistical analyses were conducted using the Proc GLM command in SAS version 9.3 (SAS Institute, Cary, NC, USA).

Table 2 Descriptive statistics for citywide air sampling temporally-adjusted pollutant concentrations and meteorology.

LUR models were implemented using manual forward step-wise linear regression to assess raw PM2.5 and BC concentrations for the summer and winter seasons, using an adapted version of the modeling approach described in Clougherty et al.19 The full set of source indicator covariates (Table 1) were tested individually for each pollutant by year and by season. First, bivariate correlation coefficients (Spearman rho) were examined, and the two covariates with the highest correlations from each source category were individually incorporated, ordered by strength of the bivariate correlation. Temporal trends in PM2.5 and BC were first incorporated into LUR models using the sampling session-specific background concentration. Source terms with the strongest univariate correlation with the temporally-adjusted pollutant were then incorporated individually, in descending order by strength of the bivariate correlation. Regression models were sequentially fit to assess overall model improvement at each stage, using the coefficient of determination (R2) and removing non-significant covariates in the order of descending P-value, until all terms were significant (P<0.05). Covariates were removed, at any stage, if variance inflation factor became >2.0. Finally, we tested modification of each significant source term effect by inversion frequency, elevation, and wind speed (median dichotomized).

LUR model residuals were mapped to identify systematic spatial variation and locations poorly predicted by LUR, suggesting incorporation of additional covariates (i.e., inverse distance to NEI sites, elevation). Semivariograms of residuals were created in GIS, and residuals were mapped against latitude and longitude coordinates (decimal degrees) of monitoring locations to explore residual patterns. Spatial autocorrelation in residuals was tested using Moran’s I statistic. For each final model, a spatial R2 (roughly, the proportion of spatial variance explained by LUR terms) was derived, using final LUR covariates to predict temporally-adjusted concentrations.

Predicted PM2.5 concentrations and BC absorbance were mapped across a regular 100 × 100 m2 grid, smoothed using inverse distance weighting (IDW), allowing spatial influence from nearest 100 grid cell centroids. Isolines were calculated to connect points of equal concentration across the IDW surfaces, for visualization. Contour intervals of 2 μg/m3 for PM2.5 and 0.5 abs units for BC were selected for display.

Sensitivity Analyses

Covariate selection was sensitivity-tested using scatterplots to assess fit between each significant predictor and raw pollutant concentrations, to ensure that candidate covariates selected captured variability across the range of the data, not reliant on outliers or influential points. Tree structures and Random Forest automated methods were performed to corroborate covariate selection. A scatterplot of each retained term was tested against the residual of the prior model in the sequential model-building process to check for outliers. Model residuals were examined to ensure normality. Spatial autocorrelation across the residuals of the distributed sites was determined using Moran’s I, and spatial correlations were determined using generalized additive models (GAM) to determine whether residual smoothing was required.

Sensitivity of the model-building procedure to temporal adjustment was assessed by comparing two methods (background only vs background and urban reference site adjustment) and by modeling covariates against temporally-adjusted pollutant concentrations to determine the percentage of spatial variation across the eight LUR models. Backwards elimination from multivariate linear models including all covariates with significant bivariate correlations with pollutant concentrations further corroborated model structure and covariate selection. For validation of LUR predictions, a random 20% of sites (n=7) were removed from the analysis, and the LUR model was re-fit and used to predict pollutant concentrations at withheld sites. Analyses were performed in SAS, v 9.3, GIS ArcInfo, v 10.1 (ESRI, Redlands, CA, USA), R statistical software v 2.12.1, and Microsoft Excel 2010.

RESULTS

Inversion-Focused Monitoring (Year 1, Summer 2011/Winter 2012)

Temporally-adjusted PM2.5 concentrations and BC absorbance were, on average, higher in the summer compared with winter, using inversion-focused sampling. Regional inversions were observed on one to three days during each summer session and on two to four days during each 5-day winter session (Table 2).

24-h Integrated Monitoring (Year 2, Summer 2012/Winter 2013)

Temporally-adjusted PM2.5 concentrations and BC absorbance were, on average, higher in the summer compared with winter in 24-h sampling. PM2.5 concentrations and BC absorbance were lower, on average, using 24-h sampling, compared with inversion-focused measures, for both summer and winter (Table 2). Differences between PM2.5 concentrations by sampling scheme were statistically significant for winter (P=0.04), but not for summer (P=0.70) (Supplementary Figure S1) We found statistically significant elevated BC levels for inversion-focused sampling in both the summer (P=0.002) and winter (P=0.0002) (Supplementary Figure S2) Greater variability in pollutant concentrations was detected in the inversion-focused campaigns compared with 24-h integrated (Figure 2).

Figure 2
figure 2

Heightened spatial contrasts in temporally adjusted PM2.5 concentrations and BC absorbance were found under inversion-focused monitoring compared with 24-h integrated. The same pattern was found across repeated sites that were monitored under both sampling regimes.

Comparison of Repeated Sites

A subset (n=13) of monitoring sites were sampled using both sampling regimes, for direct comparability. Temporally-adjusted PM2.5 concentrations in both seasons were higher under inversion-focused sampling (summer mean=14.88 (SD=5.04) μg/m3; winter mean=13.74 (SD=3.37) μg/m3) than for 24-h sampling (summer mean=14.42 (SD=2.96) μg/m3; winter mean=11.49 (SD=2.66) μg/m3). Greater range of measured concentrations was found under inversion-focused sampling (Table 2, Figure 2).

Concentrations were similar using each sampling method at the background reference site during both seasons (summer 24-h=11.92 (SD=3.99) μg/m3 vs summer inversion-focused=11.91 (SD=2.22) μg/m3); (winter 24-h=8.43 (SD=1.76) μg/m3 vs winter inversion-focused=8.64 (SD=1.73) μg/m3). BC results were similar to those found for PM2.5 concentrations.

PM2.5 Variability at Reference Sites

Season-specific PM2.5 temporal trends at the background reference site (Settlers), used for LUR modeling, were corroborated by trends observed at an urban reference site (Braddock) and by EPA sites (n=2 for inversion-sampling, n=4 for 24-h sampling) (Figure 3). Temporal trends from our reference sites and EPA monitoring sites were similar in both years.

Figure 3
figure 3

Temporal trends across reference sites for all sampling seasons.

Meteorology

For the specific sampling hours, under both sampling methods, wind direction was predominantly from the west, and wind speeds were generally higher during winter than summer (Figure 4). For summer inversion-focused sampling, session 6 winds were from the south. Higher variability in wind direction occurred during 24-h integrated summer sampling, where three sessions had non-westerly winds. The frequency of inversions within a given sampling session was not a significant predictor for either pollutant in any season (P>0.05), and likewise an interaction term between inversion frequency and source terms was not significant.

Figure 4
figure 4

Wind rose diagrams for summer (top) and winter (bottom) across all hours of sampling.

PM2.5 LUR Modeling

In all season-specific PM2.5 LUR models, a substantial portion of variability in concentrations was explained by the background reference site (Table 3). For inversion-focused summer sampling, the spatial pattern in PM2.5 was predicted by commercial and industrial land use within a 200-m buffer (Table 3). For 24-h summer sampling, spatial variance in PM2.5 was predicted by the IDW of PM2.5 emissions and wind direction (interquartile range (IQR) PM2.5 increase of 1.62 μg/m3 with west/northwesterly winds; Table 3, Figure 5). For inversion-focused winter sampling, the LUR included land use and average wind speeds (m/s) (−1.89 μg/m3 PM2.5 per unit (m/s) increase in wind speeds; Table 3). For 24-h winter sampling, the number of signaled intersections within a 750-m buffer, industrial PM2.5 emissions, and industrial land use contributed toward explaining spatial variation in PM2.5 (Table 3; Figure 5).

Table 3 Land use regression (LUR) covariates and model fits for inversion-focused and 24-h PM2.5.
Figure 5
figure 5

Seasonally-averaged predicted PM2.5 exposure surface maps for inversion-focused summer and winter (left) and 24-h integrated summer and winter (right) sampling. For the 24-h integrated summer PM2.5 map, wind directions were assumed to be W/NW (predominant wind direction), as a covariate in the LUR model. For the inversion-focused winter PM2.5, wind speeds were averaged across the season for these specific sampling hours and applied to all sampling locations, as a covariate in the LUR model.

Although heightened spatial contrasts were seen during inversion-focused sampling, a greater proportion of variability in PM2.5 was explained by source terms and meteorology for 24-h sampling in both seasons (Figure 5). For the summer, the spatial R2 was 0.34 for inversion-focused sampling, notably lower than the R2 of 0.60 for 24-h sampling. During winter, the spatial R2 was also lower for inversion-focused sampling (0.40), compared with the R2 of 0.56 for 24-h sampling. Specifically, industrial emissions and wind direction were significant predictors under 24-h sampling but not under inversion-focused monitoring.

BC LUR Modeling

The weekly temporal term from the background reference site explained lesser variability in each BC model, relative to PM2.5 models (Table 4). For inversion-focused summer sampling, the spatial pattern in BC was predicted by industrial land use and elevation within a 1000-m buffer (an IQR decrease of −0.58 abs per unit increase in elevation) (Table 4). For 24-h summer sampling, BC was predicted by the IDW of PM2.5 emissions, commercial and industrial land use within a 200-m buffer, and wind direction (0.25 abs increase with west/northwesterly winds) (Table 4; Figure 6).

Table 4 Land use regression (LUR) covariates and model fits for inversion-focused and 24-h black carbon (BC).
Figure 6
figure 6

Seasonally-averaged predicted BC exposure surface maps for inversion-focused summer and winter (left) and 24-h integrated summer and winter (right) sampling. For the 24-h integrated summer BC, wind directions were assumed to be W/NW (predominant wind direction), as a covariate in the LUR model. For the inversion-focused winter BC, wind speeds were averaged across the season for these specific sampling hours and applied to all sampling locations, as a covariate in the LUR model.

For inversion-focused winter sampling, industrial land use, signaled intersections within a 500-m buffer, average wind speed (m/s) (0.33 abs decrease per unit increase in wind speeds), and elevation explained spatial variation (Table 4). For 24-h winter sampling, the final model included the IDW of PM2.5 emissions and industrial land use (Table 4) (Figure 6). Additional LUR output for PM2.5 and BC models can be found in Supplementary Tables S1 and S2.

During summer months, a greater percentage of variability was explained for 24-h sampling. For winter sampling, more variability was explained under inversion sampling. A greater range of variability in BC absorbance was observed under the inversion-focused, vs the 24-h integrated sampling scheme, during both summer and winter (Figure 6). For summer, the inversion-focused spatial R2 was 0.51, notably lower than the R2 of 0.70 for 24-h sampling. During winter, the inversion-focused spatial R2 was higher (0.75) than for 24-h sampling (0.54).

Sensitivity Analyses

Scatterplots ensured final models were not driven by outliers or influential points; one influential site was removed from summer inversion-focused modeling. Tree structures and Random Forest automated methods were used to confirm covariates that were incorporated and retained in final models. Moran’s I and GAM indicated no spatial autocorrelation in the residuals across distributed monitoring sites. LUR models with temporal adjustment using the average concentration at both the urban and background reference sites provided comparable results to models using only the regional background site concentration. Removal of a random subset (20%) of monitoring sites did not significantly change any of the eight models, and predicted concentrations of the withheld sites were within 10–15% of the measured concentrations.

DISCUSSION

We found significant spatial variability in PM2.5 and BC ambient concentrations across 37 metropolitan sites under both inversion-focused and 24-h integrated sampling schemes for two summer and two winter seasons (June 2011 to March 2013). As hypothesized, the inversion-focused sampling approach revealed greater spatial contrasts in concentrations across a region of complex terrain, in both PM2.5 and BC, compared with 24-h sampling. These differences were greater by sampling scheme than by season for both pollutants. However, GIS-based source terms did not explain more concentration variability under the inversion-focused approach compared with 24-h sampling.

The hours we hypothesized as “peak” exposure hours were selected based on: (1) morning rush hour congestion, and (2) the likelihood of inversion presence, based on our previous mobile monitoring campaign.8 Our results indicate that there is greater spatial variation in exposures during these hours, which can be captured using this systematic saturation approach.

PM2.5 temporal trends at our background reference monitor were consistent with regulatory monitors for both seasons and years regardless of sampling scheme. Under both sampling schemes, higher PM2.5 concentrations and BC absorbance were generally found at sites nearer to industry or at lower elevation (i.e., in valleys).

PM2.5 models were comparable across seasons, and in the LUR models, 47–66% of the spatial variability in concentrations was explained by the background reference site, regardless of sampling method, although GIS-based source terms generally explained a greater percentage of variability in the 24-h integrated models. The combination of commercial and industrial parcels were stronger than commercial or industrial parcels alone, suggesting higher activity in an area with more traffic and trucks may lead to heightened pollutant levels. One explanation for identifying weaker source effects in the “peak” hour models is that the spatial covariates generally available for LUR represent long-term source averaging (e.g., annual average emissions or average daily traffic), and thus may not accurately capture peak concentrations such as during morning rush hours. Because static GIS-based covariates may not be sufficient, more sophisticated covariates such as rush hour traffic information, vehicle fleet breakdown, or site-specific source dispersion information may better complement our temporally purposeful sampling scheme. Relatedly, 7-day samples may be better predicted by annual average covariates (e.g., traffic density) than are 5-day samples–an effect which is not accounted for by using period-specific reference site data.

BC models were also generally comparable across seasons, with less variability explained by the background reference site (R2=0.04–0.28) and greater observed local influence of land use, industrial emissions, and elevation. Number of signaled intersections was the only traffic indicator found significant in any model — for winter BC under inversion-focused sampling and for winter PM2.5 under 24-h sampling. Other studies have found significant traffic contributions, as total and truck traffic densities were found for wintertime PM2.5 and BC in the New York City Community Air Survey (NYCCAS) and traffic density within a 300-m buffer was found in Los Angeles.14, 19

Inversions and Meteorology

Although we found greater differences in concentrations across sites under inversion-focused sampling, frequency of inversions was not a significant predictor or source modifier. We previously found that inversion frequency modified the effect of elevation but did not remain in final inversion-focused LUR models here. This may be due to the low inversion variation between sampling sessions, and high frequency of inversions in our data set — producing minimal variation across sessions, and making it difficult to observe effects. In addition, variation in intensity of inversion events was not captured in our meteorological data — which may be more important than inversion presence. Finally, our inversion characterization (based on upwind, regional data from Pittsburgh International Airport) may be incompatible with fine-scale spatial effects within our domain.

Meteorological factors such as higher temperature and relative humidity have been found to have a role in higher PM2.5 concentrations in the eastern United States, and although temperature and relative humidity were collected at each sampling unit, contemporaneous with sampling events, these factors were not significant predictors in LUR modeling. Changes in temperature and relative humidity may be accounted for in the background temporal term used in the models.28 Given less spatial variance in PM2.5 across our sites, relative to the temporal contribution from the upwind reference monitor, this may indicate a greater relative influence of regional contribution and secondary formation.

Elevation and Complex Terrain

In our domain, elevation is spatially confounded with pollution sources, as many industries and highways in our region are located within the river valleys. Our stratified random sampling approach was designed to help distinguish these two potentially important drivers of local concentrations and examine the role of elevation in trapping pollutants during inversion events (0600–1100 hours). However, source effects were not significantly modified by elevation in LUR models. The most significant elevation covariate was at the largest buffer (1000 m) — possibly a proxy for low source intensity in locations of complex terrain (e.g., sharp elevation gradients), rather than hypothesized topography–meteorology interactions. For BC, the significance of the elevation term may reflect the influence of local sources relative to long-range transport, given the lesser amount of variability explained by the background concentration.

Sampling Density

Our study in Pittsburgh included 37 sampling locations across 388 km2 (a sampling density of 0.10 sites/km2). Although our domain was large — and included urban, suburban, and semi-rural sites — our sampling density is within the range of other LURs. Though some (e.g., NYCCAS) captured 150 sites in a 777 km2 domain (0.19 sites/km2),20 many LURs have been developed from much sparser data (e.g., in Los Angeles, 23 sampling sites across 98,500 km2 (0.0002 sites/km2), and in Vancouver, 80 sampling sites across 2199 km2 (0.04 sites/km2)).6, 7

Limitations

One key limitation of our study was the logistical inability, due to equipment availability, to capture inversion-focused and 24-h integrated samples simultaneously at each site. Nevertheless, concentrations from the 13 repeated sites were correlated across years and seasons, reference site data using each method was consistent with regulatory monitors, and temporal adjustments explained comparable variability in both years. Data were obtained from the meteorological station at Pittsburgh International Airport — rather than within our domain — to assess inversion frequency, wind speeds, and wind direction across sampling sites. Inversion presence was assessed as the percentage of days on which an inversion was detected, although we may have missed some events, as environmental soundings are collected only twice per day. The overall high frequency of inversion events resulted in low variation across sessions and sites. Finally, both “low” and “high” elevation sites were assigned the same inversion frequency metric, likely misclassifying inversion presence at sites of very different elevation. We examined inversion effects separately for low and high elevation sites, although this analysis may have had insufficient variation.14

Strengths and Implications

This inversion-focused study allowed us to determine spatial variation during selected hours of the day (hypothesized frequent inversion/high source-intensity hours of 0600–1100 hours) — hours during which we previously detected elevated PM2.5 in a mobile monitoring campaign in a nearby industrial community.8 Using a unique programmable sampling system and a structured modeling approach, our results show that local industrial sources of PM2.5 and elevation were key pollutant predictors during these inversion-focused hours. The range of meteorological data examined is a strength of our approach, although hourly meteorological data, and better measures of inversion intensity and local mixing height (especially, across complex terrain), would have improved specificity.

Across the metropolitan sampling domain, summer PM2.5 concentrations and BC absorbance were significantly higher than winter concentrations under both sampling methods. Inversion-focused sampling produced greater spatial contrast across sites, although the source terms in these LUR models did not explain more variability compared with the 24-h sampling method. These LUR models will be used to derive exposure estimates for future local epidemiological studies, to compare the predictive power of 24-h vs “peak” exposure contrasts in explaining chronic disease outcomes. Our sampling approach could be replicated in other cities, with an emphasis on attempting to observe greater pollutant spatial contrast during hypothesized peak hours, which may be more predictive of some health outcomes.