by Stefan Falke, stefan@mecf.wustl.edu
Center for Air Pollution Impact and Trend Analysis (CAPITA)
January, 1996
Pollutant concentrations measured at monitors such as those comprising the Aerometric Information Retrieval System (AIRS) (U.S. EPA, 1994) and the Interagency Monitoring of Protected Visual Environments (IMPROVE) (Sisler et al., 1993) provide measured concentrations for specific points across the U.S. Extrapolation or interpolation is required to obtain concentrations at locations between monitoring sites. A standard interpolation technique averages the monitor values surrounding a location weighted by the distance between the location and monitor. This type of averaging does not consider other factors, such as topography, which influence a location's air pollutant concentration. This can cause considerable inaccuracies in estimated values, particularly in the rugged terrain of the Western U.S. It is desirable to develop an interpolation technique which incorporates the varying elevation of the contiguous U.S.
In general, locations at high altitudes have lower aerosol concentrations than in neighboring valleys (Dutkiewicz et al., 1987; Husar et al., 1980). Pollutant sources are not normally found on mountain tops and it can be assumed that the majority of anthropogenic pollutant emissions are found in low lying areas. In addition, the mixing layer height (the lowest [~1-3km] of the atmosphere in which surface level pollutants are generally uniformly mixed up to but not above) is usually below the peaks of mountain ranges resulting in little dispersion of polluted air to higher altitudes. On the other hand, valleys channel air flow and often "trap" polluted air leading to high pollutant concentrations. This assumption, namely that at higher elevations pollutant concentrations are lower than at surrounding lower elevations, is the basis for the application of elevation in correcting PM10 (particulate matter < 10µm in diameter) concentrations.
The process of incorporating elevation data with concentration data is referred to as data fusion. Data fusion is the integration of multiple data sets with varying spatial and temporal resolution. By reconciling the multiple data inputs onto a common a common spatial and temporal projection, the data can be "fused" to create a new data set.
Observed PM10 data from the AIRS and IMPROVE networks, U.S. elevation and meteorological scale height data were combined to generate elevation corrected U.S. PM10 maps. Each of the data sets underwent an individual preparation process which ensured consistent spatial and temporal resolutions before being fused in the elevation correction process.
The IMPROVE network has monitoring sites (~40 sites) located mostly in national parks and wilderness areas. The IMPROVE data used in this work were generally collected over a 24 hour period twice weekly from 1988 to June 1993. AIRS monitoring sites (~1600 sites) are mostly located in urban areas although some suburban and non-urban sites are also included. AIRS data were available from 1982 to June 1993, but very little data existed prior to 1985. The AIRS network monitored PM10 approximately every sixth day for 24-hours. The maps below show the location of the AIRS and IMPROVE PM10 stations. The boxes represent AIRS monitoring stations while the circles are IMPROVE sites. The magnitude of the squares and circles are proportional to their respective PM10 concentrations which are shown for quarter 1 (Jan., Feb., Mar.), quarter 2 (Apr., May, Jun.), quarter3 (Jul., Aug., Sep.), and quarter 4 (Oct., Nov., Dec.).
"Raw" AIRS and IMPROVE daily PM10 data were quarterly aggregated over the time range for which the data existed.
In order for a site to be included in the aggregated data set it had to have at least four months of valid data. The minimum was set at four months of data so that a single quarter from one year
of data was not used in the aggregation over the approximatley five year span. This ensured that each quarterly average was determined from at least two years of data. In the AIRS network has about 5 observations per month so the minimum was set at 20 daily values.
IMPROVE sites required at least 32 daily observations since about 8 observations existed per month.
Data flow for the creation of Quarterly PM10 Tables.
Four or five high elevation sites (depending on the quarter) were removed because of their extremely high PM10 concentrations which contradicted the assumption used for this work. Their removal was justified since the number of sites removed was small and because the extent of their influence covered large regions thereby making the results in those regions highly biased. The AIRS sites at Mammoth Lakes, Crestline, and Keeler in California
were removed because they displayed high PM10 concentrations at high elevations. In
quarter 1, AIRS site 040051003 Flagstaff was removed for this reason as was AIRS site Lone
Pine in quarter 4, IMPROVE site San Gorgonio in quarters 2, 3, 4 , and IMPROVE site Crater Lake in quarter 3.
After the filtration and aggregation was conducted separately for the AIRS and IMPROVE data, the two sets were combined and then converted into a table format
that was suitable for use with subsequent operators used in the correction process.
An operator that was frequently used in the elevation correction process was the Contourer. It converts a table of point values to a grid format by distance wieghted spatial interpolation. The inputs consist of the table,
as well as user specified inputs that include a distance weight
function, a radius of influence, a minimum required number of
data points within the radius, a maximum number of data points
used in the calculation of the grid value, and the dimensions
for the output grid. The contouring process is described in more detail in Appendix A. The contour plots of the observed PM10 data is shown below and will be used later for comparison with the elevation corrected PM10.
The contours were obtained by strictly distance weighted interpolation; by averaging a location's
surrounding PM10 values with consideration of distance from the
location. The contouring creates a number of areas with concentrations that
do not represent the area's terrain. For instance, during the winter months
in the western half of the United States many high PM10 concentration
observations in urban areas are spread out over areas which have high moutain peaks. It is known that at these high altitudes, PM10 concentrations are low but the contouring process does not
reflect this understanding.
The elevation data originated from NOAA's Terrain Base 30-second resolution digital elevation grid (NOAA, 1995). PM10 concentrations depend on surrounding sources and pollutant transport and therefore are dependent not on absolute elevation but a relative elevation above neighboring valleys containing PM10 sources. An "excess" elevation grid was defined as the absolute elevation of a grid point minus the lowest elevation (base elevation) found an approximately 50 square kilometer area around the grid point,
The derivation of the base elevation exposed some anaomlous values in the NOAA elevation grid.
In the West, a handful of grid cells displayed extremely low
elevations with respect to their neighboring cells. In some instances, a 1kmX1km grid cell with an elevation of approximately 200 meters had
neighboring cells with elevations on the order of 2000 meters. They caused the appearance of "holes" that did not reflect the actual terrain. It was assumed that these low elevation values were erroneous and were adjusted by replacing the very low elevation value with a value of one of the surrounding cells.
The Grand Canyon area is unique in its terrain. The area is relatively flat with the canyon cutting a strip of very low elevation values through it. These characteristics cause the locations neighboring the canyon to have excessively high excess elevation values. Such high excess elevations surrounding the canyon coupled with the extremely low excess elevations in the canyon produce very low elevation corrected PM10 concentrations around the canyon and very high concentrations within the canyon. Given the unique properties of the area, it was desirable to minimize its effect and consequently the Grand Canyon was "filled" so that its elevation resembled that of the surrounding terrain.
Estimating a location’s PM10 concentration requires an understanding of the scale height, or the vertical thickness of the atmosphere. The scale height upper boundary is above the mixing layer so it contains the pollutant concentrations in the mixing layer as well as those pollutants which are left above the mixing layer following the nighttime decrease in the mixing layer.
Comparing the scale height of a location with the excess elevation reveals whether or not the particular location is above the height at which most of the pollution exists or whether it is within the layer containing higher PM10 concentrations. The scale height was obtained from the CAPITA Monte Carlo Model (Patterson and Husar, 1981; Schichtel and Husar, 1994) which
simulates pollutant transport via the movement of particles. The scale height was calculated in the model every two hours from the vertical distribution of particles.
The hourly scale height was aggregated and then spatially interpolated using the Contourer to obtain the quarterly averages.
There are three regions of note: the relatively low heights over much of the West Coast (particularly in California during quarter 1), the higher heights around and just east of the Rockies (particularly in the summer), and the relatively high values over the Appalachian Mountains. The significance of these regions will be illustrated in the maps of elevation corrected concentrations.
A two step process was employed to incorporate elevation in the spatial interpolation of PM10 concentrations. The
first consisted of adjusting the measured PM10 concentrations to an identical "base"
elevation and linearly extrapolating these "base"
concentrations. The second step corrected the extrapolated base concentrations
for actual elevation. The figure below outlines the procedure and it components. The rectangles represent the operators which manipulate data. The have inputs and outputs which are designated as ovals in the diagram.
Two step data flow diagram for the creation of Elevation Corrected PM10 Maps
For each of the AIRS and IMPROVE station locations, an excess elevation and a scale height were extracted from their respective maps. The scale height (S) was used in conjunction with the excess elevation (Ee) to produce a meteorologically normalized height (Nm),
This normalized height serves as an indicator of the mixing layer's influence at a location. A normalized height greater than one indicates that the location is above the scale height and therefore above the higher pollutant concentrations within the mixing layer.
A relation of aerosol concentration as a function of elevation, where the concentrations decreased as the elevation increased, was adapted from Husar et. al. (1980) who mapped the vertical pollutant structure by sampling from aircraft making vertical spiral soundings. This relation
The quarterly aggregated measured PM10 concentration (Ci) at each AIRS and IMPROVE site was divided by f[Nm] to adjust it to a standardized elevation concentration (Co). The data flow for this procedure consists of obtaining an excess elevation and scale height for each site, calculating a normalized height, and subsequently obtaining a correction factor which was divided by the initial PM10 concentration to get Co.
The results of equation 3 formed a table of standard elevation concentrations for the AIRS and IMPROVE locations. This adjustment removed the elevational influence of the observed concentrations. The calculated Co values were subsequently mapped out using distance weighted interpolation.
The next step in the method consisted of "elevating" the Co maps to their original elevations. This was achieved using a similar procedure as described above in bringing the concentrations to a standard level except that the factor (f[Nm]) was obtained for each grid cell and was used to multiple the standardized concentration (Co) to obtain the elevation corrected concentrations, (Cc).
The monitoring stations acted as anchor points in this process. The measured concentrations are retained through the two step method of equations (3) and (4) (Cc=Ci) while their values were applied to the estimation of concentrations at other locations.
These tables were processed through the
Contourer to produce standardized PM10 concentration grids.
West of the Rockies, one notices very little correction of the PM10 concentration
because of the relatively low elevations. The Appalachians
show some adjustment but not much because of the relatively high scale heights in
the region. In the west, the concentrations are increased dramatically in some areas
because of the numerous monitoring stations residing at high elevations
especially in the Sierra Nevada mountains where the scale height
is relatively low.
The eastern half of the U.S. shows no elevation correction of the PM10 concentration except for a few areas in the Appalachians and in the mountains of northeastern New York when compared with the strictly distance dependent interpolation maps. In the West, the high elevations of the mountain ranges stand out with low concentrations while the valleys remained essentially uncorrected. Increases by a factor of two are seen in some areas, such as in western Northern California. This is due to the spreading of the standardized elevation concentrations (Co) during the contouring process. In the final correction step, these high Co values were not adjusted to lower Cc values. These areas were also characterized as having no lower elevation sites nearby to offset the spreading of the high Co values into the lower elevation regions.
The influence of the elevation/scale height ratio is noticeable in the corrected maps. The mountainous areas of California with high excess values also are characterized by relatively low scale heights resulting in a large correction. The Appalachians, on the other hand, exhibit a relatively large excess elevation but the relatively high scale heights in the area minimizes PM10 elevation correction.
The ratios displayed above represent the overall correction applied to the initial PM10. The Eastern half of the U.S. shows no elevation correction of the PM10 concentration except for a few areas in the Appalachians and in the high peaks of northeastern New York.
In the west, the high elevations of the mountains ranges stick out with low concentrations while the valleys remained essentially uncorrected. Increases on the order
of two are seen in some areas. This is due to the spreading of the standardized elevation corrected concentrations. The sites with initial concentrations were corrected and then contoured, The
contouring was elevation independent and as a result corrected values at high elevation sites were spread out into areas of lower elevation. Subsequently, in the final
correction step, these high values were no adjusted to lower values. These areas were also characterized as having no lower elevation sites nearby to offset the spreading of the high values into the lower elevation regions.
An area that stands out is Montana in the winter seasons. The higher concentrations in Montana is in part caused by low scale heights in quarters 1 and 4. Locations with a high excess elevation are often high above the scale height which results in a correction factor between 5 and 10.
The low station density in parts of Montana allows the highly corrected concentrations to be spread over flat terrain without accounting for the fact that the elevation is lower.
A property of the elevation correction process is that it leaves the observed data intact. Table 1 presents the regression coefficients and slopes for the correlation between the elevated
corrected values and the observed concentrations. A perfect fit is not seen because when creating a grid (or map) from site specific values, the grid cells are not necessarily centered over the monitoring site locations. Therefore, the grid values will not exactly match those of the monitoring sites. The role of the distance weighted interpolation in causing the lack of perfect correlation is emphasized by the fact that the regression coefficients obtained from strictly distance dependent interpolation is virtually identical to those of the elevation correction process.
The accuracy of the elevation correction process in estimating PM10 concentrations was tested by randomly removing ten percent of the observed values, running the correction process with the remaining ninety percent, and then comparing the corrected PM10 values with measured concentrations at the locations where the ten percent were removed. The cumulative results for three separate ten percent sets are shown in the table below.
Quarter 3 shows the best fit. This is likely due to the fact that the third quarter has the least variability in PM10 concentrations over the Western U.S. Table 2 also provides the regression coefficients and slope of the distance dependent interpolation with observed values using the same three 10% sets. The elevation corrected interpolation shows a slightly better correlation but because only a small number of stations are situated at locations where elevation correction occurred, their influence on the regression is limited.
Distance-weighted Interpolation
A general technique for the interpolation of point values utilizes weighting known values dependent
upon their respective distances from the desired location. This is illustrated below.

Data Sets
PM10

INITIAL PM10 Tables (AIRS & IMPROVE)
Quarter 1
Quarter 2
Quarter 3
Quarter 4




INITIAL PM10 Grids
Quarter 1
Quarter 2
Quarter 3
Quarter 4




Elevation
A 50 km2 area was found to be an adequate size to represent a location’s excess elevation. It was small enough to retain the influence of high elevation plateaus but also large enough to account for valleys within mountain ranges. Compared to the original elevation,
the base elevation is much smoother. As expected, the west coast
exhibits the most texture. The Colorado plateau is seen very
clearly and the decreasing elevation as one moves eastward from
the Rockies is seen as a gradual decline in base elevation. The excess elevation map highlights the montain peaks in the west. The excess elevation is similar to a relative elevation used by Loibl et al (1994) in estimating ozone concentrations in complex terrain.

Elevation Grids
Elevation
Base Elevation
Excess Elevation



Scale Height

Scale Height Grids
Quarter 1
Quarter 2
Quarter 3
Quarter 4





provided a correction factor (f[Nm]) which was less than or equal to one.
Results
Standardized Elevation PM10
Standardized Elevation PM10 Concentration Tables
Quarter 1
Quarter 2
Quarter 3
Quarter 4




Standardized Elevation PM10 Concentration Maps
Quarter 1
Quarter 2
Quarter 3
Quarter 4




Elevation Corrected PM10
Elevation Corrected PM10 Concentration Maps
Quarter 1
Quarter 2
Quarter 3
Quarter 4




Comparison with Initial PM10
PM10 Correction (Corrected PM10/Initial PM10)
Quarter 1
Quarter 2
Quarter 3
Quarter 4




Correlation parameters for observed PM10 concentrations with elevation corrected concentrations
Elevation Correction
Distance Weighted
Slope
Coer. Coeff.
Slope
Coer. Coeff.
Quarter 1
Quarter 2
Quarter 3
Quarter 4
Correlation statistics for the interpolation of PM10 Concentrations
Elevation Correction
Distance Weighted
Slope
Offset
Coer. Coeff.
Slope
Offset
Coer. Coeff.
Quarter 1
Quarter 2
Quarter 3
Quarter 4
Appendix A
An important data transformation processes used in this work is contouring. Most of the spatial patterns are presented on contour maps. The contours were derived from the point observations using a spatial extrapolation scheme. In the first step, the data from the random locations were projected to a uniform grid that covers the conterminous U.S. The gridding used inverse distance squared (1/r2) as the station weighing factor. The extrapolations outside the U.S. boundaries were trimmed to eliminate spurious values.
Contour
Functionality: transform data from table format into grid format
Input: Table - a set of data points
WeightFunc - distance weight function
Radius - distance constraint
MinPoints - minimum required number of data points within Radius
MaxPoints - maximum number of data points used in calculation of a grid cell
Output: Grid - uniformly distributed set of data points
function Contour(out Grid, in Table, in WeightFunc, in Radius, in MinPoints, in MaxPoints)
{
for each Cell in Grid
CalcCell
}
CalcCell
Functionality: calculate value of a grid cell given a table of data points around it
function CalcCell(in out Cell, in Table, in WeightFunc, in Radius, in MinPoints, in MaxPoints)
{
PointList = an empty list of table points
for each point in Table that is not null
if the distance between the point and the cell is less than Radius then
add point to the PointList, sorted by distance
Cell value = Interpolate PointList
}
Interpolate
Functionality: interpolate over a sorted list of data points
function Interpolate(in PointList, in WeightFunc, in MinPoints, in MaxPoints)
{
if number of points in PointList < MinPoints return null value
TotalWeight = 0
Sum = 0
WeightExp = 0 if WeightFunc = 1
1 if WeightFunc = 1/r
2 if WeightFunc = 1/r2, etc
for first MaxPoints in PointList
Dist = point distance from cell
Weight = 1 / (Dist^WeightExp)
TotalWeight = TotalWeight + Weight
Sum = Sum + Weight * point value
if TotalWeight = 0 return null value
Sum = Sum / TotalWeight
return Sum
Dutkiewicz V.A., P.P. Parekh,, and L. Husain (1987) An Evaluation of Regional Elemental Signatures Relevant to the Northeastern United States, Atmospheric Environment, 21, 1033-1044.
Husar R.B., D.E. Patterson, D.L. Blumenthal, W.H. White, and T.B. Smith (1980) Three-Dimensional Distribution of Air Pollutants in the Los Angeles Basin in The Character and Origins of Smog Aerosols, John Wiley & Sons, New York.
Loibl W., W. Winiwarter, A. Kopsca, J. Zueger, and R. Baumann, (1994) Estimating the Spatial Distribution of Ozone Concentrations in Complex Terrain, Atmospheric Environment, 28, 2557-2566.
Patterson, D.E., R.B. Husar, W.E. Wilson and L.F. Smith. 1981. Monte Carlo simulation of daily regional sulfur distribution: comparison with SURE data and visibility observations during August 1977. Journal of Applied Meteorology, 20, 404-420.
Schichtel, B.A., R.B. Husar, The Monte Carlo Model: PC-Implementation, Aerosols and Atmospheric Optics: Radiative Balance and Visual Air Quality Proceedings of an A & WMA International specialty Conference held in Snowbird, Utah 29-30 September 1994, 578-599. Pittsburgh, PA: Air and Waste Management Association.
Sisler J.F., D. Huffman, D.A. Latimer, W.C. Malm, and M. Pitchford (1993) Spatial and temporal patterns and the chemical composition of the haze in the United States: An analysis of data from the IMPROVE network, 1988-1991. Report #ISSn No. 0737-5352-26 CIRA, CSU, Fort Collins, CO.
U.S. Environmental Protection Agency (1994) AIRS User’s Guide, Volume AQ1: AQS Data Dictionary. EPA-454/B-94-005.
U.S. National Oceanic and Atmospheric Administration, National Geophysical Data Center (1995) TerrainBase Global Digital Terrain Model, Version 1.0.
| Submit your comments, feedback, questions, and ideas pertaining this page. Your input will be automatically added to the existing annotations. In order to add a new comment, you must be registered with the CAPITA People's Page. |