The objective of this project has been to map, model and monitor land cover and provide a seamless image-base for the Southwest Ecoregion of California. The broader objective has been to develop and test an approach for monitoring inter-annual land cover dynamics in California using multi-temporal, digital Thematic Mapper (TM) imagery.
We have constrained the monitoring approach to be as simple, portable, and analyst-independent as possible, even though this required some sacrifice in mapping accuracy. Important features of the monitoring approach developed here are:
The monitoring approach was used to quantify changes in land cover in southwestern California between Summer 1990 and Summer 1993. Portions of five TM scenes were required to cover the study area. Accuracy of mapping and monitoring was assessed using ground data and air photos.
Using a ten-class land use/land cover system adopted for this project, low overall map accuracies of 50-60% were obtained for the single date classifications of 1990 and 1993 imagery. Using a simpler six-class system it was possible to achieve single-date accuracies of 85-90%. These accuracies seem quite reasonable given the large and heterogeneous study area (portions of four TM scenes) and the constraint of single-date imagery, and suggest that the map-guided algorithm developed here has real potential in an operational monitoring system. However, even higher accuracies would be needed to reliably monitor modest changes in land use and land cover.
One obvious limitation of the current study was the inconsistency and low reliability of the original source maps when applied at 25m resolution for map-guided classification of 1990 imagery. An encouraging result was the ability of the classification algorithm to produce a product of considerably higher accuracy than the original input map.
Path/Row | Scene I.D. | Date | Notes |
40/36 (San Bernardino) | y5237017420 | 8/27/90 | |
40/37 (Santa Ana) | y5237017422 | 8/27/90 |
|
41/36 (Los Angeles) | y5237717480 | 9/3/90 | |
42/36 (Santa Barbara) | y5236817542 | 8/25/90 | |
40/36 (San Bernardino) | 94010007-01 | 4/13/93 | July-August imagery too cloudy to be useful |
40/37 (Santa Ana) | 94011001-01 | 8/19/93 | Extensive stratus clouds off coast |
41/36 (Los Angeles) | y5346517504 | 8/26/93 | |
42/36 (Santa Barbara) | 94010012-01 | 6/30/93 | August imagery too cloudy to be useful |
The 1990 imagery was segmented into ecological subregions, as defined by Hickman (1994, Figure 1), in order to account for systematic spatial variation in topography, vegetation and soils, and air quality. This resulted in a total of 10 sub-images that received separate processsing from this step on.
Two of the 1993 scenes were obtained at approximately the same times of year as the 1990 scenes (Table 2). Unfortunately, the remaining areas were only covered by spring and early summer imagery. The scene for Path/Row 40/37, which covers much of the southern portion of the study area including coastal San Diego County and Orange County, proved to be especially problematic. Although the land areas were cloud-free, extensive marine stratus clouds led to sensor saturation and hysteresis so that west-to-east scan lines of coastal areas were artificially bright, producing pronounced image striping over much of western San Diego and Orange County. We masked these areas from any subsequent change analyses. In an operational monitoring program we would have been forced to correct the striping problem or to acquire a different image.
We should also add that summer 1993 followed two average to above-average rainfall years, whereas summer 1990 was at the end of an extensive, five-year drought in southern California that had pronounced effects on surface water levels, residential and agricultural irrigation practices, and native perennial vegetation. These hydrological and ecological effects were expressed as marked differences in imagery from the two periods.
The 1993 scenes were first geometrically rectified
from their original Space Oblique Mercator (SOM) projection to
the Albers Projection. This step was performed using a terrain-correcting
rectification procedure (GCPWorks) available in the PCI image
processing package. A total of roughly 300 control points and
USGS 3 arc-second digital elevation data were used to co-register
1993 and 1990 imagery. These points were acquired from obvious
image tie points (e.g., road intersections, small dams, coastline
features) obtained by visual interpretation of both images. After
the collection of ground control points, the 1993 data were re-sampled
using cubic convolution. Most registration tests showed errors
of less than one pixel between the two time periods (e.g., Figure
2). Following geometric rectification, the imagery was segmented
into 10 sub-images for subsequent processing steps.
Figure 2. Subimage from urban location in the study region showing coregistration of TM Band 4 in 1990 and 1993. Images are superimposed, showing good co-registration of roads and other well-defined features.
We did not correct for atmospheric effects in the 1990 imagery, because the correction would not have affected the outcome of our classification procedure. However, we did radiometrically rectify the 1993 imagery to be statistically consistent with the 1990 scenes. This facilitated visual comparison of imagery from the two time periods and added consistency to results of subsequent classification procedures. Each 1993 sub-image was radiometrically adjusted using the procedure described by Schott et al. (1988). This method involves applying a linear transformation to the image from the second time period to make it appear as if the surface was imaged through the same atmosphere as the first. "Pseudoinvariant" features that are constantly bright over time (e.g., urban surfaces) are identified by the analyst through interactively setting image DN thresholds in specific bands. Once identified, these areas are treated as reference areas and a linear transformation is applied to match the histograms of DN values for each band in each time period (see Schott et al. (1988) for details).
For both 1990 and 1993 sub-images, we next derived brightness, greenness, and wetness (BGW) images by applying the linear coefficients published by Crist and Cicone (1984). Finally, prior to image classification we masked any apparent clouds in the imagery. Areas with clouds in either 1990 or 1993 imagery were masked after DN threshholding to guide on-screen digitizing of cloud boundaries.
Based on discussions with SWEPG members, we decided to use a very simple classification system essentially comparable to Anderson Level 1 or, for natural vegetation, to the highest level of Federal Geographic Data Committee's (FGDC) vegetation classification system (Table 2). The system distinguishes five natural vegetation classes based on canopy structure, three anthropogenic cover types based on land cover and use, water, and bare soil or rock. The system has obvious shortcomings. For example, a single urban class is used to capture all built environments, regardless of density or use. However, the simplicity of the system was viewed as appropriate for the kind of regional, interagency monitoring activities envisioned by the group, and was also expected to be easier to map with acceptable accuracy than a more detailed classification scheme.
.
Image classification was performed by an algorithm developed for this project and referred to as "iterative, map-guided image classification." The approach requires that a reasonably accurate and appropriately detailed land cover map (e.g., a map from the baseline or previous mapping period) exists for the region of interest. An unsupervised classification is performed on the image, and the resulting classified but unlabeled map is then intersected with the existing map to calculate the proportion of each spectral cluster in each land cover class. The spectral cluster with the highest level of association (i.e., the highest ratio of pixels in a cluster and information class combination relative to the sum of pixels in the cluster in all classes) is assigned to the corresponding information class. The pixels in that spectral cluster are then masked in the image and the procedure (i.e., unsupervised classification, comparison to map, labeling of the cluster in highest concordance with a map class) is repeated with the remaining data. The level of association from the first iteration is multiplied by .95, and this value is set as the threshold for assignment in the next iteration. If no cluster reached the threshold in subsequent iterations, the current highest association becomes the new threshold. Processing continues in iterative fashion until all pixels are assigned to a land cover or until a stopping rule is invoked.
The rationale for the iterative procedure is as follows:
Best results are obtained from map-guided image classification when the input map data have the same resolution as the imagery. We did not have a TM-based land cover map available for producing the 1990 map. Instead, we derived the 1990 regional land cover map using 1990 county land cover maps prepared for Riverside, Orange and San Diego Counties. These were produced at roughly 1:24,000 scale by interpretation of large scale air photos. These maps cover only part of the total region, and we were forced to use smaller scale (1:100,000) maps prepared by the California Gap Analysis project (Davis et al. 1995) over the remainder of the study region.
The 1993 land cover map was produced by applying the image classifier to 1993 imagery using the TM-derived land cover map from 1990 required to use a smaller scale (1:100,000) land cover map for the remaining areas (Davis et al. 1995). These maps were combined to form a seamless, 25m map for the entire ecoregion (Figure 3). This input map was used to derive a new regional 1990 land cover map.
Before quantifying changes in land use land cover, we applied logic rules to eliminate ecologically nonsensical change. The logic filters that were applied to the imagery were specific to the study area and the two time periods of image, but the procedure is more general. The following transitions were considered spurious:
In an effort to reduce misregistration effects which are inherent in per-pixel image classifications, and to reduce noise caused by incorrectly classified pixels, we agglomerated the per-pixel classifications into "areal reporting units (ARUs)" of 10, 100 and 1000 acre resolutions. We counted the number of pixels for each class for each time period occurring in each ARU. These counts were then compared on a class by class basis at the ARU level in order to identify changes between both time periods. An ARU with no change would have the same number of pixels for each class for both times
Forty randomly located sample cells measuring 100 meters by 100 meters at ground scale were assigned to each of four 1:24,000 scale USGS 7.5 minute topo quads. Cells immediately contiguous to other cells were omitted from the sample. A total of 153 sample cells were used. We printed the sample cells on acetate overlays, along with major roads and map edge neatlines that matched the topo maps. At that scale, the sample cells were 4.2 millimeters on a side.
We used 1:65,000 scale color IR photographs (July 1990) and 1:48,000 color photos (February 1993) to make an accuracy assessment of the 1990 and 1993 TM images. We used a zoom transfer scope to superimpose the air photos on the overlays and map, and assigned each sample cell to the dominant land cover class.
To assess repeatability and accuracy of the air photo interpretation, we selected a random subset of 69 sample cells to be interpreted by a second person using the same cover class definitions.
In general, we had the most trouble consistently
defining samples cells as shrub, barren, or grassland in situations
where vegetation cover was sparse and required precise and accurate
estimation of shrub cover to determine whether there was sufficient
cover to assign the area shrubland status. Another problem was
making consistent determinations for mixed pixels. Although the
location of the sample cells could probably be determined on the
air photos to within a fraction of the width of the cell, this
may have been enough to make a difference in determination of
cell cover type in a few cases.
The two photointerpreters did not always agree in
their classification of sampled areas (Tables 3 and 4). In double-blind
analysis of 69 samples, they agreed only 77% of the time in interpreting
the 1990 air photos and only 74% of the time in assessing the
1993 photos. Urban and orchard classes were assigned with the
highest consistency. As noted above, assigning classes to mixed
areas and to open shrubland/grassland areas were the main sources
of uncertainty. These results suggest that some of the classification
error measured in the regional land cover maps could derive from
error in photointerpretation.
Table 3. Confusion matrix
showing class assignments by two photointerpreters for 1990 air
photos. A sample of 69 areas was randomly selected from four scattered
7.5-minute quadrangles in the study region.
Table 4. Confusion matrix
showing class assignments by two photointerpreters for 1993 air
photos. The same areas were analyzed as in the 1990 air photos.
Disappointing results were obtained in classifying
1990 TM imagery. Overall agreement between air photo samples and
TM-derived samples was only 55%, and kappa-based accuracy was
only 40% (Table 5). Several factors contributed to this low accuracy:
Needless to say, the low accuracy of the 1990 land
cover map eliminates any possibility of obtaining accurate estimates
of land cover change in the region between 1990 and 1993 based
on these map classes. However, a simpler classification scheme
(for example, water, urban, barren/herbaceous, shrubs, trees),
would have higher accuracy and might be useful for some kinds
of monitoring objectives. The accuracy of the 1990 map based on
this 5-class scheme was 85%, still not high enough to monitor
land cover change with much confidence (Table 7).
Table 5. Confusion matrix
showing classification error for the 1990 land cover map, based
on 153 random samples from four 7.5-minute quadrangles. Rows are
based on intepretation of air photos, and columns are TM-derived
classes. The Kappa statistic is calculated based on the equation
in Congalton (1991).
Table 6. Confusion matrix
comparing map class in 1990 air photo samples (rows) and the 1990
land cover maps (columns) used to train the iterative map-guided
classification algorithm.
Classification acuracy for the 1993 land cover map
was slightly higher than that for 1990 (59% agreement, Table 8).
It is interesting to note that the classes that were mapped with
highest accuracy in 1990 (e.g., urban, orchard) were also mapped
with higher accuracy in 1993. As in 1990, there was high confusion
between barren, grassland and cropland cover types, and between
forest and orchard, so that reasonable accuracies could be achieved
using a simpler classification scheme based on dominant life forms
(Table9).
Table 7. Confusion matrix
for 1990 air photos versus TM-derived land cover for a simplified
classification scheme (n=146).
Table 8. Confusion matrix
for the 1993 TM-derived land cover map.
Table 9. Confusion matrix
for the 1993 TM-derived land cover map based on a simplified land
cover classification system.
Given the low accuracy of the 25 m products when
evaluated using 1 ha reference sites and the ten-class classification
system, there was little reason to analyze change by direct comparison
of the 1990 and 1993 maps. However, a comparison based on the
simplified classification system is more reasonable, given the
higher single-date accuracies. The possible reliability of such
a comparison is illustrated in Table 10, which compares urban
change as predicted in the maps from that detected in the analysis
of the air photos. The results are not encouraging. The one sample
that both analysts agreed had been converted from non-urban to
urban cover was classified as urban in the 1990 map and other
in the 1993 map. Only half of the areas classified by the analysts
as urban in both time periods were similarly classified in the
TM-based maps. We should note that such errors are in large part
related to the scale of the sampling unit.and occurred almost
exclusively in mixed areas that included both urban development
and open space. On a per-pixel basis our accuracies may have been
higher.
Table 10. Confusion matrix
for urban change analysis based on air photos versus TM-derived
maps.
Members from the Southwest Ecoregion Planning Group
reviewed our results for the southernmost TM scene that covered
San Diego County and parts of Orange and western Riverside Counties.
The results were mixed. Even with our relatively crude land
cover classification system, some members expressed optimism that
our results would at least point them in the right direction when
trying to detect change. Other members expressed less enthusiasm,
desiring better results and stricter categorization. We do not
think that our solution is in any way complete, but find the approach
sufficiently encouraging to continue refining the algorithms.
Our intent was to create a reproducible TM-based
monitoring process requiring as little human intervention or subjectivity
as possible. We have come close to this goal, and recognize that
a number of factors have operated from the start to lower the
quality of our results. In particular, it may be unreasonable
to expect high accuracies using single-date imagery, and a more
robust map-guided classification procedure may be needed.
After reassembling the classified maps for both time
periods, we observed some sharp discontinuities at the boundaries
of the subecoregions in the 1993 product. Some of the land cover
boundaries might be expected to be sharp between the subecoregion
boundaries, but others are clearly erroneous classifications.
These artifacts are likely due to the seasonal differences between
the 1990 and 1993 imagery. In order to accurately detect change
using satellite imagery from two dates, it is important to employ
imagery from dates as close to the same time of year as possible.
While all of our 1990 data were from late summer, one 1993 image
was from June and another from April. The change in seasonality
between these two dates has greatly affected the results for these
scenes, and it is difficult to imagine any change detection algorithm
that would perform well under these conditions.
Another problem with our input data was the algorithm
used to georeference and terrain-correct the imagery. While the
1990 data were uniformly processed by EROS, the 1993 data were
processed using a different algorithm provided by PCI. Although
we had sub-pixel registration quality in some areas, in other
areas co-registration accuracy was difficult to determine due
to the lack of recognizable and stable image features. We suspect
that registration errors may often exceed one pixel in rugged
areas.
Finally, we expect that much of the uncertainty in
assessing change from 1990 to 1993 is related to the very different
climate conditions during those years.
In contrast to traditional change detection methods,
the output product is not a differenced image, but instead a series
of change maps for each land cover type illustrating gain or loss
of each land cover type. This output can also be summarized tabularly
by any regular or irregular spatial summary unit. We have not
yet performed a formal analysis of the effect of Areal Reporting
Unit size on the reliability of the assessment, but it appears
that the patterns of change are highly scale dependent. We are
continuing to explore this component of our monitoring approach
with the goal of developing a more formal model of scaling properties
of land cover dynamics.
6.REFERENCES
Congalton, R. G. 1991. A review of assessing the
accuracy of classifications of remotely sensed data. Remote Sensing
of Environment 37: 35-46.
Crist, E. and R. Cicone. 1984. A physically-based
transformation of Thematic Mapper data - the TM tasseled cap.
IEEE Transactions on Geoscience and Remote Sensing GE-22: 256-263.
Davis, F.W., P. A. Stine, D.M. Stoms, M. I. Borhert,
and A. D. Hollander. Gap analysis of the actual vegetation of
California 1. The Southwestern Region. Madrono 42: 40-78.
Dobson, J. E., et al. 1997. NOAA Coastal Change Analysis
Program (C-CAP): guidance for regional implementation. NOAA Technical
Report NMFS 123. Department of Commerce.
Goodenough, R. 1992. The nature and implications
of recent population growth in California. Geography 77: 123-133.
Hickman, J. C. (ed.). 1993. The Jepson manual of
higher plants of California. University of California Press, Berkeley.
Scott, J. R., C. Salvaggio, and W. D. Volchok. 1988.
Radiometric scene normalization using pseudoinvariant features.
Remote Sensing of Environment 26: 1-16.
Strahler, A. H. 1981. Stratification of natural vegetation
for forest and rangeland inventory using Landsat digital imagery
and collateral data. International Journal of Remote Sensing 2:
15-41.
4. RESULTS
4.1. Analyst Performance in Map Accuracy Assessment
4.2. Classification Error in the 1990 and 1993
Land Cover Maps
4.3. Change Analysis, 1990 to 1993
5. CONCLUDING COMMENTS