IBM Final Report: Conclusions

A SPATIAL MODELING AND DECISION SUPPORT SYSTEM FOR CONSERVATION OF BIOLOGICAL DIVERSITY

CONCLUSIONS AND RECOMMENDATIONS

Conclusions and Recommendations for Changes in Approach
Recommendations for Follow-on Research

Conclusions and Recommendations for Changes in Approach

Regional vegetation mapping
Wildlife habitat modeling
Monitoring environmental change
Regional conservation planning and reserve design

Progress in conservation assessment and planning has been severely and unnecessarily limited by hardware and software for mapping and spatial analysis. Specifically: 1) biogeographers and conservation biologists have not had adequate computing resources to analyze the large volumes of data involved in conservation assessments; 2) data management systems in general use are poorly designed for manipulation of heterogeneous biogeographic data; 3) there is practically no coupling among database management systems and analytical software used in biodiversity analyses; 4) it is difficult to visualize biogeographical data sets and model outputs with existing display tools; and 5) spatial modeling and decision support are constrained by inadequate hardware and cumbersome protocols for conducting sensitivity and error propagation analyses.

This research project was to design and enable a prototype regional computing facility for storage, analysis, and visualization of biodiversity data. More specifically, applications were to be developed to support regional gap analysis and siting of nature reserve systems. As originally conceived, the project was intended to develop an object-oriented database with a suite of operators to perform the most important, standard data transformations needed in conservation work and integrated through a common user interface. As the research progressed, however, the technology advanced so rapidly that it made more sense to adapt these technologies to our needs rather than to develop new, parallel solutions. For instance, the advent of the World Wide Web provided a universal environment for data cataloging. Commercial GIS software began to include tools for customizing user interfaces.

Also, the concept of a single user interface to support the gamut of conservation analysis tasks from image processing of satellite image data through location-allocation modeling to site new reserves became unwieldy. Each regional assessment would have unique data and processing requirements, making a prototype system impractical. Instead we focused our effort on advancing some of the individual research problems by improving the approaches, particularly those where processing vast quantities of data were involved. These tasks generally required both scientific and computational advances in the approaches currently used in conservation studies.

Regional vegetation mapping

Most tasks in regional conservation assessment and planning require comprehensive and consistent land-cover maps at a taxonomic level detailed enough to reflect variation in the native biota. For large ecoregions, this mapping typically involves massive data sets of satellite imagery, which are difficult to acquire under similar viewing conditions. We have addressed this issue by developing and testing a new method of compositing daily images into composites covering 10-14 day periods with improved viewing angle and elimination of most cloud cover (Stoms et al. 1997). This compositing strategy was used to compile AVHRR data for a growing season over the Intermountain Semi-Desert Ecoregion covering parts of nine western states. With further testing, this approach could improve the quality of composites being generated to support an international global land-cover mapping and global change research efforts.

A new image classification technique developed for this IBM project was used to classify this multi-temporal image data set into vegetation alliances, using the state GAP maps as training information for the maximum likelihood classifier (Stoms et al. in review). The major innovation of this map-guided classification technique is that it is iterative, assigned pixels to map classes only when there is strong agreement at the current iteration between spectral clusters and map information classes. This technique was applied to the AVHRR data set for the Intermountain Semi-Desert Ecoregion to compile a spatially and taxonomically consistent land-cover map, where the individual state maps had created abrupt discontinuities at boundaries between states. The nation's first multi-state regional gap analysis was conducted using this land-cover map as an expanded coarse-filter for assessing the status of biodiversity in the region (Stoms et al. in review). Map-guided classification is also being used to monitor changes in land-cover over time and could be useful in any large area mapping project as a means of integrating data from different sources.

Much useful information can be derived from the California GAP database with its rich set of landscape attributes. It achieves a view of vegetation over large, heterogeneous regions while containing considerable floristic information and spatial detail. This view was only possible by integrating many kinds of spatial data ranging from modern satellite imagery to air photos and archival vegetation maps. It is intermediate in detail between traditional regional biogeography and local ecological studies, and helps to bridge those very different perspectives. An important distinction between the map-based study in coastal sage scrub (Davis et al. 1994) and earlier studies is that the GAP database is spatially exhaustive across the range of the community type and therefore better suited to regional planning and policy analyses than strictly plot-based information. By relating the information to other spatial data we can readily answer queries such as: Which coastal scrub types occur on national forest lands? Which lands dominated by Salvia leucophylla are zoned as open space? Where are large areas of coastal scrub vegetation that are likely candidates for new reserves?

Wildlife habitat modeling

The orange-throated whiptail study (Hollander et al. 1994) illustrates how different distribution and environmental data at various scales can generate predictive distribution maps and hypotheses about the factors controlling them. None of these representations can be considered definitive, but each has its uses. The advantages of each map approach become more apparent when these different representations are considered together. Thus we envision a mapping environment where the researcher struggles no longer to produce a single map, but produces suites of them at will. Data integration is one component to this, but so are the flexibility and clarity of the underlying models, the multiple images thereby creating a better representation of the complex reality underlying diverse data sources.

From the point of view of wildlife habitat modeling in general, there are a number of results to be highlighted from the wild pig study. The first element to the wildlife habitat modeling is incorporation of human disturbance as a factor affecting relative abundance levels. This is done multiplicatively, with higher levels of hunting pressure or greater road densities corresponding to lower relative abundances. Also, additional influencing factors can be incorporated in the network model by adding nodes and links to the diagram. Finally, both the local and regional models are spatialized in that they integrate habitat factors across the landscape rather than focusing on a single point. This is a step towards building dynamic models of wildlife populations through space and time.

With respect to GIS methodology, the wild pig habitat modeling project has illustrated how expert review of the component layers in a GIS model can enhance the modeling process. Our methodology illustrates how interactive review of the components of the GIS model can aid in its development. This has been accomplished in a workshop where the components of GIS models were presented to wildlife experts. The portability of the presentation has been facilitated by technology such as the use of IBM laptop computers, overhead display devices, and current GIS visualization software. Nevertheless, the present technology did not allow these models to be altered interactively during the workshop. Another issue concerning expert review is whether to elicit feedback at a workshop, as done here, or through interviews with experts carried out individually. The latter offers the possibility of reaching the opinion of more people, whereas group review allows for more synergism and consensus from participants. Another component of expert review is the ability to embed expert knowledge into a formal system for GIS modeling. This can be carried out at several levels. The first is simply to translate the GIS model into a script written in the macro language of the GIS system. This allows the modeling process to be replicated using new datasets. Another level is to create a graphical representation of the formal model for ease in communication. This has been done here in the network diagrams for both the local and regional models. Moreover, this network diagram structure is closely related to influence diagram models from decision analysis. Algorithms have been developed to evaluate such diagrams, which means that these that diagrams can constitute a formal expert system.

Monitoring environmental change

Environmental monitoring and change detection is a very active area of geographic research. Many technical challenges (for example, scene-to-scene radiometric and geometric rectification, scale-dependence, classification strategies, accuracy assessment) and conceptual issues (e.g., criteria for defining and recognizing environmental change) remain. Although monitoring was not a central focus of our IBM-ERP research, we did make substantive progress in the areas of AVHRR pre-processing, compositing methods for obtaining cloud-free imagery (Stoms et al. 1997), and map-guided classification for detecting change using satellite remote sensing.

Regional conservation planning and reserve design

Major advances were made in the area of regional conservation planning. In particular, our linking of the emerging science of conservation biology with the traditions of operations research led to very fruitful research directions. The reserve selection problem in the past was usually solved with simplistic greedy-adding heuristics. Rare attempts to find optimal solutions were thwarted by the apparent combinatorial dilemma for any problem of the dimensions of real-world planning situations. By bringing the problem solving insights of linear programming, we were able to formulate it as a maximal covering location problem and to find optimal solutions to the basic reserve selection problem in reasonable computing time to be useful to conservation planners (Church et al. 1996). By reformulating the MCLP as a p-median problem, we also succeeded in integrating the model into a commercial GIS which most planners can use (Gerrard et al. in press).