Agronomy Journal Grow Your Career With ASA
HOME HELP FEEDBACK SUBSCRIPTIONS ARCHIVE SEARCH TABLE OF CONTENTS
 QUICK SEARCH:   [advanced]


     


Published online 1 September 1999
Published in Agron J 91:761-772 (1999)
© 1999 American Society of Agronomy
677 S. Segoe Rd., Madison, WI 53711 USA
This Article
Right arrow Abstract Freely available
Right arrow Figures Only
Right arrow Full Text (PDF) Free
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Services
Right arrow Similar articles in this journal
Right arrow Similar articles in ISI Web of Science
Right arrow Alert me to new issues of the journal
Right arrow Download to citation manager
Right arrow reprints & permissions
Citing Articles
Right arrow Citing Articles via HighWire
Right arrow Citing Articles via ISI Web of Science (33)
Right arrow Citing Articles via Google Scholar
Google Scholar
Right arrow Articles by Hartkamp, A.D.
Right arrow Articles by Hoogenboom, G.
Right arrow Search for Related Content
PubMed
Right arrow Articles by Hartkamp, A.D.
Right arrow Articles by Hoogenboom, G.
Agricola
Right arrow Articles by Hartkamp, A.D.
Right arrow Articles by Hoogenboom, G.
Related Collections
Right arrow Remote Sensing
Right arrow Other Models
Agronomy Journal 91:761-772 (1999)
© 1999 American Society of Agronomy

SIMULATION & MODELING

Interfacing Geographic Information Systems with Agronomic Modeling

A Review

A.Dewi Hartkampa, Jeffrey W. Whitea and Gerrit Hoogenboomb

a Natural Resources Group, CIMMYT, Lisboa 27, Apartado Postal 6-641, 06600 México, DF, Mexico
b Dep. of Biological & Agricultural Engineering, Univ. of Georgia, Griffin, GA 30223-1797 USA

d.hartkamp{at}cgiar.org


    ABSTRACT
 TOP
 NOTES
 ABSTRACT
 INTRODUCTION
 Strategies for Interfacing
 Structural Issues Affecting the...
 Applications
 Challenges for Successful...
 Discussion and Conclusions
 REFERENCES
 
Agronomic models are traditionally used for point or site-specific applications due to limitations in data availability as well as computer technologies. Interfacing geographic information systems (GIS) with agronomic models is attractive because it permits the simultaneous examination of spatial and temporal phenomena. The objective of this review is to examine strategies for interfacing GIS with agronomic models. It considers the diverse terminology in use, programming approaches, issues of data and scale, and existing applications. Linking is defined as merely passing input and output between a GIS and a model, combining is defined as automatic data exchange and GIS tool functions, and integrating is defined as embedding a model in a GIS or vice versa. Due to differences in research objectives, spatial and temporal scales, data sources or formats, and the natural processes being modeled, there is no universal approach for interfacing. Because of the detailed input requirements for agronomic models, expanding the models from a point-based application to a spatial application can greatly increase the volume of input data. This review suggests that a major challenge in interfacing GIS to models lies in developing systems that handle spatial processes by implying interactions among spatial units. Moreover, extensive data requirements must be satisfied, while also ensuring data quality control.

Abbreviations: ascii, American Standard Code for Information Interchange • GIS, geographic(al) information system(s)


    INTRODUCTION
 TOP
 NOTES
 ABSTRACT
 INTRODUCTION
 Strategies for Interfacing
 Structural Issues Affecting the...
 Applications
 Challenges for Successful...
 Discussion and Conclusions
 REFERENCES
 
GEOGRAPHICAL INFORMATION SYSTEMS (GIS) facilitate the storage, manipulation, analysis, and visualization of spatial data. Most process-based agronomic models have examined temporal variation using point data from specific sites, and model outputs thus are site-specific. Agriculture is a spatial activity, however, and there is growing interest in placing site-specific information in a spatial and long-term perspective. Precision agriculture requires models that calculate spatial variation in crop growth at a scale of meters and with a time scale appropriate for management decisions, often hours or days (NRC, 1997). Efficient targeting of germplasm or production practices requires models that calculate germplasm x environment interactions on a regional scale (e.g., 1 to 5 km), usually through modeling at daily time scales (e.g., Chapman and Barreto, 1996). Climate change research calculates global effects with models that are often run on scales of 50 to 100 km, on the basis of multicentury time scales (NSTC, 1998). Furthermore, there is increasing interest in understanding how processes with a spatial component, such as runoff and lateral flow of solutes, affect system behavior. The interaction of both spatial and temporal issues seems best handled through interfacing agronomic models with GIS.

Geographical information systems have existed for almost three decades, but only in the last 10 years have applications been widely used in agriculture and natural resource management (Burrough, 1986). In the 1980s, the number of applications grew as a result of vendor-driven efforts to show the capabilities of GIS (Suan-Pheng, 1993), and vendors' perceptions of the market guided the development of these applications (Dangermond, 1991). During the 1990s, as access to powerful computer technology became less costly, the number of GIS applications specific for research and development has increased. Consequently, a new generation of problems and issues have surfaced that are more pertinent to researchers and particular research objectives than to GIS developers per se (Suan-Pheng, 1993).

An example of such a new issue is the adding of time as a fourth dimension to GIS capabilities. The time dimension can be included in GIS analyses in two ways. In the first approach, time-series of historic data from surveys or remote sensing can be examined as a series of overlays (Marble, 1984). These static spatial snapshots may be analyzed with the help of statistical procedures (Croft and Kessler, 1996), such as Markov chains (Stoorvogel, 1995; Tomlinson Associates, 1987). Such analyses can document past trends, but their predictive power is weak, especially for new production practices or conditions. The second approach, that avoids this shortcoming by using process-based models to represent variation with time, is emphasized in this paper. The resulting model outputs may be viewed as a time series in GIS.

Use of the words `model' and `modeling' in relation to GIS can cause confusion. Firstly, the focus of this paper is on simulation modeling, as opposed to spatial and environmental modeling. Spatial modeling often refers to techniques such as reclassification, overlay, and interpretation (Yakuup, 1993). Environmental modeling refers to techniques ranging from interpolating climate data to the use of data models and remote sensing. These techniques do not relate to simulation modeling per se, although environmental modeling in the narrow sense also exists (e.g., simulations of groundwater flow and the fate of contaminants) (Maslia et al., 1994). Nonetheless, spatial modeling can be used to facilitate interfaces between GIS and modeling. Secondly, this paper focuses on process-based models concerned with agricultural issues (e.g., crop production, soil erosion, or water pollution), as opposed to rule-based (logical) and empirical (regression) models.

The main attraction of interfacing models and GIS is to facilitate simultaneous analysis of spatial and temporal variation in processes. Our understanding and interpretation of the simulation results can not only significantly improve by spatially visualizing the results of models (Engel et al., 1997) but, more importantly, improve by advanced spatial analyses of model results (Campbell et al., 1989; Stoorvogel, 1995). Relevant methods include multivariate analysis, spatial autocorrelation, cluster analysis to define homogeneous zones prior to modeling, point pattern analysis, and error analyses.

Despite the growing number of computer-based applications, little attention has been paid to developing conceptual frameworks for the simultaneous use of GIS and modeling. The objective of this review is to examine strategies for interfacing GIS with agronomic models. We consider the diverse terminology in use, concepts of interfacing, and issues of data, scale, and error. Examples of applications in agronomy and natural resource management are discussed, including extraneous major challenges to effective interfacing.


    Strategies for Interfacing
 TOP
 NOTES
 ABSTRACT
 INTRODUCTION
 Strategies for Interfacing
 Structural Issues Affecting the...
 Applications
 Challenges for Successful...
 Discussion and Conclusions
 REFERENCES
 
Models have been interfaced with GIS since the mid-1980s, but early efforts did not emphasize process-based models (Nyerges, 1991). Nyerges (1991) noted that GIS vendors have had few incentives to develop such complex models, because of their limited market potential. In the past, therefore, GIS–model interfaces were developed within the various research disciplines in an ad hoc manner by researchers who were not professional GIS programmers (Stoorvogel, 1995). Because of these circumstances, a conceptual framework with standards for terminology, formats, and procedures for interfacing models with GIS does not exist.

Terms frequently used in describing systems that interface GIS and models, and their definition (Longman, 1984), include the following:

We suggest that interface and interfacing be used as umbrella words for the simultaneous use of GIS and modeling tools, since they do not imply a specific level of interaction between them.

We consider linking, combining, and integrating to be suitable terminology for degrees of interfacing. Burrough (1996) and Tim (1996) refer to "loose coupling," "tight coupling," and "embedded coupling," which correspond to linking, combining, and integrating, respectively. Fedra (1993) uses "deep coupling," which corresponds to integrating. Distinguishing between linking and combining can be difficult, while integration is more easily distinguished (Tim, 1996). The terms `linking', `combining', and `integrating' relate to the physical extent to which the GIS and models are interfaced.

Linking
Simple linkage strategies use GIS for spatially displaying model outputs. This approach often involves interpolation of model outputs (e.g., White and Hoogenboom, 1995). More sophisticated linkage strategies use GIS functions such as interpolation, overlay, and slope calculation to produce a database containing inputs for the model. Model outputs can be exported to the same or a separate database. Communication between the software systems is achieved through grid cell or polygon identifiers that link input and output to field locations. Simple transfer of files in ascii format or a common binary file format is usually sufficient in this strategy. The concept of linking GIS and models is presented in Fig. 1a . Limitations of this strategy often include (i) the system's dependence on either the GIS or model output format; (ii) failure to take full advantage of the functional capabilities of the GIS (e.g., spatial analysis tools); and (iii) the incompatibility of operating environments and hardware (Tim, 1996). Lam et al. (1996) and Fedra (1991) have emphasized that users cannot exploit the full potential of the systems through linking. Examples of linking are GLEAMS to ArcInfo (Stallings et al., 1992), USLE to MAP GIS (Hession and Shanholtz, 1988), and WOFOST to ArcInfo (Van Laanen et al., 1992) (Table 1) .



View larger version (26K):
[in this window]
[in a new window]
 
Fig. 1 Organizational structure for (a) linking, (b) combining, and (c) integrating geographical information systems (GIS) and models. (Adapted from Tim, 1996, and Tim and Jolly, 1994.)

 

View this table:
[in this window]
[in a new window]
 
Table 1 Examples of GIS–model interfaces, organized by interface type, main data format (vector or raster) and reference

 
Combining
Combining also involves processing data in a GIS and displaying model results; however, the model is configured with interactive tools of the GIS and the data are exchanged automatically (Burrough, 1996). Extensive use is made of mechanisms that are offered by GIS packages: macro languages, interface programs written in standard program languages, and libraries of user-callable routines (Tim, 1996). This approach usually requires more complex programming and data management than do simple linkages. The concept of combining GIS with models is presented in Fig. 1b. Examples of combining are AEGIS with ArcView (Engel et al., 1997), (GIDM) Gleams with ArcInfo (Fraisse et al., 1994), and WEPP with ArcView (Cochrane et al., 1997) (Table 1).

Integrating
Integration implies incorporating one system into the other. Either a model is embedded in a GIS, or a simple GIS system is included in a modeling system. Aside from making use of GIS and modeling tools, integration usually involves automatic use of relational databases, expert systems, and statistical packages. Full integration implies systems developed within the same or similar data structures. File transfer and format conversions are avoided or automated and thus are invisible to the user. The development of such systems may mean starting from scratch with data organization, among other tasks. A considerable programming effort is needed to develop these software systems, not to mention a considerable mutual understanding between the GIS specialist and modeler, and so only limited attempts have been made to integrate process-based models with GIS. More often integrated systems make use of simplified models (Tim, 1996). The concept of integrating GIS with models is presented in Fig. 1c. Examples of integrating are RAISON (Lam and Swayne, 1991; Lam et al., 1996) and the interface described by Stuart and Stocks (1993) (Table 1).

Additional examples of interfaces that have been linked, combined, and integrated are presented in Table 1. Abbreviations of the model names and interface tools are listed alphabetically in Table 2 . In summary, limitations of the different strategies are related to problems of incompatibility of database structures, software, and hardware (Stoorvogel, 1995; Tim, 1996; Burrough, 1996). Linking strategies usually underuse the functional capabilities of GIS to achieve an interactivity between the GIS and models. Point models are run only for a series of locations, and there is no attempt to consider interaction between neighboring locations, such as run-off or run-on in adjacent plots. In combining and integrating, interactivity can be more readily achieved. Almost all interfacing activities require considerable effort from the developers and users (Engel et al., 1997). The ease of use, efficiency, development and maintenance costs, and necessary human resource training are important considerations for system design (Fedra, 1991; Nyerges, 1991). The amount of effort needed to develop integrated systems is large, and probably for this reason most efforts at interfacing have evolved through linking models with GIS. Stoorvogel (1995) noted that a modular approach should contribute to the transparency and flexibility of structure and procedures.


View this table:
[in this window]
[in a new window]
 
Table 2 Abbreviations and acronyms of models and interface tools

 
The choice of the interfacing strategy should depend on the research problem, the application objectives, and the investment the user is able to make. This sounds easy, but it is not. The research problem often has different relevance at different spatial and temporal scales (Fresco and Kroonenberg, 1992). The choice of the scale at which the problem is addressed may be subjective. It is often difficult to determine an approach that will provide valid research results at the farm level as well as the regional level. Procedures for up-scaling and down-scaling exist, but methods for calculating the effects of these procedures on resulting model calculations are still scarce. Few investigators have studied the effect of up- and down-scaling (e.g., Izaurralde et al., 1996; Hijmans and Bowen, 1997; Wagenet and Hutson, 1996). Hijmans and Bowen (1997) described aggregation of data in time (weather data) and space (soil data) when models were interfaced with GIS. They found that the effect of the aggregation or disaggregation on resulting calculations depends on a combination of environmental variability and model sensitivity. In heterogeneous regions and for models that are sensitive to changes, this may lead to errors in the resulting GIS–model calculations. Others have recognized the constraint (e.g., to aggregation to regional levels; Rosenberg, 1992) and explained that this is caused by a lack of efficient means to incorporate spatial variability in input variables (Carbone et al., 1996).


    Structural Issues Affecting the Interface Strategy
 TOP
 NOTES
 ABSTRACT
 INTRODUCTION
 Strategies for Interfacing
 Structural Issues Affecting the...
 Applications
 Challenges for Successful...
 Discussion and Conclusions
 REFERENCES
 
The physical extent to which modeling and GIS capabilities are interfaced can be viewed as a programming issue. However, there are structural issues that affect the interface strategy. These are related to the research problem or the purpose of the application and include the scale, type (linear, nonlinear), and complexity of the processes modeled and of the data sources, the format and structure of the available data, and the dynamic relations between model runs and the spatial units.

Scale and Complexity
The spatial scale of the research problem may range from the plot to the field, farm, watershed, region (intranational), nation, region (international), continent, and/or global level. The temporal scale can vary from seconds to several years or more. Related scales in interfacing include the data measurement scale, original map and GIS scale, modeling scale, data manipulation scale, natural scale of the phenomenon, and scale of application (Burrough, 1996).

Issues of scale for the GIS component of an interfaced system are straightforward. The map scale is often predefined. For instance, map scales between 1:100 000 and 1:250 000 are often recommended for regional studies, whereas scales of 1:1000 and 1:2500 are more appropriate for farm-level applications (Garrity and Singh, 1991). Unfortunately, use of detailed map scales is frequently precluded by practical constraints, such as poor data availability or inadequate computer resources. Wilson et al. (1996), Inskeep et al. (1996), and Wagenet and Hutson (1996) found that the impact of input map (data) resolution on final model calculation depends on the processes accounted for by the model.

The scale of a model is more problematic and involves three closely linked dimensions: space, time, and complexity (Penning de Vries, 1996). Model scale is often equated with the complexity of the processes that are modeled. However, this is incorrect. For example, a model of global climate patterns may simulate complex atmospheric processes but operate at a spatial resolution of 50 km or larger. Simple models of physiological processes may run on a time scale of minutes. Traditional point-based agronomic models lack an explicit spatial scale, although it is often suggested that they are valid at the plot or field scale. However, when such a model is interfaced to spatial data in a GIS, the spatial scale is usually predetermined by the scale of the spatial data or application. The temporal scale of a model may be influenced by the time span of the types of processes being modeled, data availability, computational constraints or scale of the application.

Model complexity is largely determined by the type (e.g., linear or nonlinear) and detail of the processes represented. However, it can also be influenced by data availability, computational constraints, and interests in making underlying assumptions readily understandable.

The choice of an appropriate model scale is often difficult and is a topic of active debate. Clearly, the answer to the scale and/or complexity issue should lie in the research problem or application objective itself (Boote et al., 1996). Passioura (1996) made the valuable distinction between scientific models, which are intended to improve the understanding of processes, and engineering models, which are intended to provide sound calculations for decision makers. Monteith (1996) noted that the ease of software development afforded by modern personal computers may have led some researchers to develop excessively complex models. However, this view may be contrasted with that of Leenhardt et al. (1995), who argued that, for modeling effects of spatial soil and water variability at a regional scale, simplification of models to facilitate modeling across large regions or long time scales is unjustified on theoretical grounds. Given equal experimental effort, simple approaches allow a greater spatial sampling density than more mechanistic ones, but simplifying processes can reduce the sphere of validity of the outputs. Furthermore, integrated parameters are often difficult to relate to specific measurable parameters, such as soil texture or leaf area index. Burrough (1989) related the choice of model complexity and sampling density to an economic consideration of investment. Stoorvogel (1995) noted that complex models are often avoided because of limited availability of data.

Model complexity relates not only to the processes modeled and data requirements, but also to computational requirements of the model. A simple model, such as the Universal Soil Loss Equation (USLE) (Wischmeier and Smith, 1978), can be embedded more easily in a GIS than a model with complex computational requirements, such as parallel processing, that are not provided by GIS software and hardware (Burrough, 1996).

Model complexity also affects functionality for end-users (Moore et al., 1993). The tendency to develop complex, physically based models that are difficult for users to understand is liable to grow unless common concepts and terminology are developed. Generic system-integration tools developed under a common conceptual theory have been proposed as a means to reduce the gap between theoretical GIS practitioners and discipline-oriented applications specialists (Stoorvogel, 1995).

Spatial Distribution and Type of Data
The spatial distribution of the available data can influence the strategy for interfacing. Weather stations and soil samples can have an irregular, sparse distribution. Their values can show large variation in space. Before these weather and soil data can be useful for input into a model they may need to be interpolated. Interpolation methods include kriging and cokriging (Krige, 1951; Journel and Huijbregts, 1978), splining (Hutchinson, 1991), and spatial domain methods based on state-space models (Shumway et al., 1989; Wendroth et al., 1992). The choice of spatial interpolators depends on type and distribution of the data and research objective (e.g., DeBrule, 1983).

The type of data, continuous or discontinuous, can also influence the strategy for interfacing. For example, it can be problematic to relate point measurements of discontinuous variables, such as soil taxonomic units, to a final polygon or raster structure unless borders are exactly delineated by measurement. If a variable is discontinuous, point data lose their connection to the polygon or raster unless borders are exactly delineated by measurement. In this case, Monte Carlo simulations can be used to capture the variance, as in the work of Foussereau et al. (1993) on soil variables within discontinuous soil taxonomy units. Also, fuzzy logic can be used to estimate the spatial distribution of soil types and to derive soil properties (e.g., Zhu et al., 1997).

Spatial Data Format
The appropriate spatial data format depends on the type of data or data source. Quantitative data such as climate or soil traits are often provided as interpolated surfaces in raster (gridded) format. Soil taxonomy maps and land-use data are more commonly recorded in vector (polygon) formats. The choice of raster or vector depends on the importance of spatial interactions in the process being studied and how these are handled in the model (Fraisse et al., 1994). Relative advantages and disadvantages of the two different formats are reviewed in basic texts on GIS (e.g., Burrough, 1986). Fortunately, enhancements in recent software have reduced this format incompatibility, and some interfaces can use both formats (e.g., Wilson et al., 1996). The spatial data format has consequences for subsequent analysis, particularly where spatial scales are varied. Raster data (grid cells) can be easily overlaid and can be aggregated (lumped to bigger grid cell sizes) more easily than polygon structures, which have irregular shapes. Furthermore, with polygon structures small `splinter' areas are often formed, which are hard to interpret.

Model Simulations in Relation to Type and Size of Spatial Unit
A model can be run for all spatial units (i.e., grid cells or polygons) in a study area or, to decrease the number of runs, for a subset of the spatial units. The type (interacting or noninteracting) and size of spatial units influence the selection of spatial units for the simulations.

Noninteracting Spatial Units
Noninteracting spatial units are units (grid cells or polygons) whose value does not affect the value of the neighboring unit. If the total study area is small relative to the spatial unit size, simulations may be run on all possible spatial units, and the spatial database may be used as model input without alteration. However, if the study area is large and the spatial units are small (with few classes), simulations may be run for only specific classes of units. Class values that are not farming areas, such as cities or water bodies, can be masked out. Values can be sorted and classified in an intermediate database structure, using multivariate analyses. Studying maize (Zea mays L.) yield potential of East Africa, Collis and Corbett (1997) created what they called effective environments—climate zones that were defined through cluster analysis. Model simulations were conducted for each environment only.

For large databases or small spatial units, a random subset of units can be evaluated (the Monte Carlo method). In a modified Monte Carlo, units are prestratified. For example, variable numbers of simulations are executed for different regions according to their relative importance as production areas or for specific soil types. More simulations may be executed for border cells to reduce edge effects.

Interacting Spatial Units
Spatial units are considered to interact when values of one unit affect the values of neighboring units. For interacting units, a model may have to be run for all spatial units, or else subsampling has to be carefully managed. Furthermore, the order of the simulation runs must be determined prior to running the simulation. In the case of surface run-off, the sequence of the simulations is determined by identifying the flow path over the terrain, as is done in ANSWERS (Rewerts and Engel, 1991) and ZOO (Ioannidis et al., 1997). Given that traditional agronomic models are one-dimensional and essentially ignore horizontal flow when interfaced to a GIS, this potentially important interaction is usually ignored. These applications will remain less useful in areas of variable terrain or topography.

Managing Input and Output
To analyze different scenarios, many simulations may have to be conducted for a single spatial unit. A data and programming issue that arises is how to manage long-term modeling of different inputs for one spatial unit structure. For example, if three irrigation levels and three fertilizer levels are applied to five crops for 20 years of variable soil and weather data, a single spatial unit would have 900 values for each output variable. Fortunately, advances in imaging allow outputs to be viewed as a series of images, either displayed together or as a dynamic view (animation). For example, yearly images could present differences in yield due to treatments, with summary statistics used as aids. Further development of viewing and analysis tools in this area is anticipated.


    Applications
 TOP
 NOTES
 ABSTRACT
 INTRODUCTION
 Strategies for Interfacing
 Structural Issues Affecting the...
 Applications
 Challenges for Successful...
 Discussion and Conclusions
 REFERENCES
 
In agronomy and natural resource management research, applications of interfaces of GIS and modeling have grown from primarily hydrological applications in the mid-1980s to the current wide range of applications. Ordered roughly by increasing detail of spatial scale, examples can be categorized into groups such as the following.

  1. Atmospheric modeling (Lee et al., 1993).
  2. Climate change, sensitivity and/or variability studies (Rosenzweig, 1990; Wei et al., 1994; Beinroth et al., 1998).
  3. Agroecological characterization and zonation (Aggarwal, 1995; Bouman et al., 1994).
  4. Regional risk analysis (Bouman, 1993).
  5. Scenario modeling and impact assessment, ex ante and also ex post (WRR, 1992; Stoorvogel, 1995; De Koning et al., 1993; Stockle, 1996; Lam, 1993).
  6. Hydrology, water quality, water pollution (Mamillapalli et al., 1996; Warwick and Haness, 1992; Holloway, 1992; Corwin and Loague, 1996; Kovar and Nachtnebel, 1993; Maidment, 1993).
  7. Spatial yield calculation—regional, global (Haskett et al., 1995; Van Keulen and Stol, 1995; Karthikeyan et al., 1996).
  8. Precision farming (spatial yield calculation) (Booltink and Verhagen, 1997a; Engel et al., 1997; Hoogenboom et al., 1993).

However, strict borders between the application groups do not exist. For instance, nutrient management, particularly minimizing nitrate leaching, is a cross-cutting theme, especially in Groups 5, 6, and 8. Climate change and variability may be seen as a scenario, but scenario modeling, as defined here, includes scenarios derived from policy goals (e.g., WRR, 1992). Examples of linking the DSSAT family of crop models to GIS at different spatial scales (field to regional) are presented in Table 3 .


View this table:
[in this window]
[in a new window]
 
Table 3 Applications of DSSAT models at different scales

 
By interfacing with GIS, models are often run for areas where they have not been validated. In this case, the interfacing of GIS to models serves as a sensitivity analysis of the model. White and Hoogenboom (1995) simulated dry bean (Phaseolus vulgaris L.) yields over the eastern United States and Canada, varying only weather conditions, and found that the simulated crop growth roughly matched expectations based on known crop distributions. These exercises could be extended by varying parameters such as soil depth and moisture retention. However, caution must be taken to ensure that potential users understand the difference between sensitivity analysis and calculation, and consider the limitations of spatial scale.

Additionally, care must be taken not to assume that the model application remains the same when it is interfaced to GIS. The model name or acronym may remain unchanged as the application evolves. EPIC (Erosion Productivity Impact Calculator) was first developed to examine relations among crop management practices, productivity, and soil erosion. More recently the GIS-interfaced version of the model has been used in climate sensitivity, hydrology, and water quality assessment applications. Consequently, the original acronym has acquired a new meaning, Environmental Policy Integrated Climate (Ramanarayanan, personal communication, 1997). In some cases, the confusion is reduced by giving the GIS–model interface a new name, such as AEGIS (Agricultural and Environmental Geographic Information Systems), which developed within the DSSAT (Decision Support System for Agrotechnology Transfer) framework.


    Challenges for Successful Applications
 TOP
 NOTES
 ABSTRACT
 INTRODUCTION
 Strategies for Interfacing
 Structural Issues Affecting the...
 Applications
 Challenges for Successful...
 Discussion and Conclusions
 REFERENCES
 
The large number of software systems in which models and GIS have already been interfaced (Table 1) suggests that, while interfacing per se is not a trivial exercise, it is a relatively tractable software engineering problem. In this review, other challenges are apparent. These include developing interfaced systems that achieve interactivity and satisfying data requirements while ensuring data quality control by developing methods for error analysis.

Interactivity
As mentioned above in the section on strategies for interfacing, linking interfaces often does not achieve interactivity, because spatial output is created merely by interpolating point-based simulations. The points themselves are considered independent of their neighbors when the simulations are run. A challenge to interfacing applications is to make the simulations interactive, and so truly achieve spatial modeling. This interactivity between spatial units can be achieved more easily by combining and integrating efforts.

Data Availability and Information Sharing
Because of the detailed input requirements for agronomic models, expanding the models from a point-based application to a spatial application necessarily expands the need for data. Although it is not stated in the interfacing studies, data availability must be enhanced to fully realize the potential of interfacing GIS to models.

Improved data availability may be achieved through use of additional sources, encouraging data format standards, and improving information sharing management systems. Remote sensing is thought to be a potentially important additional source of data in precision agriculture (NRC, 1997) and in county-level yield mapping (Carbone et al., 1996). It has been used to help estimate parameters for model input, such as leaf area index, soil moisture, and surface soil water evaporation (Bouman, 1995; Reiniger and Seguin, 1986; Moran et al., 1995), and to evaluate and validate results of GIS–modeling efforts (Maas, 1993; Bouman, 1995; Booltink and Verhagen, 1997b).

Data availability can also be improved by reducing data incompatibility due to physical storage (format), syntactic organization (conversion, repackaging needs), quality and accuracy, or semantic interpretation. The first two problems can be resolved by standards (Evans, 1994). Efforts to standardize input formats have been promoted for crop modeling by the International Consortium for Agricultural Systems Application (ICASA) (Hunt et al., 1994; Ritchie, 1995; Tsuji et al., 1998). These standards facilitate interchange of data, thereby increasing data availability. Differences in accuracy and semantics are harder to solve and may still inhibit information sharing (Evans, 1994).

Besides technical issues, data availability and information exchange are often affected by organizational, legal, cultural, and bureaucratic factors. There is considerable discussion on whether governments should encourage data distribution on a free or subsidized basis, as opposed to charging the full costs of data collection and distribution. Economic analyses (Porter and Callahan, 1994) suggest that data contributors should receive more benefit than they currently do. Information management policies that increase the credit to the collector and ensure the responsibility of quality and documentation are necessary. However, the benefits of open data access in agricultural systems might include long-term effects on regional economies or on the natural resource base that are difficult to quantify. Porter and Callahan (1994) provided a broad review of other issues related to the organizational, legal, and bureaucratic aspects of data sharing for environmental research.

Error Analysis
Spatial data have errors due to measurement, digitization, or interpolation. Similarly, models, being simplified representations of reality, produce output with error. How these errors interact when systems are interfaced is poorly understood, and so error analysis will become increasingly important as more models are interfaced with GIS. Users become concerned about the reliability and quality of the model outputs (Loague et al., 1998). Error analysis is also useful for assessing optimal combinations of sampling density and model complexity (Leenhardt et al., 1995). Uncertainty analysis is related to error analysis. Quantifying the effects of the uncertainty of variables on modeling can provide an indication of the reliability of the resulting calculations (Bouman, 1993; Corwin and Loague, 1996).

Conventional error propagation theory can be used to assess the quality of modeling results only if they are influenced by random errors. For data or variables stored in a GIS or used as model input, sources of error are usually functions of observations, measurements, and entry. These are random errors. However, some techniques used in GIS, such as logic models (e.g., suitability classes) contain a systematic error of unknown magnitude that error propagation theory cannot address effectively (Drummond, 1987). Burrough (1986) recognized this problem and developed propagation rules for several GIS procedures.

Other attempts at error analysis for GIS have used probability modeling. This approach is problematic, because of the variety of possible spatial data processing procedures and the rigorous requirements of probabilistic data gathering. In a GIS, two major classes of error or uncertainty can be defined, those dealing with positional error (digitizing, georeferencing) and those addressing thematic uncertainty and error. For questions of spatial variability, fuzzy surfaces are used for uncertainty analysis and Monte Carlo methods for error analysis (Davis and Keller, 1996).

In simulation modeling validation (Neelamkavil, 1987), sensitivity analysis and Monte Carlo analyses can help determine error (Bouman, 1993). Bouman (1995) also suggested using remote sensing to reduce uncertainty in modeling efforts.

Methods for analyzing error propagation in GIS and model interfaces are still lacking. Hill et al. (1996) estimated error using an iterative Monte Carlo process for a range of model parameters, grid resolutions, and value estimates where the rules of Burrough (1986) were not applicable. Data resolution and model organization are often changed to interface GIS and models. Error can increase because of the aggregation. De Roo et al. (1989) found that simulations with the GIS-interfaced version of a model calculated 46% more runoff and 36% more erosion than with the original model. Stallings (personal communication 1997) found that aggregated soil data led to a 100% error in model outputs. These results suggest that there is still a poor understanding of how up- and down-scaling influence error propagation, when models are interfaced to GIS.


    Discussion and Conclusions
 TOP
 NOTES
 ABSTRACT
 INTRODUCTION
 Strategies for Interfacing
 Structural Issues Affecting the...
 Applications
 Challenges for Successful...
 Discussion and Conclusions
 REFERENCES
 
In reviewing existing GIS–model interfaces, this study identified `linking,' `combining,' and `integrating' as suitable terminology for characterizing basic strategies for interfacing agronomic models with GIS. Structural issues such as scale of models and the type, distribution, and scale of data were discussed.

Although there is an increased availability of user interfaces for linking GIS to simulation modeling, there is no guarantee that this improves science. On the contrary, in working with complex interfaces, users have fewer incentives to learn basic concepts, procedures, and the limitations of the underlying systems. Questions on the contribution of complex systems to our problem-solving capacity have been raised in crop modeling research (Passioura, 1996; Sinclair and Seligman, 1996). Both GIS and simulation models have been developed with their own conventions, procedures and limitations. However, linking them at a technical level does not guarantee improved understanding nor useful prediction (Burrough, 1996). There is a danger that calibration, validation, and error analysis will be neglected if GIS–modeling interfaces become too easy to use (Burrough, 1996).

Major challenges lie in achieving full interactivity of a GIS and a model, and in satisfying spatial data requirements while ensuring data quality control through error analysis. Qualitative and subjective procedures are often used for spatial analysis in GIS, and the resulting information loses much of its relevance and statistical validity (Stoorvogel, 1995). More quantitative quality indicators, together with spatial statistics and error analysis, are needed to improve the value of GIS–modeling interfaces.Longman Dictionary of the English Language 1984; National Science and Technology Council. 1998


    NOTES
 TOP
 NOTES
 ABSTRACT
 INTRODUCTION
 Strategies for Interfacing
 Structural Issues Affecting the...
 Applications
 Challenges for Successful...
 Discussion and Conclusions
 REFERENCES
 
Presented at Annu. Meet. ASA, 89th, Anaheim, CA, 26–31 Oct. 1997.

Received for publication March 26, 1998.
    REFERENCES
 TOP
 NOTES
 ABSTRACT
 INTRODUCTION
 Strategies for Interfacing
 Structural Issues Affecting the...
 Applications
 Challenges for Successful...
 Discussion and Conclusions
 REFERENCES