GIS and Spatial Analysis in Urban and Regional Research: Derek Bond
Derek Bond
Abstract
The early optimism surrounding GIS has now faded. Few GIS implementations have
lived up to expectations. The most successful have been those at the operational level,
where their main role has been to replace paper maps with digital ones. However, as the
cost of GIS implementation falls and spatial data becomes more readily available, there
is a need to consider the future of GIS and spatial analysis. This is particularly important
in the field of urban and regional research, especially where considerable investment in
GIS has already been made. This paper looks at the major issues that face regional and
urban research in the next stage of development of GIS and spatial analysis. The main
conclusion of the paper is that regional and urban statistics offices and researchers
are well placed to benefit from future developments in GIS and spatial analysis.
Derek Bond
School of Business, Retail and Financial Services
University of Ulster at Coleraine
Cromore Rd.
Coleraine , Co. Derry
United Kingdom
BT52 1SA
e-mail: [email protected]
C&R ISSN 1568-167X
(cf. Masser (1999) for discussion of the issues surrounding the development of national
spatial infrastructures). The availability of topographic data has reduced the overheads
associated with developing a GIS and has led to an increase in demand for more
spatially disaggregated and referenced attribute data to use with such systems. This
demand and a lack of national and international spatial data infrastructures have various
implications for those involved in regional and urban research and analysis. The problems
most commonly reported in the literature are attaching consistent spatial
references to attribute data and meeting the requirement for finely detailed small-area
statistics that do not compromise confidentiality. Caught in the middle of these problems
are Urban and Regional Statistics Offices. Most of these offices are both data users and
providers. While involved in some primary data collection, often as a by-product of the
administrative processes of their parent authorities, they commonly add value to data
collected by National Statistical Institutes (NSIs) and other bodies. These
secondary data sets frequently lack spatial references that are consistent with the
topographic data or other attribute data used. The issue of attaching spatial identifiers is
a complex one (see, for example, Visvalingam (1991) for discussion) and is linked to that
of statistical confidentiality.
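As a rough illustration of the spatial referencing problem, the following minimal sketch joins attribute records to a set of known reporting areas after reducing the area codes to a common form, and keeps the unmatched records aside for manual checking. The function names, the `area_code` field, the codes and the area names are all hypothetical, and real offices will need far more elaborate matching rules than the simple normalisation assumed here.

```python
def normalise_code(code):
    """Reduce an area code to a canonical form: no whitespace, upper case."""
    return "".join(str(code).split()).upper()


def attach_spatial_reference(records, area_lookup, key="area_code"):
    """Split records into those matched to a known area and those left unmatched.

    `records` is a list of dicts and `area_lookup` maps area codes to area
    names. Both structures are assumptions made for this sketch.
    """
    canonical = {normalise_code(code): area for code, area in area_lookup.items()}
    matched, unmatched = [], []
    for record in records:
        area = canonical.get(normalise_code(record[key]))
        if area is None:
            unmatched.append(record)  # no consistent spatial reference found
        else:
            matched.append({**record, "area": area})
    return matched, unmatched


# Hypothetical codes and counts, for illustration only.
area_lookup = {"A 01": "Area A", "A 02": "Area B"}
records = [
    {"area_code": "a01", "households": 120},
    {"area_code": "A 02", "households": 95},
    {"area_code": "Z 99", "households": 40},
]
matched, unmatched = attach_spatial_reference(records, area_lookup)
```

Reporting the unmatched records separately, rather than silently dropping them, makes the extent of any inconsistency between the secondary data and the topographic data visible.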
Statistical confidentiality is often cited as a major issue hindering the development of
GIS. The problem is that, in an ideal world, a GIS would contain the basic records of any
attribute dataset with consistent spatial references attached. In most countries this is
impossible, as statistical disclosure rules do not permit the identification of individuals or
single organisations (see, for example, Marsh et al. (1992) for discussion of confidentiality
surrounding the United Kingdom's 1991 Census of Population, and Eurostat (1992)
for general discussion of various issues surrounding statistical confidentiality). The issues
of statistical confidentiality and spatial tagging are addressed in this volume by
Wendy Treadwell.
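One simple way in which disclosure rules constrain small area statistics can be sketched as threshold suppression: any count that falls below a minimum cell size is withheld before publication. The sketch below is an assumption-laden illustration only. The threshold of 10, the function name and the area labels are hypothetical, and actual disclosure control rules differ between countries and datasets.

```python
def suppress_small_counts(area_counts, threshold=10):
    """Return a copy of area_counts with counts below `threshold` set to None.

    The threshold is a hypothetical value chosen for illustration; it is not
    taken from any particular statistical office's rules.
    """
    return {
        area: (count if count >= threshold else None)
        for area, count in area_counts.items()
    }


# Hypothetical small-area counts.
counts = {"Area A": 25, "Area B": 3, "Area C": 10}
published = suppress_small_counts(counts)
```

The finer the spatial disaggregation, the more cells fall below any such threshold, which is why the demand for detailed small area data and the confidentiality requirement pull in opposite directions.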
While spatial tagging and statistical confidentiality are problems that need to be
addressed for the successful implementation of GIS in an organisation, GIS also
provides tools with which to consider these issues. For example, GIS software is often
used to assist in the construction of statistically 'sound' spatial reporting regions.
However, as Coombes (1998) concluded, while discussing the construction of
'Travel To Work Areas' for England and Wales: '... the methods and software which is
needed for defining new sets of areas -..- are still too specialised to be found in many
GIS packages'.
The problems facing urban and regional statistics offices are twofold. They are often
trying to combine data from various sources to provide high quality insights
into the functioning of their areas, whilst being seen as the first port of call by
researchers and other end users looking for detailed secondary data on the area.
Much innovative research has been done by these offices to integrate spatial data
sets and provide consistent data and information to their end users (see, for example,
Tammilehto-Luode and Backer (1999) for a description of a Scandinavian exercise
to provide small area statistics using GIS).
The visual element of GIS has always played a large part in its popular appeal, and research
has shown (Smelcer & Carmel (1997)) that, provided the task is relatively simple, a map can
provide an ideal starting point for information discovery. This has led to growth in GIS-based
interactive information-seeking software. The aim of this software is to replace traditional
text-based information retrieval strategies with ones based on visualisation and knowledge
discovery in databases (Fabrikant (2000)). Knowledge Discovery in Databases (KDD) has
been defined as 'the non-trivial process of identifying valid, novel, potentially useful, and
ultimately understandable patterns in data' (Fayyad et al. (1996) p.6). KDD is closely linked
with data mining and uses the computational power of modern IT to explore data sets rather
than relying on data reduction techniques. A simple example of the use of visualisation and
KDD is the Casweb Interface to the United Kingdom's 1991 Census of Population
(http://jimay.mcc.ac.uk/casweb/). In this volume the paper by Hans Voss et al. looks at, amongst
other things, the issues surrounding the implementation of map interfaces to complex data and
the role of KDD.
SPATIAL DATA ANALYSIS
It is often argued (cf. Csillag and Kabos (1999)) that developments in methods of spatial
analysis have failed to keep up with those in GIS, thus hindering its uptake. While there may
be some truth in this, most of the discussion is couched in the traditional Fisher-Neyman
paradigm, and emphasis is put on the need for parametric estimation and model building rather
than on KDD and Geographic Visualisation (GVis) (cf. MacEachren et al. (1999)). The
development in the use of these techniques has come from two distinct fields. GVis has been
developed by mathematical geographers whereas KDD has grown from work on non-spatial
data mining in mathematics.
Traditional spatial analysis has also developed and adopted a more Exploratory Data Analysis
(EDA) approach. The paper in this volume by Peter Brown et al. is a good example, using
geodemographic profiling, of the current policy interface between GIS and spatial analysis.
The problem with approaches such as this is the reliance on the results of one data reduction
exercise. GVis and KDD allow the user to explore larger, more complex datasets more
flexibly. The paper by Hans Voss et al. in this volume looks at the issues of visualisation and
KDD in spatial datasets and the paper by Maribel Santos and Luis Amarel gives an example
of how spatial reasoning might be applied in KDD.
GIS AND SPATIAL ANALYSIS SOFTWARE
To many end users, the talk of GVis and KDD in spatial datasets seems a distant dream, as
traditional GIS software, other than simple raster-based mapping packages, has had the
reputation of being extremely expensive (Pinals (1998)). However, as the papers by Patrick
Gerland et al. and Dang Van Duc in this volume show, there is now a wealth of low cost and
free software available for end users. Similarly, the paper by Maribel Santos and Luis Amarel
shows how KDD might be done on spatial datasets using standard software, and the paper by
Natali Andrienko et al. describes readily available visualisation and KDD software.
In the first of these papers, Patrick Gerland et al. describe the GIS software developed by the
United Nations for population research, which includes software for capturing, storing,
retrieving and analysing spatial data. All of it is readily available on the web
(http://www.undo.org/popin/softproj/index.htm) and is Windows-based for ease of use. In the
second of these papers, Dang Van Duc outlines the web-based MapOnline software, which has
been developed from the UN's PopMap software. The paper by Maribel Santos and Luis Amarel
gives an example of how the SPSS add-in 'Clementine' might be used for spatial KDD.
Clementine is a readily available data mining program and again has an easy-to-use windowed
interface. The last of these papers, by Natali Andrienko et al., discusses the use of the Java-based
Descartes software (http://allanon.gmd.de/and/java/iris/). This software, like MapOnline,
is a web-based spatial analysis tool allowing for KDD.
References
Campbell, H. and I. Masser (1995): GIS and Organisations. London; Bristol, PA: Taylor and
Francis.
City of Helsinki (1999): Comparative City Statistics 1999 City of Helsinki Urban Facts,
Yliopistopaino, Helsinki.
City of Nuremburg (1995): Facts and Figures in Comparison 1994 City of Nuremburg,
International Relation Office, Nuremburg.
Csillag F., and S. Kabos (1999): 'Convergence between GIS and spatial statistics: What?
Where? When?' Bulletin of the International Statistical Institute, Vol. 58, book 1, pp. 249-252.
Department of the Environment (1987): Handling Geographic Information: report of the
Committee of Enquiry chaired by Lord Chorley. London: H.M.S.O.
Eurostat (1992): International Seminar on Statistical Confidentiality: Proceedings,
Eurostat/ISI.
Fabrikant S.I. (2000): 'Spatialized Browsing in Large Data Archives' Transactions in GIS Vol. 4,
pp. 65-78.
Fayyad, U., G. Piatetsky-Shapiro and P. Smyth (1996): 'From Data Mining to Knowledge
Discovery in Databases: An Overview'. In Fayyad, U., G. Piatetsky-Shapiro, P. Smyth and R.
Uthurusamy (eds.) Advances in Knowledge Discovery and Data Mining. Menlo Park, CA:
AAAI Press/MIT Press.
Garcia-Molina, H., J. Widom, J. Wiener, W. Labio, B. Lent and Y. Zhuge (1995)
Hendrik P.H.J. (1998): 'Information Strategies for Geographic Information Systems'
International Journal of Geographic Information Science, Vol. 12, pp. 621-639.
Hendrik P.H.J. (2000): 'An Organisational Learning perspective on GIS' International
Journal of Geographic Information Science, Vol. 14, pp. 373-396.
Lee-Smeltzer, K.H. (2000): 'Finding the Needle: controlled vocabularies, resource discovery
and Dublin Core' Library Collections Acquisitions and Technical Services, Vol. 24, pp. 205-215.
Lievesley, D. (1992): 'The Role of the ESRC's Data Archive in the Dissemination of Data for
Secondary Analysis' Journal of the Market Research Society, Vol. 35, pp. 267-278.
Jarke, M., C. Quix, D. Calvanese, M. Lenzerini, E. Franconi, S. Ligoudistianos, P. Vassiliadis,
and Y. Vassiliou (2000): 'Concept based design of data warehouses: The DWQ
demonstrators' Sigmod Record, Vol. 29, pp. 591-591.
MacEachren, A.M., M. Wachowicz, R. Edsall, D. Haug and R. Masters (1999): 'Constructing
knowledge from multivariate Data: integrating geographical visualisation with knowledge
discovery in database methods' International Journal of Geographic Information Science,
Vol. 13, pp. 311-334.
Pinals, D. (1998): 'Choosing A Mapping Software package that suits your company's needs'
Business Geographics, July 98.
Smelcer, J. and E. Carmel (1997): 'The effectiveness of different representations for
managerial Problem Solving: Comparing Maps and tables' Decision Science,
Vol. 28, pp. 391-420.
Smith, T. (1996): 'Alexandria digital library' Communications of the ACM,
Vol. 38, pp. 61-62.
Worrall, L. and D. Bond (1996): 'Geographical Information Systems, Spatial Analysis and
Public Policy' International Statistical Review, Vol. 65, pp. 365-379.