Database Assignment 3
Database Assignment 3
Database Assignment 3
LEVEL: 3.2
LECTURER: R. GUMBO
-The unique characteristics of spatiotemporal datasets are that they carry distance and
topological information which require geometric and temporal computation.
-Spatial and temporal relationships like distance, topology, direction, before and after
are information bearing. They need to be considered in spatiotemporal data analysis
and mining.
-Spatial and temporal relationships are implicitly defined. They are not explicitly
encoded in a database. These relationships must be extracted from data. There is a
trade-off between preprocessing them before the actual mining process starts and
computing them on-the fly as and when they are actually needed.
-Scale effect in space and time is a challenging issue in spatiotemporal data analysis
and mining. Scale in terms of spatial resolution or temporal granularity can have a direct
impact on the kind and strength of spatiotemporal relationships that can be discovered
in datasets.
-The attributes of neighbouring patterns may have significant influence on a pattern and
should be considered. For example, spatiotemporal event like hurricane will have
influence on traffic jam pattern.
-Many rules of qualitative reasoning (ex: transitive property) on spatial and temporal
data provide a valuable source of domain independent knowledge that should be taken
into account when generating patterns. How to express rules and how to integrate them
with spatiotemporal reasoning mechanism is an issue.
-Development of effective visual interfaces for viewing and manipulating the geometrical
and temporal attributes of spatiotemporal data is another challenge.
(K, 2012)
QUESTION 2
Describe the differences between the following approaches for the integration of
a data mining system with a database or data warehouse system: no coupling,
loose coupling, semitight coupling, and tight coupling. State which approach you
think is the most popular, and why. [10]
No Coupling: In this scheme, the data mining system does not utilize any of the
database or data warehouse functions. It fetches the data from a particular source and
processes that data using some data mining algorithms. The data mining result is stored
in another file.
Loose Coupling: In this scheme, the data mining system may use some of the
functions of database and data warehouse system. It fetches the data from the data
respiratory managed by these systems and performs data mining on that data. It then
stores the mining result either in a file or in a designated place in a database or in a
data warehouse. Loose coupling is better than no coupling because it can fetch any
portion of data stored in databases or data warehouses by using query processing,
indexing, and other system facilities.(Integration of a Data Mining System with a
Database or Data Warehouse System) However, many loosely coupled mining systems
are main memory-based. Because mining does not explore data structures and query
optimization methods provided by database or data warehouse systems, it is difficult for
loose coupling to achieve high scalability and good performance with large data sets.
Semi-tight Coupling: In this scheme, the data mining system is linked with a database
or a data warehouse system and in addition to that, efficient implementations of a few
data mining primitives can be provided in the database. These primitives can include
sorting, indexing, aggregation, histogram analysis, multi way join, and precomputation
of some essential statistical measures, such as sum, count, max, min, standard
deviation,
Tight coupling: In this coupling scheme, the data mining system is smoothly integrated
into the database or data warehouse system. The data mining subsystem is treated as
one functional component of an information system. Data mining queries and functions
are optimized based on mining query analysis, data structures, indexing schemes, and
query processing methods of a database or data warehouse system. This approach is
highly desirable because it facilitates efficient implementations of data mining functions,
high system performance and an integrated information processing environment
Tight coupling is highly desirable, but its implementation is nontrivial and more reserch
is needed in the area. Semi-tight coupling would be the most suitable choice because it
is a common choice and an even compromise between loose and tight coupling.(Data
Mining, Southeast Asia Edition - Jiawei Han, Jian Pei, Micheline Kamber - Google
Books)
Identify any two database modelling software that are suitable for your
department in the mining environment and highlight its features/functions,
advantages and also the weakness of the modelling software in your
organisation. [15]
References
Corvus Axiom vs iGantt vs Micromine vs Vulcan - 2020 Feature and Pricing
Comparison (no date). Available at:
https://www.capterra.com/mining-software/compare/170815-129663-135411-93452/
Corvus-Axiom-vs-Micromine-vs-Vulcan-vs-iGantt (Accessed: 28 May 2020).
Data Mining, Southeast Asia Edition - Jiawei Han, Jian Pei, Micheline Kamber - Google
Books (no date). Available at: https://books.google.co.zw/books?id=AfL0t-
YzOrEC&pg=PA34&lpg=PA34&dq=what+is+the+best+Integration+Of+A+Data+Mining+
System+With+A+Database+Or+Data+Warehouse+System&source=bl&ots=Ux-
RwMdrF4&sig=ACfU3U0ydbkzZ3UUvabmnWp8XIh5kEInmw&hl=en&sa=X&ved=2ahU
KEwiFx6nf49TpAhVGiVwKHdyeDNwQ6AEwDnoECAgQAQ#v=onepage&q=what is the
best Integration Of A Data Mining System With A Database Or Data Warehouse
System&f=false (Accessed: 28 May 2020).
Data Mining - Systems - Tutorialspoint (no date). Available at:
https://www.tutorialspoint.com/data_mining/dm_systems.htm (Accessed: 27 May 2020).
Integration of a Data Mining System with a Database or Data Warehouse System (no
date). Available at: http://www.brainkart.com/article/Integration-of-a-Data-Mining-
System-with-a-Database-or-Data-Warehouse-System_8310/ (Accessed: 27 May 2020).
K, V. R. (2012) ‘Spatiotemporal Data Mining: Issues, Tasks And Applications’,
International Journal of Computer Science & Engineering Survey, 3(1), pp. 39–52. doi:
10.5121/ijcses.2012.3104.
Maptek - 3D Mine Planning, Mine Design, Geology & Scheduling in Vulcan (no date).
Available at: https://www.maptek.com/products/vulcan/ (Accessed: 28 May 2020).