Lecture - 1 02032023 095637am 1 29022024 124126pm
Lecture - 1 02032023 095637am 1 29022024 124126pm
Lecture - 1 02032023 095637am 1 29022024 124126pm
Introduction
Recommended Books
Data Mining: Concepts and Techniques, Second Edition
by
Jiawei Han and Micheline Kamber
How?
Well defined
SQL
Poorly defined
Database Output
Precise
Subset of database
Data Mining Output
Fuzzy
Not a subset of database
Query Examples
Database
Find all credit applicants with last name of Smith.
Identify customers who have purchased more than
$10,000 in the last month.
Data Mining
Find all credit applicants who are poor credit risks.
(Classification)
Identify customers with similar buying habits.
(Clustering)
Find all items which are frequently purchased with
milk. (Association rules)
Why Data Mining
Credit ratings/targeted marketing:
Given a database of 100,000 names, which persons are the least
likely to default on their credit cards?
Identify likely responders to sales promotions
Fraud detection
Which types of transactions are likely to be fraudulent, given the
demographics and transactional history of a particular customer?
Customer relationship management:
Which of my customers are likely to be the most loyal, and which
are most likely to leave for a competitor? :
Data Mining:
Use of algorithms to extract the information and
patterns derived by the KDD process.