Lecture 4 - Density Based Methods
Lecture 4 - Density Based Methods
Lecture 4 - Density Based Methods
Cluster Analysis
1. What is Cluster Analysis?
2. Types of Data in Cluster Analysis
3. A Categorization of Major Clustering Methods
4. Partitioning Methods
5. Hierarchical Methods
6. Density-Based Methods
7. Grid-Based Methods
8. Model-Based Methods
9. Clustering High-Dimensional Data
10. Constraint-Based Clustering
11. Outlier Analysis
12. Summary
based)
11/1/22 Data Mining: Concepts and Techniques 2
Density-Based Clustering: Basic Concepts
Two parameters:
Eps: Maximum radius of the neighbourhood
MinPts: Minimum number of points in an Eps-
neighbourhood of that point
NEps(p): {q belongs to D | dist(p,q) <= Eps}
Directly density-reachable: A point p is directly density-
reachable from a point q w.r.t. Eps, MinPts if
p belongs to NEps(q)
core point condition: p MinPts = 5
q Eps = 1 cm
|NEps (q)| >= MinPts
Outlier
Border
Eps = 1cm
Core MinPts = 5
techniques
Reachability Distance o
p2
o
Max (core-distance (o), d (o, p))
MinPts = 5
r(p1, o) = 2.8cm. r(p2,o) = 4cm
11/1/22 e = 3 cm
Data Mining: Concepts and Techniques 10
Reachability
-distance
undefined
‘
Cluster-order
of the objects
11/1/22 Data Mining: Concepts and Techniques 11
Density-Based Clustering: OPTICS & Its Applications
d ( x , xi ) 2
D N 2
f Gaussian ( x) i 1
e 2
d ( x , xi ) 2
( x, xi ) i 1 ( xi x) e
D N
2 2
Major features f Gaussian