CS231. Nhập môn Thị giác máy tính: Image segmentation

TRƯỜNG ĐẠI HỌC CÔNG NGHỆ THÔNG TIN
KHOA KHOA HỌC MÁY TÍNH
CS231. Nhập môn Thị giác máy tính
Image segmentation
Mai Tiến Dũng

Image Segmentation
• Image segmentation is the operation of

partitioning an image into a collection of
connected sets of pixels.
• Segmentation in easy words is assigning
labels to pixels. All picture elements or pixels
belonging to the same category have a
common label assigned to them.
2
Image Segmentation
• a
Feature
Image Segmentation Classification
Extraction
Input Feature Vector Object Type
Object Image
Image
3
Approaches in Image Segmentation
1. Similarity approach
2. Discontinuity approach
4
1. Similarity approach:
• Based on detecting similarity between image
pixels to form a segment, based on a threshold.
• ML algorithms like clustering are based on this
type of approach to segment an image.
5
2. Discontinuity approach:
• This approach relies on the discontinuity of pixel
intensity values of the image.
• Line, Point, and Edge Detection techniques use
this type of approach for obtaining intermediate
segmentation results which can be later
processed to obtain the final segmented image.
6
Groups of image segmentation
• Semantic segmentation
• Instance segmentation
7
• Semantic segmentation is an approach

detecting, for every pixel, belonging class of
the object.
• For example, when all people in a figure are
segmented as one object and background as
one object.
8
• Instance segmentation is an approach that

identifies, for every pixel, a belonging instance
of the object. It detects each distinct object of
interest in the image. For example, when each
person in a figure is segmented as an
individual object.
9
10
11
• Object Detection
 to classify and localize all the objects in the
image
12
• Semantic Segmentation
• The goal of semantic image segmentation is to label each pixel of an

image with a corresponding class of what is being represented. Because
we’re predicting for every pixel in the image, this task is commonly
referred to as dense prediction.
13
• Instance segmentation
 to classify each instance of a class separately
14
15
Image Segmentation Techniques
1. Threshold Based Segmentation

2. Edge Based Segmentation
3. Region-Based Segmentation
4. Clustering Based Segmentation
5. Artificial Neural Network Based Segmentation
16
1. THRESHOLD BASED SEGMENTATION
17
Threshold Based Segmentation
• To create a binary or multi-color image based

on setting a threshold value on the pixel
intensity of the original image.
• How to find an automatic threshold ?

 considering variations in illumination & surface
• One popular approach : histogramming
18
Threshold Based Segmentation
• Consider the intensity histogram of all the

pixels in the image  then we will set a
threshold to divide the image into sections.
19
Thresholding techniques
1. Global thresholding
2. Manual thresholding
3. Adaptive Thresholding
4. Optimal Thresholding
5. Local Adaptive Thresholding
20
Thresholding techniques: Global
• Use a bimodal image  with 2 peaks of

intensities in the intensity distribution plot.
– One for the object and
– One for the background.
• Deduce the global threshold value for the
whole image.
• A disadvantage of this type of threshold is that
it performs really poorly during poor
illumination in the image.
21
Thresholding techniques: Global
• Find the “peaks” and “valleys” of the histogram

• Set threshold to the pixel value of the “valley”
• Non-trivial to find peaks/valleys :

– ignore local peaks; choose peaks at a distance find the valley between
those peaks
– Maximize “peakiness” (difference btw peaks & valleys) to find the
threshold as valley
Thresholding techniques: Manual
• Iterative threshold selection

• The following process goes as follows
24
Thresholding techniques: Manual
25
Thresholding techniques: Adaptive
• To overcome the effect of illumination, the

image is divided into various subregions, and
all these regions are segmented using the
threshold value calculated for all these
regions. Then these subregions are combined
to image the complete segmented image.
• This helps in reducing the effect of illumination
to a certain extent.
26
27
28
Thresholding techniques: Optimal
• Optimal thresholding technique can be used to

minimize the misclassification of pixels
performed by segmentation.
29
Thresholding techniques: Optimal
• The probability of a pixel value is given by the

following formulae :
30
Thresholding techniques: Local Adaptive
• Due to variation in the illumination of pixels in

the image, global thresholding might have
difficulty in segmenting the image.
• Hense the image is divided into smaller
subgroups and then adaptive thresholding of
those individual groups is done.
31
Thresholding techniques: Local Adaptive
• After individual segmentation of these

subgroups, all of them are combined to form
the completed segmented image of the
original image.
• Hence, the histogram of subgroups helps in
providing better segmentation of the image.
32
Thresholding techniques: Double
• Starting from a conservative initial threshold T1,

determine the “core” parts of the object
• Continuing from this core part, grow this object by
including neighboring pixels which are between T1 and
T2
Thực hành 1 – Baitap5
• Áp dụng phân đoạn ảnh cho các ảnh:

1. Chọn ngưỡng: global
• particles.bmp
• Phandoan01.jpg
• wdg2.jpg
• Rice.png
2. Chọn ngưỡng: Adaptive
• Ảnh wdg3.jpg
34
def drawHist(x):
hist = cv.calcHist([x], [0], None, [256], [0, 256])
plt.plot(hist)
plt.xlim([0, 256])
plt.legend(('histogram'), loc = 'upper left')
plt.show()
35
def globalThresholding(img, thres=127):
img_rst = img.copy()
for i in range(img.shape[0]):
for j in range(img.shape[1]):
if img_rst[i][j] < thres:
img_rst[i][j] = 255
else:
img_rst[i][j] = 0
return img_rst
36
def adaptiveThresholding(f, nrow, ncol):
g = f.copy()
r = int(f.shape[0] / nrow)
c = int(f.shape[1] / ncol)
for i in range(int(nrow)):
for j in range(int(ncol)):
x = f[i * r : (i + 1) * r, j * c : (j + 1) * c]
cv2_imshow(x)
drawHist(x)
#t = np.average(x)
t = int(input())
g[i * r : (i + 1) * r, j * c : (j + 1) * c] = globalThres
holding(x, t)
return g
37
Edge Based Segmentation
38
• Rely on edges found in an image using

various edge detection operators.
• These edges mark image locations of
discontinuity in gray levels, color, texture, etc.
When we move from one region to another,
the gray level may change. So if we can find
that discontinuity, we can find that edge.
39
• A variety of edge detection operators are

available but the resulting image is an
intermediate segmentation result and should
not be confused with the final segmented
image.
• We have to perform further processing on the
image to the segment it.
40
• Additional steps include combining edges

segments obtained into one segment in order
to reduce the number of segments rather than
chunks of small borders which might hinder
the process of region filling. This is done to
obtain a seamless border of the object.
• The goal of edge segmentation is to get an
intermediate segmentation result to which we
can apply region-based or any other type of
segmentation to get the final segmented 41
image.
Types of edges
42
• Tham khảo bài học về thông tin cạnh - edge
43
Clustering-Based Segmentation
44
Clustering-Based Segmentation
• Clustering is a type of unsupervised machine

learning algorithm. It is highly used for the
segmentation of images.
• One of the most dominant clustering-based
algorithms used for segmentation is K-means
Clustering. This type of clustering can be used
to make segments in a colored image.
45
Partitioning Clustering
• Partitioning Clustering methods subdivides the

data into k groups, where k is a number
predefined by the user. For K-means
Clustering which is the most popular
Partitioning Cluster method
46
K-means
Algorithm:
1. We choose k random points in the data as the center of
clusters and assign each point to the nearest cluster by
looking at the L2 distance between the point and the
center.
2. Compute the mean of each cluster, assign that mean value
as the new center of the cluster.
3. Reassign each data point to its closest cluster center.
Repeat step 2.
The process keeps going until no new assignment is performed (so the
model is converged, there is nothing to go furthermore) or for a given
number of iteration. Therefore, K-means Clustering is an iterative method
where we can determine the iteration number too.
47
K-means
• Step-1 : randomly pick k centers

K-means
• Step 2: Assign each point to nearest center

K-means
• Step 3: re-estimate centers

K-means
• Step 4: Repeat
K-means
• Step 4: Repeat
K-means
• Step 4: Repeat
K-means
• Ground-truth vs k-means
Ground truth K-means 100 iterations

K-means on image pixels
path_filename = os.path.join(dir_baitap, 'vegetables.jpg')
img = cv.imread(path_filename)
cv2_imshow(img)
from sklearn.cluster import KMeans
def Kmeans(img, n_clusters = 6):
nrow, ncol,nchl = img.shape
g = img.reshape(nrow*ncol,nchl)
k_means = KMeans(n_clusters = n_clusters,
random_state = 0).fit(g)
t = k_means.cluster_centers_[k_means.labels_]
img_res = t.reshape(nrow, ncol, nchl)
return img_res
55
Iteration 1 Iteration 5
Final: Iteration 17
Picture courtesy David One of the clusters from

Forsyth k-means
Thực hành
• Representing each pixel as (r,g,b)

– Áp dụng cho ảnh: vegetables.jpg, hand.jpg,
thuoc.jpg
• Represent each pixel as (r,g,b,x,y)
– Áp dụng cho ảnh: vegetables.jpg và thuoc.jpg
---------------------------------
Áp dụng thuật toán k-means
59
def Kmeans2(img, n_clusters = 6):
img_tmp = img.copy()
nrow, ncol,nchl = img.shape
g = []
for y in range(nrow):
for x in range(ncol):
tmp = [img_tmp[y,x][0], img_tmp[y,x][1], img_tmp[y,x][2], x, y]
g.append(tmp)
k_means = KMeans(n_clusters=n_clusters, random_state=0).fit(g)

# t = k_means.cluster_centers_[k_means.labels_]
arrcolor = np.random.rand(20,3) * 255

t = arrcolor[k_means.labels_]
img_res = img_tmp
i = 0
for y in range(nrow):
for x in range(ncol):
img_res[y, x] = t[i][0:3]
i += 1
return img_res 60

CS231. Nhập môn Thị giác máy tính: Image segmentation

Uploaded by

Copyright:

Available Formats

CS231. Nhập môn Thị giác máy tính: Image segmentation

Uploaded by

Document Information

Original Description:

Original Title

Copyright

Available Formats

Share this document

Share or Embed Document

Sharing Options

Did you find this document useful?

Is this content inappropriate?

Copyright:

Available Formats

CS231. Nhập môn Thị giác máy tính: Image segmentation

Uploaded by

Copyright:

Available Formats

TRƯỜNG ĐẠI HỌC CÔNG NGHỆ THÔNG TIN

KHOA KHOA HỌC MÁY TÍNH

CS231. Nhập môn Thị giác máy tính

Mai Tiến Dũng

• Image segmentation is the operation of

• Semantic segmentation is an approach

• Instance segmentation is an approach that

• The goal of semantic image segmentation is to label each pixel of an

1. Threshold Based Segmentation

• To create a binary or multi-color image based

• How to find an automatic threshold ?

• Consider the intensity histogram of all the

• Use a bimodal image  with 2 peaks of

• Find the “peaks” and “valleys” of the histogram

• Non-trivial to find peaks/valleys :

• Iterative threshold selection

• To overcome the effect of illumination, the

• Optimal thresholding technique can be used to

• The probability of a pixel value is given by the

• Due to variation in the illumination of pixels in

• After individual segmentation of these

• Starting from a conservative initial threshold T1,

• Áp dụng phân đoạn ảnh cho các ảnh:

• Rely on edges found in an image using

• A variety of edge detection operators are

• Additional steps include combining edges

• Clustering is a type of unsupervised machine

• Partitioning Clustering methods subdivides the

• Step-1 : randomly pick k centers

• Step 2: Assign each point to nearest center

• Step 3: re-estimate centers

Ground truth K-means 100 iterations

path_filename = os.path.join(dir_baitap, 'vegetables.jpg')

Picture courtesy David One of the clusters from

• Representing each pixel as (r,g,b)

k_means = KMeans(n_clusters=n_clusters, random_state=0).fit(g)

arrcolor = np.random.rand(20,3) * 255

You might also like