Improving Image Retrieval Performance by Using Both Color and Texture Features
Improving Image Retrieval Performance by Using Both Color and Texture Features
Improving Image Retrieval Performance by Using Both Color and Texture Features
Dengsheng Zhang Gippsland School of Comp. and Info. Tech. Monash University, Churchill, Victoria 3842, Australia Email: [email protected] Abstract
Most content-based image retrievals (CBIR) use color as image features. However, image retrieval using color features often gives disappointing results because in many cases, images with similar colors do not have similar content. Color methods incorporating spatial information have been proposed to solve this problem, however, these methods often result in very high dimensions of features which drastically slow down the retrieval speed. In this paper, a method combining both color and texture features of image is proposed to improve the retrieval performance. Given a query, images in the database are firstly ranked using color features. Then the top ranked images are re-ranked according to their texture features. Results show the second process improves retrieval performance significantly. Keywords: CBIR, color, texture, image retrieval. of the image in some cases. Therefore, we provide two alternatives to user, one is the retrieval based on color features, the other is retrieval based on combined features. When the retrieval based on color fails, the user will use the other alternative which is the combined retrieval. By integrating these two alternatives, retrieval performance is improved significantly. The rest of the paper is organized as following. In Section 2, the color descriptor is presented. Section 3 briefly describes the rotation invariant Gabor texture features. Section 4 describes the indexing and retrieval process. Retrieval results and performance will be reported in Section 5, and the paper is concluded in Section 6.
1. Introduction
In recent years, Content Based Image Retrieval (CBIR) has been proposed in attempt to index image automatically and objectively [1, 2]. In CBIR, images in database are represented using such low level image features as color, texture and shape, which are extracted from images automatically. Among these low level features, color features are the most widely used features for image retrieval because color is the most intuitive feature and can be extracted from images conveniently [3, 4]. However, image retrieval using color features often gives disappointing results, because in many cases, images with similar colors do not have similar content. This is due to the global color features computed often fails to capture color distributions or textures within the image. Several methods have been proposed to incorporate spatial color information in attempt to avoid color confusion by machine [5, 6, 7, 8]. However, these methods often results in very high dimensions of features which drastically slow down the retrieval speed. In this paper, we propose a method combining both color and texture features to improve retrieval performance. We compute both the color and texture features from the images and images in the database are indexed using both types of features. During the retrieval process, given a query image, images in the database are firstly ranked using color features. Then in the second process, a number of top ranked images are selected and re-ranked according to their texture features. Because the texture features are extracted globally from the image, they are not an accurate description
Y / Yn 0.008856
where
and (Xn, Yn, Zn) is the reference white in XYZ space. There are usually millions of colors in the color space, in order to build the color histogram from an image, the color space is normally quantized into a number of color bins with each bin represents a number of neighboring colors. However, in the L*u*v* space, representative colors are used instead of
quantizing each color channel by a constant step. The number of representative colors is given by the combinations (512) of the following three components in L*u*v* space [10]. These representative colors are uniformly distributed in L*u*v* space. L component u component v component 6.25 -110.875 -123.625 18.75 -66.625 -90.875 31.25 -22.375 -58.125 43.75 21.875 -25.375 56.25 66.125 7.375 68.75 110.375 40.125 81.25 154.625 72.875 93.75 198.875 105.625 In contrast to the conventional histogram building which assigns the color of each pixel to a single color bin, the PWH assigns the color of each pixel to 10 neighboring color bins based on the following weight: 1/ di wi = 1 / d1 + 1 / d 2 + ... + 1 / d10 where d = ( L L ) 2 + (u u ) 2 + (v v ) 2
i 0 i 0 i 0 i
where W is called the modulation frequency. After applying Gabor filters on the image with different orientation at different scale, an array of magnitudes is obtained:
E (m, n) = | G mn ( x, y ) |, m = 0, 1, , M - 1; n = 0, 1, , N - 1
x =0 y =0 P Q
and (L0, u0, v0) is the color of the pixel to be assigned, (Li, ui, vi) is the color of bin i. The use of PWH overcomes the drawback of conventional histogram methods which would in many situations assign a pixel color to a bin of a quite different color, e.g., assign a pixel color (6.4, 21, 40) to bin (18.75, 21.875, 40.125). The PWH also overcomes the drawback of having to assign two quite different colors to a same color bin, e.g., assigning pixel color (6.4, 21, 40) and (30, 21, 40) to bin (18.75, 21.875, 40.125) in conventional histogram building. As the result, PWH is much more accurate in representing the image than conventional histograms.
These magnitudes represent the energy content at different scale and orientation of the image. The main purpose of texture-based retrieval is to find images or regions with similar texture. It is assumed that we are interested in images or regions that have homogenous texture, therefore the mean mn and standard deviation mn of the magnitude of the transformed coefficients are used to represent the homogenous texture feature of the region. A feature vector f (texture representation) is created using mn and mn as the feature components [11]. Five scales and six orientations are used in common implementation and the Gabor texture feature vector is thus given by: f = (00 , 00 , 01 , 01 , , 45, 45). The similarity between two texture patterns is measured by the Euclidean distance between their Gabor feature vectors. To make the above extracted texture features robust to image rotation, a simple circular shift on the feature map is used [12].
* where, K is the filter mask size, and g mn is the complex conjugate of gmn which is a class of self-similar functions
situations. Solution to this problem can be segmenting image into regions and extracting texture features from the regions. However, segmentation is a complex problem which has no desirable result at this moment. Therefore, we let the user to decide if texture features is needed. If the color retrieval result is not satisfactory, the user then selects the retrieval using combined features. In the combined retrieval, the images in the database are first ranked using color features according to the distance between the query and the target images. Then a number of the top ranked images are selected and they are then re-ranked using their texture features.
provides users with only one alternative, therefore, the overall performance may not be improved in those approaches. From the precision-recall chart of Figure 2, it can be observed that the retrieval precision using color features has a sharp drop from 40% of recalls. However, retrieval precision using combined features is very robust and has a smooth drop of performance. Overall, the retrieval performance of combined features outperforms that of color features significantly.
6. Conclusions
In this paper, an image retrieval method using both color features and combined features has been proposed. Experiment results show, our method outperforms the common color feature retrieval significantly. We have used the PWH color features to improve conventional histogram features. A robust texture feature which is suitable for image retrieval has also been presented in the paper. In contrast to most conventional combined approaches which may not give better performance than individual features, our approach provides users with two alternatives, i.e., retrieval using color features only and retrieval using combined features. Since we give the users control to select the type of retrieval, the improvement of retrieval performance is guaranteed. In the future, we plan to segment image automatically into homogenous texture regions using split and merging technique. By using regional features instead of global features, we will be able to improve retrieval performance of combined features further. We also plan to use more queries to test the retrieval performance.
References:
[1] G. Lu, Multimedia Database Management Systems, Artech House, 1999. [2] J. M. Martnez, MPEG-7 Overview (version 7), ISO/IEC JTC1/SC29/WG11 N4674, Jeju, March 2002. [3] W. Niblack et al. The QBIC Project: Querying Images By Content Using Color, Texture and Shape. SPIE Conf. On Storage and Retrieval for Image and Video Databases, Vol.1908, San Jose, CA, pp.173-187, 1993. [4] M. Swain and D. Ballard, Color Indexing, International Journal of Computer Vision, 7(1):11-32, 1991. [5] J. R. Simith and S. F. Chang, Querying by Color Regions Using the VisualSEEK Content-Based Visual Query System, In M. T. Maybury, ed., Intelligent Multimedia Information Retrieval, AAAI/MIT Press, 1996. [6] T. S. Chua, K. L. Tan and B. C. Ooi, Fast Signature-based Color-spatial Image Retrieval, In Proc. of IEEE
International Conf. on Multimedia Computing and Systems, pp362-369, Ontario, Canada, 1997. [7] W. Hsu, T. S. Chua and H. K. Pung, Integrated Color-spatial Approach to Content-based Image Retrieval, In Proc. of ACM Multimedia, pp305-313, 1995. [8] G. Pass, R. Zabih and J. Miller, Comparing Images Using Color Coherence Vectors, In Proc. of the 4th ACM Multimedia Conference, pp65-73, 1996. [9] G. Lu and J. Phillips, Using Perceptually Weighted Histograms for Color-Based Image Retrieval, In Proc. of International Conference on Signal Processing, pp11501153, Beijing, China, 1998. [10] B. Y. Chua, Color-based Image Retrieval Incorporating Relevance Feedback, Honors Thesis, Monash University, 2002. [11] B. S. Manjunath and W. Y. Ma, Texture Features for Browsing and Retrieval of Large Image Data, IEEE Trans. PAMI-18 (8):837-842, August, 1996. [12] D. S. Zhang and G. Lu, Content-based Image Retrieval Using Gabor Texture Features, In Proc. of First IEEE Pacific-Rim Conference on Multimedia (PCM'00), pp.392395, Sydney, Australia, December 13-15, 2000.
[13] B. S. Manjunath et al, Color and Texture Descriptors, IEEE Transactions on Circuits and Systems for Video Technology, 11(6):703-715, 2001.
Precision-Recall Chart 100 90 80 70
Precision Color-Texture Color
60 50 40 30 20 10 0 0 20 40
Recall
60
80
100
R1-ct
R1-c
R2-ct
R2-c
R3-ct
R3-c
R4-ct
R4-c
R5-ct
R5-c R6-ct R6-c Figure 3. Sample screen shots of retrievals using color and combined features.