New Approach on the Techniques of Content-Based Image Retrieval (CBIR) Using Color, Texture and Shape Features ()
1. Introduction
With the widespread use of computer network, collection of information data such as image has quickly grown and continues to increase in the future. Due to rapidly grown of internet, every image information and data was collected and digitized. Every internet user was able to access it because all that image information and data available in digital image format.
In order to access this huge amount of data, efficient technique and method is needed for querying the indexed image database. In order to find and locate the needed information from these large databases, it is difficult and surely will take a lot of time. So, a method called content-based image retrieval is the efficient retrieval method to find relevant and needed images. Content Based Image Retrieval (CBIR) system was introduced in 1990 [1]. It is basically a techniques used for automatic retrieval of images in a large database that perfectly matches the query image.
Basically, in content based image retrieval system, it consists of two kinds of content based image retrieval which are general image retrieval and application specific image retrieval. For the general image retrieval, the goal of the query is to obtain images with the same object as the query. Such CBIR imitates web search engines for images rather than for text. For application specific, the purpose tries to match a query image to a collection of images of a specific type such as fingerprints image and x-ray. The need for efficient finding, searching and retrieval of digital image has been rapidly increase in many image processing application, viz, medicine, commerce, crime prevention, military, education, culture, and entertainment [2].
2. CBIR Architectural Design
Basically a general CBIR system architecture consists of six functional blocks which is image database (the block that contains image of the chosen database), feature database (can be characterized by a set of features), feature extraction (features may be text-based features and visual features), query image (can be any of the database image), image matching and indexing (the features of query image are compared with features that are already stored in the feature database), and retrieved image (the system that will select any number of image having the highest overall similarity to the given query image and present to the user as retrieved images[2]. Figure 1 shows the general architecture of CBIR system.
3. CBIR Techniques
Generally, in CBIR systems, while building an image database or retrieving an image from the database, feature vectors are the first extracted from images. The features can be color, texture, shape, and the combination of these features (hybrid features) [2]. These features are then used to build the feature database. When query image is given, its feature vectors are extracted and similarity matching
Figure 1. Content-based image retrieval system general architecture.
is performed with the feature vectors already present in feature database as illustrated in Figure 1 in previous section.
3.1. CBIR Using Color Features
Color is one of the important features that make possible the recognition of image by human. J. Yue et al. in 2011, state that the color is one of the most widely used low-level visual features and is invariant to image size and orientation [3]. According to B.S. Manjunath et al., color features are the most expressive of all the visual features and have been extensively used in the image retrieval system [4].
Swain and Ballard have proposed a color matching method based on color indexing in early 1990s. This method is also known as color histogram intersection method and it is based on matching of color histograms [2]. In 2011, G. H. Liu et al. state in his paper that, color histogram is invariant to orientation and scale and this makes it powerful in image classification. Subsequently, color histogram-based color descriptor has been broadly studied and generally utilized in CBIR systems for its simplicity and effectiveness [5].
3.2. CBIR Using Texture Features
Another important element in visual perception is “Texture”. Texture also can be used to separate regions of interest in an image. According to Savita et al. in 2015, texture is also one of the most used low level visual features that refer to innate surface properties of an object and their relationship to the surrounding environment and it also contains important information about the structural arrangement of surfaces and their relationship with the surroundings [6].
According to M. Kokare et al., texture can be defined as, “a region in image has a constant texture if a set of local statistics or other local properties of the picture are constant, gradually differing or approximately periodic” [7]. In 1998, Y. S. Kim and W. Y. Kim explored that the methodology for existing texture classification can be roughly divided into three categories which are Statistical method, Structural method, and Model-based method [8].
Some of the algorithm for CBIR based on this texture as feature has been proposed by several researchers. In 2000s, a rotational invariant texture features using Dual-Tree Complex Wavelet Transform (DTCWT) for CBIR has been proposed by P. R. Hill, D. R. Bull, and C. N. Canagarajah, [9]. In 2003, C. M. Pun has proposed an effective rotation-invariant polar-wavelet texture feature for image retrieval [10]. Other than that, in 2005 M. Kakore et al. proposed texture feature for image retrieval that supports shift invariant by using a set of two-dimensional Dual Tree Complex Wavelet Transform (DTCWT) and Dual Tree Rotated Complex Wavelet Filters (DTRCWF) jointly [11]. J. Han and K. K. Ma in 2007 has presented texture features for image retrieval using Gabor wavelet. In their proposal, mean and standard deviation values of transformed coefficients computed individually found to be rotation and scale invariant [12]. In 2009, a texture image retrieval methodology based on non-tensor product wavelet filter banks has been proposed by Z. He, X. You, and Y. Yuan [13].
3.3. CBIR Using Shape Features
In content-based image retrieval system, another important visual feature is shape. According to D. Zhang and G. Lu in 2004, shape is one of the basic features used to describe image content and it is also can be used to provide powerful information. Nevertheless, shape representation and description is a difficult task because when a 3-D real world object is projected onto a 2-D image plane, one dimension of object information is lost. As a result, the shape extracted from the image only partially represents the projected object. Further, shape is often corrupted with noise, defects, arbitrary distortion and occlusion [14].
According to Rao & Kumar in 2012, shape representation and description methods can be divided into two main categories which is boundary-based and region-based [2]. Savita Gandhani et al. in 2015 stated that the boundary-based technique will use only the contour or border of the object and completely ignores its interior, whereas region-based technique apply segmentation to divide an image into different regions/segments, by setting threshold values according to the desirable results [6].
According to Sharmin Siddique, the most successful representations for shape categories are Fourier Descriptor and Moment Invariants. She stated in her paper, the main idea of Fourier Descriptor is to use the Fourier transformed boundary as the shape feature, whereas Moment invariants is to use region-based moments, which are invariant to transformations as the shape feature [15].
3.4. Comparison Study for Color, Texture, Shape and Combined (Hybrid) Features
In content-based image retrieval system, some key parameters have been defining and evaluate based on precision, recall and the response time. The image retrieval precision rate and recall rate can be defined as this following formula:
Figure 2. Comparison study for color, texture, shape and combined (hybrid) features.
The N is stand for the number of images returned for the query, whereas R is images associated with the example in the result. Besides, M is stand for images associated with the example in test set S (saturation) [16].
Based on comparison study by D. Guoyong et al. in 2011, for the texture feature technique, the precision percentage is 60.1%, recall percentage score is 72.3%, and response time is 1372 millisecond. Other than that, for the color feature technique, the precision percentage score is 53.7%, recall percentage is 65.2%, and response time is 824 milliseconds. Besides that, for the shape feature technique, the precision percentage is 62.2%, recall percentage is 70.8%, and response time is 2544 millisecond. Last but not least, for the combined feature technique, the precision percentage is 79.6%, recall percentage is 88.3%, and response time is 3861 milliseconds [16] (Figure 2).
Based on the comparison study, for the first parameter which is precision percentage, combined feature is the highest. For second parameter, which is recall percentage, combined feature is the highest. Besides, for the third parameter which is response time, color feature is the fastest. Based on that comparison study, the combination or hybrid features technique which is combination of texture, color and shape strongly improved precision and recall, although it has longer response time.
4. Conclusion
Image features play an important function in retrieving suitable images from the large image database. Effective image retrieval from database requires exploration of suitable features such as color, text and shape of images in the database. Appropriately, many previous researchers have presented and proposed several image retrieval algorithms by exploring various features and transforms. In this paper, various CBIR techniques and algorithms are briefly described and discussed. Based on the literature study, researcher also need to find a good combination of image feature with its own similarity measure because the major requirement of an ideal CBIR system is to develop computational faster CBIR algorithm with improved average retrieval efficiency.