Image retrieval systems are essential tools for dealing with large volumes of visual data and require effective descriptors to produce reasonable results. Zernike Moments (ZMs) have often been used in the literature for object representation and retrieval. We note that most existing works tested ZMs on datasets of single objects with tight bounding boxes. We investigate the suitability of ZMs for general image representation and retrieval. We show through theoretical analysis and experimental study that ZMs are not suitable for representation and retrieval of general images, which may contain multiple objects of different sizes, at different locations and with different orientations and backgrounds.

image retrieval, Zernike moments, multimedia retrieval, image analysis

0 Introduction





1 Zernike moments and suitability for image retrieval

ZMs are a set of continuous image moments defined on a set of complex polynomials, denoted by {Vn,m}, that form a complete orthogonal set over the unit disk x² + y² ≤ 1 in polar coordinates. The Zernike polynomial is defined as



where n is a positive integer or zero; m represents positive or negative integers and is subject to the constraints that n-|m| is even and |m| ≤ n. ρ is the length of the vector from the origin to pixel (x, y). θ represents the angle between the vector ρ and the x-axis in the counterclockwise direction. For digital images, ZMs are calculated using the following equation
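For reference, the definitions most commonly given in the ZM literature can be written as follows. The symbols R_{n,m} (radial polynomial), Z_{n,m} (moment of order n with repetition m) and the asterisk for complex conjugation are notation assumed here; the normalization constant used in this work may differ.

```latex
\begin{align}
V_{n,m}(\rho,\theta) &= R_{n,m}(\rho)\, e^{jm\theta}, \\
R_{n,m}(\rho) &= \sum_{s=0}^{(n-|m|)/2}
  \frac{(-1)^{s}\,(n-s)!}
       {s!\,\left(\frac{n+|m|}{2}-s\right)!\,\left(\frac{n-|m|}{2}-s\right)!}\;
  \rho^{\,n-2s}, \\
Z_{n,m} &= \frac{n+1}{\pi} \sum_{x}\sum_{y} f(x,y)\, V^{*}_{n,m}(\rho,\theta),
  \qquad x^{2}+y^{2}\le 1 .
\end{align}
```

Because a rotation of the image only multiplies Z_{n,m} by a phase factor, the magnitudes |Z_{n,m}| are the quantities usually taken as rotation-invariant features.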


As ZMs are only rotation invariant, image normalization should be applied to achieve translation and scale invariance, using the following equation


where xc, yc are the coordinates of the centroid (center of mass) of the image, calculated using the following equations
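The centroid is usually obtained from the zeroth- and first-order geometric moments, xc = m10/m00 and yc = m01/m00. The sketch below assumes that standard definition and uses two small hypothetical test images (not taken from the paper's data) to show how adding a single extra pixel already moves the centroid.

```python
import numpy as np

def centroid(image):
    """Return (xc, yc), the centre of mass of a 2-D intensity array.

    Assumes the usual moment definition: xc = m10 / m00, yc = m01 / m00.
    """
    img = np.asarray(image, dtype=float)
    m00 = img.sum()
    if m00 == 0:
        raise ValueError("image has no mass")
    ys, xs = np.indices(img.shape)      # row (y) and column (x) index grids
    xc = (xs * img).sum() / m00
    yc = (ys * img).sum() / m00
    return xc, yc

# Hypothetical 9x9 binary image with a 3x3 square in the middle.
single = np.zeros((9, 9))
single[3:6, 3:6] = 1

# The same image with one extra pixel in the top-right corner.
double = single.copy()
double[0, 8] = 1

print("single object:", centroid(single))
print("extra pixel:  ", centroid(double))
```

Since the unit disk is centered on this point, any shift of the centroid changes ρ and θ for every pixel of the original object.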



Fig.1 Unit circle of image for ZMs calculation




(2) Images containing objects of different sizes (Fig.2(c)): f(x, y) is different from the original image (Fig.2(a)).




Visually, the six images in Fig.2 are similar. So ideally, using any of the six images as a query, the other five images should be retrieved and ranked highly (as similar). But due to the differences in the centroid locations and/or f(x, y) as listed above, the ZMs of the six images are very different, leading to large distances among the six images and low image retrieval accuracy.
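The effect can be illustrated with a minimal numerical sketch. It is not the implementation used in this work: the unit disk is assumed to be centered on the centroid with a radius that just encloses all non-zero pixels, magnitudes are divided by the image mass m00, and the test images are hypothetical synthetic shapes.

```python
import numpy as np
from math import factorial

def radial_poly(n, m, rho):
    """Zernike radial polynomial R_{n,m}(rho), for n - |m| even and |m| <= n."""
    m = abs(m)
    out = np.zeros_like(rho)
    for s in range((n - m) // 2 + 1):
        coeff = ((-1) ** s * factorial(n - s)
                 / (factorial(s)
                    * factorial((n + m) // 2 - s)
                    * factorial((n - m) // 2 - s)))
        out += coeff * rho ** (n - 2 * s)
    return out

def zm_magnitudes(image, order=10):
    """Magnitudes |Z_{n,m}| for m >= 0 up to the given order (36 values for order 10)."""
    img = np.asarray(image, dtype=float)
    ys, xs = np.nonzero(img)
    f = img[ys, xs]
    m00 = f.sum()
    # Centre the coordinate system on the centre of mass.
    xc, yc = (xs * f).sum() / m00, (ys * f).sum() / m00
    dx, dy = xs - xc, ys - yc
    r = np.hypot(dx, dy)
    radius = r.max() or 1.0             # assumption: disk encloses all non-zero pixels
    rho, theta = r / radius, np.arctan2(dy, dx)
    feats = []
    for n in range(order + 1):
        for m in range(n % 2, n + 1, 2):            # keeps n - m even
            basis = radial_poly(n, m, rho) * np.exp(-1j * m * theta)
            z = (n + 1) / np.pi * (f * basis).sum()
            feats.append(abs(z) / m00)              # assumption: normalise by mass
    return np.array(feats)

# Hypothetical asymmetric object on a 64x64 canvas.
base = np.zeros((64, 64))
base[24:40, 20:44] = 1
base[16:24, 20:28] = 1

rotated = np.rot90(base)                # same object, rotated by 90 degrees
extra = base.copy()
extra[4:10, 52:58] = 1                  # same object plus a small second object

q = zm_magnitudes(base)
print("distance to rotated copy:    ", np.linalg.norm(q - zm_magnitudes(rotated)))
print("distance to two-object image:", np.linalg.norm(q - zm_magnitudes(extra)))
```

Under these assumptions the rotated copy stays at essentially zero distance from the query, while the added small object moves the centroid, enlarges the enclosing disk and produces a clearly non-zero distance.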

Fig.2 Different situations in real image retrieval application

We will use experimental examples to show the impact of the above five situations on retrieval accuracy.

2 Experimental study

In the literature, several studies[7-10,12-13] have exploited ZMs for image retrieval; however, their proposed methods are evaluated on databases in which each image contains only a single object. The object is generally located at the center of the image and is tightly bounded. The MPEG 7 CE shape database[7-10,13], the COIL100 database and the Kimia database[8-9] are frequently used as benchmarks in such research. To investigate the suitability of ZMs for retrieving general images from a database, we have selected the MPEG 7 CE shape database. This database contains 1400 images that are grouped into 70 classes. ZMs of order 10 are used in this work, as this order is recognized as sufficient for image representation by [8-9,13]. The performance is measured using precision and recall[18], as these are among the most commonly used measures in image retrieval research.
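As a brief illustration of the measure, precision and recall can be computed at each rank position of a retrieved list. The function name, the toy ranking and the choice to count all 20 images of a class as relevant (1400 images / 70 classes) are assumptions made for this sketch.

```python
def precision_recall(ranked_labels, query_label, n_relevant):
    """Return a list of (precision, recall) pairs, one per rank position.

    ranked_labels: class labels of the retrieved images, best match first.
    n_relevant:    number of database images in the query's class.
    """
    hits = 0
    curve = []
    for k, label in enumerate(ranked_labels, start=1):
        if label == query_label:
            hits += 1
        curve.append((hits / k, hits / n_relevant))
    return curve

# Hypothetical top-5 ranking for an "apple" query.
ranked = ["apple", "apple", "deer", "apple", "device4"]
curve = precision_recall(ranked, "apple", n_relevant=20)
for k, (p, r) in enumerate(curve, start=1):
    print(f"rank {k}: precision={p:.2f} recall={r:.2f}")
```

A visually similar image that is pushed down the ranking by images of other classes lowers precision at every position above it, which is the effect examined in the experiments of this section.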

The limitations of ZMs discussed in Section 1 are studied through the following experiments. Three classes of images are selected from the MPEG 7 database, namely Apple, Deer and Device 4. Fig.3, Fig.4 and Fig.5 show the retrieval results for the selected classes using ZMs of order 10 on the original data set. The left-most image in the first row is the query image and the rest are the fifteen highest ranked retrieved images.
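The ranking step can be sketched as follows. The distance measure is not stated in the text, so Euclidean distance between feature vectors is assumed here, and the function name and toy vectors are hypothetical.

```python
import numpy as np

def rank_by_distance(query_vec, db_vecs, top_k=15):
    """Return indices and distances of the top_k database vectors closest to the query.

    Assumes Euclidean distance; smaller distance means higher rank.
    """
    db = np.asarray(db_vecs, dtype=float)
    q = np.asarray(query_vec, dtype=float)
    dist = np.linalg.norm(db - q, axis=1)
    order = np.argsort(dist, kind="stable")[:top_k]
    return order, dist[order]

# Toy 2-D feature vectors standing in for ZM feature vectors.
order, dist = rank_by_distance([0, 0], [[0, 0], [5, 5], [1, 0], [0, 3]], top_k=3)
print(order, dist)
```

With a real database each row of db_vecs would hold the ZM feature vector of one image, and top_k=15 would give the fifteen images shown next to the query.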

To analyze the limitations of ZMs discussed in the previous section, the five highest ranked retrieved images in Fig.3, Fig.4 and Fig.5 are modified to cover the following situations: images containing multiple objects, and objects of different sizes, at different locations, with different orientations and on different backgrounds. Fig.6 to 10 present these modified images, which are visually similar to the five highest ranked retrieved images in Fig.3 to 5. The retrieval results after adding these modified, visually similar images to the database are presented to highlight their effect.

Fig.3 Retrieval result of apple class using ZMs order 10

Fig.4 Retrieval result of deer class using ZMs order 10

Fig.5 Retrieval result of device 4 class using ZMs order 10

Fig.6 Visually similar images to five highest ranked retrieved images with multiple objects

Fig.7 Visually similar images to five highest ranked retrieved images with different sizes

Fig.8 Visually similar images to five highest ranked retrieved images with objects of different sizes at different locations

Fig.9 Visually similar images to five highest ranked retrieved images with objects of different sizes and orientations

Fig.10 Visually similar images to five highest ranked retrieved images with different backgrounds

2.1 Images with multiple objects

In this experiment, we added the modified images in Fig.6 to the test data set, and used the three queries (Apple, Deer and Device 4) to retrieve similar images. Fig.11 to 13 show the ranked retrieval results. Although the images in Fig.6 are visually similar to the respective queries, they are ranked lower than images from other classes, except for one of the deer images, which is ranked 11. This result shows that adding a small object to the original object will affect the ZM features and reduce retrieval accuracy.

Fig.11 Retrieval results of apple class

Fig.12 Retrieval results of deer class

Fig.13 Retrieval results of device 4 class

2.2 Images with objects of different sizes

In this experiment, we added the modified images in Fig.7 to the test data set, and used the three queries (Apple, Deer and Device 4) to retrieve similar images. Fig.14 to 16 show the ranked retrieval results. Although the images in Fig.7 are visually similar to the respective queries, they are ranked lower than images from other classes. This result shows that objects of different sizes will affect the ZM features and reduce retrieval accuracy.

Fig.14 Retrieval results of apple class

Fig.15 Retrieval results of deer class

Fig.16 Retrieval results of device 4 class

2.3 Images with objects at different locations

In this experiment, we added modified images in Fig.8 to the test data set, and used the three queries (Apple, Deer and Device 4) to retrieve similar images. Fig.17 to 19 show the ranked retrieval results. Although the images in Fig.8 are visually similar to the respective queries, they are ranked lower than images from other classes. This result shows that objects of different sizes at different locations will affect the ZM features and reduce retrieval accuracy.

Fig.17 Retrieval results of apple class

Fig.18 Retrieval results of deer class

Fig.19 Retrieval results of device 4 class

2.4 Images with objects of different sizes and orientations

In this experiment, we added the modified images in Fig.9 to the test data set, and used the three queries (Apple, Deer and Device 4) to retrieve similar images. Fig.20 to 22 show the ranked retrieval results. Although the images in Fig.9 are visually similar to the respective queries, they are ranked lower than images from other classes, except for one of the deer images, which is ranked 12. This result shows that objects of different sizes and orientations will affect the ZM features and reduce retrieval accuracy.

Fig.20 Retrieval results of apple class

Fig.21 Retrieval results of deer class

Fig.22 Retrieval results of device 4 class

2.5 Images with different backgrounds

In this experiment, we added the modified images in Fig.10 to the test data set, and used the three queries (Apple, Deer and Device 4) to retrieve similar images. Fig.23 to 25 show the ranked retrieval results. Although the images in Fig.10 are visually similar to the respective queries, they are ranked lower than images from other classes, except for one of the apple images, which is ranked 11. This result shows that placing an object on a different background will affect the ZM features and reduce retrieval accuracy.

Fig.23 Retrieval results of apple class

Fig.24 Retrieval results of deer class

Fig.25 Retrieval results of device 4 class

The experimental study shows that the retrieval results for the selected classes are affected when images contain multiple objects, or objects of different sizes, at different locations, with different orientations or on different backgrounds. The modified, visually similar images in Fig.6 to 10 that are added to the database are mostly absent from the top 15 retrieved images and are ranked lower than images from other classes.

2.6 Retrieval performance on original and modified databases

Fig.26 shows the retrieval results of the three queries (Apple, Deer and Device 4) using the original database and the modified databases used in Sections 2.2 to 2.5. As the precision and recall graph shows, the retrieval accuracy is considerably reduced for the modified databases. This is the result of the modifications discussed in Section 1 (images with multiple objects, and objects of different sizes, at different locations, with different orientations and backgrounds), which change the center of mass of the images and consequently the ZM feature vectors.

Fig.26 Retrieval results of three queries (Apple, Deer and Device 4) against original database and modified databases

3 Conclusion

ZMs have been used for image retrieval; however, they have been evaluated on databases in which each image contains a single, tightly bounded object at the center of the image. ZMs are calculated within a unit circle centered at the center of mass of the image; hence any change in the image results in a change in the center of mass and a different ZM feature vector, which affects the retrieval results.

As general images might have different objects with different sizes, locations, orientations and backgrounds, any of these situations changes the center of mass of the image and/or its pixel values. This work studied how this calculation of ZMs affects retrieval performance, and our experimental results have shown that the results are considerably affected in the presence of any of the aforementioned situations. Therefore ZMs are not suitable for general image retrieval, unless robust image segmentation is available as a preprocessing step. Unfortunately, to our knowledge, no automatic segmentation technique currently exists that is generally robust in segmenting objects from general images in the manner required for ZMs to work effectively.


