Part-based annotation-free fine-grained classification of images of retail products
Article Type
Research Article
Publication Title
Pattern Recognition
Abstract
We propose a novel solution for classifying very similar images (fine-grained classification) of variants of retail products displayed on supermarket racks. The proposed scheme simultaneously captures object-level and part-level cues of the product images. The object-level cues are captured with our novel reconstruction-classification network (RC-Net). For annotation-free modeling of part-level cues, the discriminatory parts of the product images are identified around keypoints. The ordered sequences of these discriminatory parts, encoded using a convolutional LSTM, describe the products uniquely. Finally, the part-level and object-level models jointly determine the products, explicitly explaining them through coarse to finer descriptions. This bi-level architecture is embedded in R-CNN for recognizing variants of retail products on the rack. We perform extensive experiments on one in-house and three benchmark datasets. The proposed scheme outperforms competing methods in almost all the evaluations.
DOI
10.1016/j.patcog.2021.108257
Publication Date
1-1-2022
Recommended Citation
Santra, Bikash; Shaw, Avishek Kumar; and Mukherjee, Dipti Prasad, "Part-based annotation-free fine-grained classification of images of retail products" (2022). Journal Articles. 3415.
https://digitalcommons.isical.ac.in/journal-articles/3415