TY - GEN
T1 - Scene image classification
T2 - 2012 IEEE International Conference on Systems, Man, and Cybernetics, SMC 2012
AU - Banerji, Sugata
AU - Sinha, Atreyee
AU - Liu, Chengjun
N1 - Copyright:
Copyright 2013 Elsevier B.V., All rights reserved.
PY - 2012
Y1 - 2012
N2 - This paper introduces several novel color, shape and texture-based image descriptors for scene image classification with applications to image search and retrieval. Specifically, first, a new 3-Dimensional Local Binary Pattern (3DLBP) descriptor is proposed for color image local feature extraction. Second, a new shape descriptor (HaarHOG) is introduced by combining Haar wavelet transformation and Histogram of Oriented Gradients (HOG). Third, these descriptors are fused using an optimal feature representation technique to generate a robust 3-Dimensional LBP-HaarHOG (3DLH) descriptor that can perform well on different scene image categories. Finally, the Enhanced Fisher Model (EFM) is applied for discriminatory feature extraction and the nearest neighbor classification rule is used for image classification. The proposed descriptors and fusion technique are evaluated using three grand challenge datasets: the MIT Scene dataset, the UIUC Sports Event dataset, and a part of the Caltech 256 dataset.
AB - This paper introduces several novel color, shape and texture-based image descriptors for scene image classification with applications to image search and retrieval. Specifically, first, a new 3-Dimensional Local Binary Pattern (3DLBP) descriptor is proposed for color image local feature extraction. Second, a new shape descriptor (HaarHOG) is introduced by combining Haar wavelet transformation and Histogram of Oriented Gradients (HOG). Third, these descriptors are fused using an optimal feature representation technique to generate a robust 3-Dimensional LBP-HaarHOG (3DLH) descriptor that can perform well on different scene image categories. Finally, the Enhanced Fisher Model (EFM) is applied for discriminatory feature extraction and the nearest neighbor classification rule is used for image classification. The proposed descriptors and fusion technique are evaluated using three grand challenge datasets: the MIT Scene dataset, the UIUC Sports Event dataset, and a part of the Caltech 256 dataset.
KW - Enhanced Fisher Model (EFM)
KW - The HaarHOG descriptor
KW - image search
KW - scene classification
KW - the 3-Dimensional Local Binary Pattern (3D-LBP) descriptor
KW - the 3DLH descriptor
UR - http://www.scopus.com/inward/record.url?scp=84872410750&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84872410750&partnerID=8YFLogxK
U2 - 10.1109/ICSMC.2012.6378083
DO - 10.1109/ICSMC.2012.6378083
M3 - Conference contribution
AN - SCOPUS:84872410750
SN - 9781467317146
T3 - Conference Proceedings - IEEE International Conference on Systems, Man and Cybernetics
SP - 2294
EP - 2299
BT - Proceedings 2012 IEEE International Conference on Systems, Man, and Cybernetics, SMC 2012
Y2 - 14 October 2012 through 17 October 2012
ER -