TY - GEN
T1 - Matrix Factorization-based clustering of image features for bandwidth-constrained information retrieval
AU - Chakareski, Jacob
AU - Manohar, Immanuel
AU - Rane, Shantanu
N1 - Publisher Copyright:
© 2016 IEEE.
PY - 2016/9/22
Y1 - 2016/9/22
N2 - We consider the problem of accurately and efficiently querying a remote server to retrieve information about images captured by a mobile device. In addition to reduced transmission overhead and computational complexity, the retrieval protocol should be robust to variations in the image acquisition process, such as translation, rotation, scaling, and sensor-related differences. We propose to extract scale-invariant image features and then perform clustering to reduce the number of features needed for image matching. Principal Component Analysis (PCA) and Non-negative Matrix Factorization (NMF) are investigated as candidate clustering approaches. The image matching complexity at the database server is quadratic in the (small) number of clusters, not in the (very large) number of image features. We employ an image-dependent information content metric to approximate the model order, i.e., the number of clusters, needed for accurate matching, which is preferable to setting the model order using trial and error. We show how to combine the hypotheses provided by PCA and NMF factor loadings, thereby obtaining more accurate retrieval than using either approach alone. In experiments on a database of urban images, we obtain a top-1 retrieval accuracy of 89% and a top-3 accuracy of 92.5%.
AB - We consider the problem of accurately and efficiently querying a remote server to retrieve information about images captured by a mobile device. In addition to reduced transmission overhead and computational complexity, the retrieval protocol should be robust to variations in the image acquisition process, such as translation, rotation, scaling, and sensor-related differences. We propose to extract scale-invariant image features and then perform clustering to reduce the number of features needed for image matching. Principal Component Analysis (PCA) and Non-negative Matrix Factorization (NMF) are investigated as candidate clustering approaches. The image matching complexity at the database server is quadratic in the (small) number of clusters, not in the (very large) number of image features. We employ an image-dependent information content metric to approximate the model order, i.e., the number of clusters, needed for accurate matching, which is preferable to setting the model order using trial and error. We show how to combine the hypotheses provided by PCA and NMF factor loadings, thereby obtaining more accurate retrieval than using either approach alone. In experiments on a database of urban images, we obtain a top-1 retrieval accuracy of 89% and a top-3 accuracy of 92.5%.
KW - Clustering
KW - Information retrieval
KW - Non-negative matrix factorization
KW - Principal component analysis
UR - http://www.scopus.com/inward/record.url?scp=84992034683&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84992034683&partnerID=8YFLogxK
U2 - 10.1109/ICMEW.2016.7574695
DO - 10.1109/ICMEW.2016.7574695
M3 - Conference contribution
AN - SCOPUS:84992034683
T3 - 2016 IEEE International Conference on Multimedia and Expo Workshop, ICMEW 2016
BT - 2016 IEEE International Conference on Multimedia and Expo Workshop, ICMEW 2016
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 2016 IEEE International Conference on Multimedia and Expo Workshop, ICMEW 2016
Y2 - 11 July 2016 through 15 July 2016
ER -