In this paper, we present a methodology to categorize camera captured documents into pre-defined logo classes. Unlike scanned documents, camera captured documents suffer from intensity variations, partial occlusions, cluttering, and large scale variations. Furthermore, the existence of non-uniform folds and the lack of document being flat make this task more challenging. We present the selection of robust local features and the corresponding parameters by comparisons among SIFT, SURF, MSER, Hessian-affine, and Harris-affine. We evaluate the system not only with respect to amount of space required to store the local features information but also with respect to categorization accuracy. Moreover, the system handles the identification of multiple logos on the document at the same time. Experimental results on a challenging set of real images demonstrate the efficiency of our approach.