RepConv: A novel architecture for image scene classification on Intel scenes dataset

Document Type: Original Article

Authors

1 39 Alzohour St, Cairo, Egypt

2 Information Systems, Faculty of Computer and Information Sciences, Ain Shams University

3 Department of Information Systems, Faculty of Computer and Information Sciences, Ain Shams University, Cairo, 11566, Egypt

Abstract

Image understanding and scene classification are keystone tasks in computer vision. Advances in technology and the abundance of available datasets for image classification and recognition provide ample opportunity for progress. In scene classification, transfer learning is a commonly used branch of machine learning. Despite the strong performance of existing machine learning models in image interpretation and scene classification, challenges remain: the existing models and their pretrained weights are not suitable in most circumstances. Instead of relying on the weights of data-dependent models, this work presents a novel machine learning model for the scene classification task that converges rapidly. The proposed model, RepConv, is evaluated on the Intel scenes dataset. It outperformed four existing benchmark models while using a small number of training epochs and trainable parameters, achieving accuracies of 93.55 ± 0.11% on training data and 75.54 ± 0.14% on validation data. Furthermore, the dataset is re-categorized to define a new binary classification problem (natural scenes; real scenes) that has not previously been reported in the literature. On this binary problem, the proposed model achieved an accuracy of 98.08 ± 0.05% on training data and 92.70 ± 0.08% on validation data, a result not previously reported in any other publication.

Keywords