Pre-trained CNNs: Evaluating Emergency Vehicle Image Classification
DOI:
https://doi.org/10.56294/dm2023153Keywords:
: Pre-trained CNN Models, Image Classification, Computer Vision, Emergency VehiclesAbstract
In this paper, we aim to provide a comprehensive analysis of image classification, specifically in the context of emergency vehicle classification. We have conducted an in-depth investigation, exploring the effectiveness of six pre-trained Convolutional Neural Network (CNN) models. These models, namely VGG19, VGG16, MobileNetV3Large, MobileNetV3Small, MobileNetV2, and MobileNetV1, have been thoroughly examined and evaluated within the domain of emergency vehicle classification. The research methodology utilized in this study is carefully designed with a systematic approach. It includes the thorough preparation of datasets, deliberate modifications to the model architecture, careful selection of layer operations, and fine-tuning of the model compilation. To gain a comprehensive understanding of the performance, we conducted a detailed series of experiments. We analyzed nuanced performance metrics such as accuracy, loss, and training time, considering important factors in the evaluation process. The results obtained from this study provide a comprehensive understanding of the advantages and disadvantages of each model. Moreover, they emphasize the crucial significance of carefully choosing a suitable pre-trained Convolutional Neural Network (CNN) model for image classification tasks. Essentially, this article provides a comprehensive overview of image classification, highlighting the crucial significance of pre-trained CNN models in achieving precise outcomes, especially in the demanding field of emergency vehicle classification
References
1. Zegedy, C., Vanhoucke, V., Ioffe, S., Shlens, J., & Wojna, Z. (2016). Rethinking the Inception Architecture for Computer Vision. In CVPR.
2. Taye, M.M.: Theoretical understanding of convolutional neural network: concepts, architectures, applications, future directions. Computation (Basel). 11, 52 (2023). https://doi.org/10.3390/computation11030052.
3. Fan, J., Lee, J., Lee, Y.: A transfer learning architecture based on a support vector machine for histopathology image classification. Applied Sciences. 11, 6380 (2021). https://doi.org/10.3390/app11146380.
4. Sharma, N., Jain, V., Mishra, A.: An Analysis Of Convolutional Neural Networks For Image Classification. Procedia Computer Science. 132, 377–384 (2018). https://doi.org/10.1016/j.procs.2018.05.198
5. Goyal, P., Dollár, P., Girshick, R., Noordhuis, P., Wesolowski, L., Kyrola, A., ... & Zheng, A.: Accurate, Large Minibatch SGD: Training ImageNet in 1 Hour. (2017).
6. Yang, H., Zhang, Y., Chao, Y., Ding, W.: Ultra-lightweight CNN design based on neural architecture search and knowledge distillation: A novel method to build the automatic recognition model of space target ISAR images. Defence Technology. 18, 1073–1095 (2022). https://doi.org/10.1016/j.dt.2021.04.014.
7. Fan, J., Lee, J., Lee, Y.: A transfer learning architecture based on a support vector machine for histopathology image classification. Applied Sciences. 11, 6380 (2021). https://doi.org/10.3390/app11146380.
8. Neelam Jaikishore, C., Podaturpet Arunkumar, G., Jagannathan Srinath, A., Vamsi, H., Srinivasan, K., Ramesh, R.K., Jayaraman, K., Ramachandran, P.: "Implementation of Deep Learning Algorithm on a Custom Dataset for Advanced Driver Assistance Systems Applications" (2022).
9. Li, Z., Liu, F., Yang, W., Peng, S., Zhou, J.: A Survey of Convolutional Neural Networks: Analysis, applications, and Prospects. IEEE Transactions on Neural Networks and Learning Systems. 33, 6999–7019 (2022). DOI.
10. Esteva, A., Kuprel, B., Novoa, R. A., Ko, J., Swetter, S. M., Blau, H. M., & Thrun, S. (2017). Dermatologist-level classification of skin cancer with deep neural networks. In Nature.
11. Gonzalez-Argote J. A Bibliometric Analysis of the Studies in Modeling and Simulation: Insights from Scopus. Gamification and Augmented Reality 2023;1:5–5. https://doi.org/10.56294/gr20235.
12. Krizhevsky, A., Sutskever, I., & Hinton, G. E. (2012). ImageNet Classification with Deep Convolutional Neural Networks. In NIPS.
13. Omari Alaoui A, El Bahi O, Oumoulylte M, El Youssefi A, Farhaoui Y, El Allaoui A (2023) Optimizing Emergency Vehicle Detection for Safer and Smoother Passages. In: Proceedings of the 6th International Conference on Networking, Intelligent Systems & Security. Association for Computing Machinery, Larache, Morocco. https://doi.org/10.1145/3607720.3607728.
14. S. Kaushik, A. Raman and K. V. S. Rajeswara Rao: "Leveraging Computer Vision for Emergency Vehicle Detection-Implementation and Analysis" (2020).
15. S. Roy and M. S. Rahman: Emergency Vehicle Detection on Heavy Traffic Road from CCTV Footage Using Deep Convolutional Neural Network", International Conference on Electrical, Computer, and Communication Engineering (ECCE), Cox'sBazar, Bangladesh, (2019).
16. Hasan S, Rabbi G, Islam R, Imam Bijoy H, Hakim A (2022) Bangla Font Recognition using Transfer Learning Method. In: 2022 International Conference on Inventive Computation Technologies (ICICT). pp 57–62. DOI: 10.1109/ICICT54344.2022.9850765
17. Simonyan K, Zisserman A (2015) Very Deep Convolutional Networks for Large-Scale Image Recognition. Doi : 10.48550/arXiv.1409.1556
18. Romero-Carazas R. Prompt lawyer: a challenge in the face of the integration of artificial intelligence and law. Gamification and Augmented Reality 2023;1:7–7. https://doi.org/10.56294/gr20237.
19. Qian S, Ning C, Hu Y (2021) MobileNetV3 for Image Classification. In: 2021 IEEE 2nd International Conference on Big Data, Artificial Intelligence and Internet of Things Engineering (ICBAIE). pp 490–497. doi :10.1109/ICBAIE52039.2021.9389905
20. Sandler, M., Howard, A.W., Zhu, M., Zhmoginov, A., Chen, L.-C.: MobileNetV2: Inverted Residuals and Linear Bottlenecks. In Proceedings of the IEEE conference on computer vision and pattern recognition (CVPR), pp. 4510-4520. (2018).
21. Howard, A.G. Zhu, M., Chen, B., Kalenichenko, D., ... & Adam, H.: MobileNets: Efficient Convolutional Neural Networks for Mobile Vision Applications. (2017).
22. Bai, Y.: RELU-Function and Derived Function review. SHS Web of Conferences. 144, 02006 (2022). https://doi.org/10.1051/shsconf/202214402006.S
23. Pearce, T.: Understanding softmax confidence and uncertainty, https://arxiv.org/abs/2106.04972.
24. Russakovsky, O., Deng, J., Su, H., ... & Fei-Fei, L.: ImageNet Large Scale Visual Recognition Challenge. International Journal of Computer Vision, 115(3), 211-252. (2015).
25. xml.etree.ElementTree — The ElementTree XML API — Python 2.7.18 documentation, https://docs.python.org/2/library/xml.etree.elementtree.html.
26. Gonzalez-Argote D, Gonzalez-Argote J, Machuca-Contreras F. Blockchain in the health sector: a systematic literature review of success cases. Gamification and Augmented Reality 2023;1:6–6. https://doi.org/10.56294/gr20236.
27. Auza-Santiváñez JC, Díaz JAC, Cruz OAV, Robles-Nina SM, Escalante CS, Huanca BA. mHealth in health systems: barriers to implementation. Health Leadership and Quality of Life 2022;1:7-7. https://doi.org/10.56294/hl20227.
Published
Issue
Section
License
Copyright (c) 2023 Ali Omari Alaoui, Omaima El Bahi , Mohamed Rida Fethi, Othmane Farhaoui, Ahmad El Allaoui, Yousef Farhaoui (Author)
This work is licensed under a Creative Commons Attribution 4.0 International License.
The article is distributed under the Creative Commons Attribution 4.0 License. Unless otherwise stated, associated published material is distributed under the same licence.