An Innovative Framework for Intelligent Computer Vision Empowered by Deep Learning

Authors

DOI:

https://doi.org/10.15359/ru.39-1.13

Keywords:

Computer vision, Super pixel, Deep learning, Object detection, Image Classification, Object recognition, Neural networks

Abstract

[Objective] The field of computer vision has seen remarkable progress, largely due to the advancements in deep learning. These developments have revolutionized image recognition, interpretation, and application across numerous domains. This paper introduces a new framework designed to expand the potential of computer vision systems by harnessing the power of deep learning techniques. Deep neural networks are at the core of this new system, providing exceptional accuracy and reliability in tasks such as object recognition, image segmentation, and scene understanding. [Methodology] Furthermore, this framework offers a versatile platform for real-time image processing, paving the way for numerous applications in areas like industrial automation, medical diagnostics, and autonomous vehicles. This study comprehensively explores the architectural elements and methodologies that drive this innovative framework. It emphasizes the framework's technological capabilities, scalability, adaptability, and potential for broad adoption across industries seeking advanced computer vision solutions. [Results] The proposed model, Convolutional Neural Network-Feature Pyramid Network (CNN-FPN), demonstrates superior performance across all evaluated metrics for object detection compared to existing models. Specifically, it achieves the highest scores in Accuracy (57.2%), Recall (60.4%), Precision (94.1%), F1-Score (73.5%), and AUC (0.983). These results indicate that the proposed model offers superior performance and reliability in object detection tasks, demonstrating its potential for high-precision computer vision applications. [Conclusions] In conclusion, this innovative architecture represents a significant advancement in computer vision, enabled by the capabilities of deep learning. Our test findings demonstrate that compared to conventional algorithms, the enhanced CNN-FPN produced more accurate results.

Downloads

Download data is not yet available.

References

Abdusalomov, A. B., Islam, B. M. S., Nasimov, R., Mukhiddinov, M., Whangbo, T. K. (2023). An improved forest fire detection method based on the detectron2 model and a deep learning approach. Sensors, 23(3), 1512. https://doi.org/10.3390/s23031512

Alsakka, F., Assaf, S., El-Chami, I., Al-Hussein, M. (2023). Computer vision applications in offsite construction. Automation in Construction, 154, 104980. https://doi.org/10.1016/j.autcon.2023.104980

Ariyanto, M., Purnamasari, P. D. (2021, October). Object detection system for self- checkout cashier system based on faster region-based convolution neural network and YOLO9000. In 2021 17th International Conference on Quality in Research (QIR): International Symposium on Electrical and Computer Engineering (pp. 153- 157). IEEE. https://doi.org/10.1109/QIR54354.2021.9716200

Ballard, D. H. (2021). Animat vision. In Computer vision: A reference guide (pp. 52-57). https://doi.org/10.1007/978-3-030-63416-2_273

Batchu, R.K., Bikku, T., Thota, S., Seetha, H., Ayoade, A.A. (2024). A novel optimization-driven deep learning framework for the detection of DDoS attacks. Scientific Reports, 14, 28024 (2024). https://doi.org/10.1038/s41598-024-77554-9

Bhatt, D., Patel, C., Talsania, H., Patel, J., Vaghela, R., Pandya, S., Ghayvat, H. (2021). CNN variants for computer vision: History, architecture, application, challenges and future scope. Electronics, 10(20), 2470. https://doi.org/10.3390/electronics10202470

Bikku, T., Malligunta, K.K., Thota, S., Surapaneni, P. P. (2024b). Improved Quantum Algorithm: A Crucial Stepping Stone in Quantum-Powered Drug Discovery. Journal of Electronic Materials, 54(2024), 3434-3443. https://doi.org/10.1007/s11664-024-11275-7

Bikku, T., Thota, S., Shanmugasundaram, P. (2024a). A Novel Quantum Neural Network Approach to Combating Fake Reviews. International Journal of Networked and Distributed Computing, 2024, 1-11. https://doi.org/10.1007/s44227-024-00028-x

Efthymiou, S., Ramos-Calderer, S., Bravo-Prieto, C., Pérez-Salinas, A., García-Martín, D., Garcia-Saez, A., Carrazza, S. (2021). Qibo: a framework for quantum simulation with hardware acceleration. Quantum Science and Technology, 7(1), 015018. https://doi.org/10.1088/2058-9565/ac39f5

Han, K., Wang, Y., Chen, H., Chen, X., Guo, J., Liu, Z., Tao, D. (2022). A survey on vision transformer. IEEE transactions on pattern analysis and machine intelligence, 45(1), 87-110. https://doi.org/10.1109/TPAMI.2022.3152247

Hasan, N., Bao, Y., Shawon, A., Huang, Y. (2021). DenseNet convolutional neural networks application for predicting COVID-19 using CT image. SN computer science, 2(5), 389. https://doi.org/10.1007/s42979-021-00782-7

Jain, S. (2024). Deepseanet: Improving underwater object detection using efficientdet. In 2024 4th International Conference on Applied Artificial Intelligence (ICAPAI) (pp. 1-11). IEEE. https://doi.org/10.1109/ICAPAI61893.2024.10541265

Kim, J., Davis, T., Hong, L. (2022). Augmented intelligence: enhancing human decision making. In Bridging Human Intelligence and Artificial Intelligence (pp. 151-170). Cham: Springer International Publishing. https://doi.org/10.1007/978-3-030-84729-6_10

Manakitsa, N., Maraslidis, G. S., Moysis, L., Fragulis, G. F. (2024). A Review of Machine Learning and Deep Learning for Object Detection, Semantic Segmentation, and Human Action Recognition in Machine and Robotic Vision. Technologies, 12(2), 15. https://doi.org/10.3390/technologies12020015

Nazar, N., Subash, T. D. (2024). Navigating Augmented Realities: A Review Of Advancements, Applications, And Future Prospects. Educational Administration: Theory and Practice, 30(4), 4182-4186. https://doi.org/10.53555/kuey.v30i4.2173

Pujari, J. J., et al. (2024). Deep fake Image Verification using DCNN with MobileNetV2. In 3rd Edition of IEEE Delhi Section Flagship Conference (DELCON). IEEE, 2024. https://doi.org/10.1109/DELCON64804.2024.10866044

Ravikumar, A. & Sriraman, H. (2023). Acceleration of Image Processing and Computer Vision Algorithms. In Handbook of Research on Computer Vision and Image Processing in the Deep Learning Era (pp. 1-18). IGI Global. https://doi.org/10.4018/978-1-7998-8892-5.ch001

Safaldin, M., Zaghden, N., Mejdoub, M. (2024). An Improved YOLOv8 to Detect Moving Objects. IEEE Access. https://doi.org/10.1109/ACCESS.2024.3393835

Szeliski, R. (2022). Computer vision: algorithms and applications. Springer Nature. https://doi.org/10.1007/978-3-030-34372-9

Thota, S., Bikku, T., Rakshitha, T, (2025). Hybrid optimization technique for matrix chain multiplication using Strassen’s algorithm, F1000Research, 2025, 1-14. https://doi.org/10.12688/f1000research.162848.1

Thota, S., Gopisairam, T., Bikku, T. (2024). Modelling LCR-Circuit into Integro-Differential Equation Using Variational Iteration Method and GRU-Based Recurrent Neural Network, In 2024 3rd Edition of IEEE Delhi Section Flagship Conference (DELCON), New Delhi, India.https://doi.org/10.1109/DELCON64804.2024.10866074

Wang, B., Ji, R., Zhang, L., Wu, Y. (2022). Bridging multi-scale context-aware representation for object detection. In IEEE Transactions on Circuits and Systems for Video Technology. https://doi.org/10.1109/TCSVT.2022.3221755

Xavier, A. I., Villavicencio, C., Macrohon, J. J., Jeng, J. H., Hsieh, J. G. (2022). Object detection via gradient-based mask R-CNN using machine learning algorithms. Machines, 10(5), 340. https://doi.org/10.3390/machines10050340

Yadav, S. P., Jindal, M., Rani, P., de Albuquerque, V. H. C., dos Santos Nascimento, C., Kumar, M. (2024). An improved deep learning-based optimal object detection system from images. Multimedia Tools and Applications, 83(10), 30045-30072. https://doi.org/10.1007/s11042-023-16736-5

Zhao, H., Zhang, H., Zhao, Y. (2023). Yolov7-sea: Object detection of maritime uav images based on improved yolov7. In Proceedings of the IEEE/CVF winter conference on applications of computer vision (pp. 233-238). https://doi.org/10.1109/WACVW58289.2023.00029

Zhao, L., Li, S. (2020). Object detection algorithm based on improved YOLOv3. Electronics, 9(3), 537. https://doi.org/10.3390/electronics9030537

Zhao, X., Wang, L., Zhang, Y., Han, X., Deveci, M., Parmar, M. (2024). A review of convolutional neural networks in computer vision. Artificial Intelligence Review, 57(4), 99. https://doi.org/10.1007/s10462-024-10721-6

Zheng, Y. X., Chee, K. W. G. A., Paul, A., Kim, J., Lv, H. (2023). Electronics Engineering Perspectives on Computer Vision Applications: An Overview of Techniques, Sub- areas, Advancements and Future Challenges. Cutting Edge Applications of Computational Intelligence Tools and Techniques, 113-142. https://doi.org/10.1007/978-3-031-44127-1_6

Zou, Z., Chen, K., Shi, Z., Guo, Y., Ye, J. (2023). Object detection in 20 years: A survey. Proceedings of the IEEE, 111(3), 257-276.https://doi.org/10.1109/JPROC.2023.3238524

Published

2025-11-30

Issue

Section

Original scientific papers (evaluated by academic peers)

How to Cite

Bikku, T., Thota, S., & Ayoade, A. A. (2025). An Innovative Framework for Intelligent Computer Vision Empowered by Deep Learning. Uniciencia, 39(1), 1-17. https://doi.org/10.15359/ru.39-1.13