An Innovative Framework for Intelligent Computer Vision Empowered by Deep Learning
DOI:
https://doi.org/10.15359/ru.39-1.13Keywords:
Computer vision, Super pixel, Deep learning, Object detection, Image Classification, Object recognition, Neural networksAbstract
[Objective] The field of computer vision has seen remarkable progress, largely due to the advancements in deep learning. These developments have revolutionized image recognition, interpretation, and application across numerous domains. This paper introduces a new framework designed to expand the potential of computer vision systems by harnessing the power of deep learning techniques. Deep neural networks are at the core of this new system, providing exceptional accuracy and reliability in tasks such as object recognition, image segmentation, and scene understanding. [Methodology] Furthermore, this framework offers a versatile platform for real-time image processing, paving the way for numerous applications in areas like industrial automation, medical diagnostics, and autonomous vehicles. This study comprehensively explores the architectural elements and methodologies that drive this innovative framework. It emphasizes the framework's technological capabilities, scalability, adaptability, and potential for broad adoption across industries seeking advanced computer vision solutions. [Results] The proposed model, Convolutional Neural Network-Feature Pyramid Network (CNN-FPN), demonstrates superior performance across all evaluated metrics for object detection compared to existing models. Specifically, it achieves the highest scores in Accuracy (57.2%), Recall (60.4%), Precision (94.1%), F1-Score (73.5%), and AUC (0.983). These results indicate that the proposed model offers superior performance and reliability in object detection tasks, demonstrating its potential for high-precision computer vision applications. [Conclusions] In conclusion, this innovative architecture represents a significant advancement in computer vision, enabled by the capabilities of deep learning. Our test findings demonstrate that compared to conventional algorithms, the enhanced CNN-FPN produced more accurate results.
Downloads
References
Abdusalomov, A. B., Islam, B. M. S., Nasimov, R., Mukhiddinov, M., Whangbo, T. K. (2023). An improved forest fire detection method based on the detectron2 model and a deep learning approach. Sensors, 23(3), 1512. https://doi.org/10.3390/s23031512
Alsakka, F., Assaf, S., El-Chami, I., Al-Hussein, M. (2023). Computer vision applications in offsite construction. Automation in Construction, 154, 104980. https://doi.org/10.1016/j.autcon.2023.104980
Ariyanto, M., Purnamasari, P. D. (2021, October). Object detection system for self- checkout cashier system based on faster region-based convolution neural network and YOLO9000. In 2021 17th International Conference on Quality in Research (QIR): International Symposium on Electrical and Computer Engineering (pp. 153- 157). IEEE. https://doi.org/10.1109/QIR54354.2021.9716200
Ballard, D. H. (2021). Animat vision. In Computer vision: A reference guide (pp. 52-57). https://doi.org/10.1007/978-3-030-63416-2_273
Batchu, R.K., Bikku, T., Thota, S., Seetha, H., Ayoade, A.A. (2024). A novel optimization-driven deep learning framework for the detection of DDoS attacks. Scientific Reports, 14, 28024 (2024). https://doi.org/10.1038/s41598-024-77554-9
Bhatt, D., Patel, C., Talsania, H., Patel, J., Vaghela, R., Pandya, S., Ghayvat, H. (2021). CNN variants for computer vision: History, architecture, application, challenges and future scope. Electronics, 10(20), 2470. https://doi.org/10.3390/electronics10202470
Bikku, T., Malligunta, K.K., Thota, S., Surapaneni, P. P. (2024b). Improved Quantum Algorithm: A Crucial Stepping Stone in Quantum-Powered Drug Discovery. Journal of Electronic Materials, 54(2024), 3434-3443. https://doi.org/10.1007/s11664-024-11275-7
Bikku, T., Thota, S., Shanmugasundaram, P. (2024a). A Novel Quantum Neural Network Approach to Combating Fake Reviews. International Journal of Networked and Distributed Computing, 2024, 1-11. https://doi.org/10.1007/s44227-024-00028-x
Efthymiou, S., Ramos-Calderer, S., Bravo-Prieto, C., Pérez-Salinas, A., García-Martín, D., Garcia-Saez, A., Carrazza, S. (2021). Qibo: a framework for quantum simulation with hardware acceleration. Quantum Science and Technology, 7(1), 015018. https://doi.org/10.1088/2058-9565/ac39f5
Han, K., Wang, Y., Chen, H., Chen, X., Guo, J., Liu, Z., Tao, D. (2022). A survey on vision transformer. IEEE transactions on pattern analysis and machine intelligence, 45(1), 87-110. https://doi.org/10.1109/TPAMI.2022.3152247
Hasan, N., Bao, Y., Shawon, A., Huang, Y. (2021). DenseNet convolutional neural networks application for predicting COVID-19 using CT image. SN computer science, 2(5), 389. https://doi.org/10.1007/s42979-021-00782-7
Jain, S. (2024). Deepseanet: Improving underwater object detection using efficientdet. In 2024 4th International Conference on Applied Artificial Intelligence (ICAPAI) (pp. 1-11). IEEE. https://doi.org/10.1109/ICAPAI61893.2024.10541265
Kim, J., Davis, T., Hong, L. (2022). Augmented intelligence: enhancing human decision making. In Bridging Human Intelligence and Artificial Intelligence (pp. 151-170). Cham: Springer International Publishing. https://doi.org/10.1007/978-3-030-84729-6_10
Manakitsa, N., Maraslidis, G. S., Moysis, L., Fragulis, G. F. (2024). A Review of Machine Learning and Deep Learning for Object Detection, Semantic Segmentation, and Human Action Recognition in Machine and Robotic Vision. Technologies, 12(2), 15. https://doi.org/10.3390/technologies12020015
Nazar, N., Subash, T. D. (2024). Navigating Augmented Realities: A Review Of Advancements, Applications, And Future Prospects. Educational Administration: Theory and Practice, 30(4), 4182-4186. https://doi.org/10.53555/kuey.v30i4.2173
Pujari, J. J., et al. (2024). Deep fake Image Verification using DCNN with MobileNetV2. In 3rd Edition of IEEE Delhi Section Flagship Conference (DELCON). IEEE, 2024. https://doi.org/10.1109/DELCON64804.2024.10866044
Ravikumar, A. & Sriraman, H. (2023). Acceleration of Image Processing and Computer Vision Algorithms. In Handbook of Research on Computer Vision and Image Processing in the Deep Learning Era (pp. 1-18). IGI Global. https://doi.org/10.4018/978-1-7998-8892-5.ch001
Safaldin, M., Zaghden, N., Mejdoub, M. (2024). An Improved YOLOv8 to Detect Moving Objects. IEEE Access. https://doi.org/10.1109/ACCESS.2024.3393835
Szeliski, R. (2022). Computer vision: algorithms and applications. Springer Nature. https://doi.org/10.1007/978-3-030-34372-9
Thota, S., Bikku, T., Rakshitha, T, (2025). Hybrid optimization technique for matrix chain multiplication using Strassen’s algorithm, F1000Research, 2025, 1-14. https://doi.org/10.12688/f1000research.162848.1
Thota, S., Gopisairam, T., Bikku, T. (2024). Modelling LCR-Circuit into Integro-Differential Equation Using Variational Iteration Method and GRU-Based Recurrent Neural Network, In 2024 3rd Edition of IEEE Delhi Section Flagship Conference (DELCON), New Delhi, India.https://doi.org/10.1109/DELCON64804.2024.10866074
Wang, B., Ji, R., Zhang, L., Wu, Y. (2022). Bridging multi-scale context-aware representation for object detection. In IEEE Transactions on Circuits and Systems for Video Technology. https://doi.org/10.1109/TCSVT.2022.3221755
Xavier, A. I., Villavicencio, C., Macrohon, J. J., Jeng, J. H., Hsieh, J. G. (2022). Object detection via gradient-based mask R-CNN using machine learning algorithms. Machines, 10(5), 340. https://doi.org/10.3390/machines10050340
Yadav, S. P., Jindal, M., Rani, P., de Albuquerque, V. H. C., dos Santos Nascimento, C., Kumar, M. (2024). An improved deep learning-based optimal object detection system from images. Multimedia Tools and Applications, 83(10), 30045-30072. https://doi.org/10.1007/s11042-023-16736-5
Zhao, H., Zhang, H., Zhao, Y. (2023). Yolov7-sea: Object detection of maritime uav images based on improved yolov7. In Proceedings of the IEEE/CVF winter conference on applications of computer vision (pp. 233-238). https://doi.org/10.1109/WACVW58289.2023.00029
Zhao, L., Li, S. (2020). Object detection algorithm based on improved YOLOv3. Electronics, 9(3), 537. https://doi.org/10.3390/electronics9030537
Zhao, X., Wang, L., Zhang, Y., Han, X., Deveci, M., Parmar, M. (2024). A review of convolutional neural networks in computer vision. Artificial Intelligence Review, 57(4), 99. https://doi.org/10.1007/s10462-024-10721-6
Zheng, Y. X., Chee, K. W. G. A., Paul, A., Kim, J., Lv, H. (2023). Electronics Engineering Perspectives on Computer Vision Applications: An Overview of Techniques, Sub- areas, Advancements and Future Challenges. Cutting Edge Applications of Computational Intelligence Tools and Techniques, 113-142. https://doi.org/10.1007/978-3-031-44127-1_6
Zou, Z., Chen, K., Shi, Z., Guo, Y., Ye, J. (2023). Object detection in 20 years: A survey. Proceedings of the IEEE, 111(3), 257-276.https://doi.org/10.1109/JPROC.2023.3238524
Published
Issue
Section
License
Copyright (c) 2025 Shared by Journal and Authors (CC-BY-NC-ND)

This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License.
Authors who publish with this journal agree to the following terms:
1. Authors guarantee the journal the right to be the first publication of the work as licensed under a Creative Commons License that allows others to share the work with an acknowledgment of the work's authorship and initial publication in this journal https://creativecommons.org/licenses/by-nc-nd/4.0/.
2. Authors can set separate additional agreements for non-exclusive distribution of the version of the work published in the journal (eg, place it in an institutional repository or publish it in a book), with an acknowledgment of its initial publication in this journal.
3. The authors have declared to hold all permissions to use the resources they provided in the paper (images, tables, among others) and assume full responsibility for damages to third parties.
4. The opinions expressed in the paper are the exclusive responsibility of the authors and do not necessarily represent the opinion of the editors or the Universidad Nacional.
Uniciencia Journal and all its productions are under Creative Commons Atribución-NoComercial-SinDerivadas 4.0 Unported.
There is neither fee for access nor Article Processing Charge (APC)
