Real-Time Object Detection in Complex Environments: Leveraging Deep Learning and Computer Vision Techniques

Dawood Hussian; Heba Raheem; Maha  Altememe; Yan  Li; Shahab  Abdulla

doi:10.22153/kej.2026.03.001

المؤلفون

Dawood Hussian Computer Technology Engineering, Al Taff University College, karbala, Iraq
Heba Raheem University of Kerbala https://orcid.org/0000-0002-0140-1805
Maha Altememe University of Kerbala
Yan Li University of Southern Queensland
Shahab Abdulla University of Southern Queensland

DOI:

https://doi.org/10.22153/kej.2026.03.001

الكلمات المفتاحية:

Real-time object detection; YOLOv5; Deep learning; Computer vision; Google colab; COCO dataset; Live webcam detection

الملخص

This research tackles major real-time detection obstacles such as changing lighting effects and object blocks and background elements and different object sizes because such problems occur frequently in autonomous driving and surveillance and smart infrastructure applications. A detection pipeline system with modular functionality enabled processing of images, videos and webcams in real-time and had specific optimizations for each input format. The YOLOv5s model was selected due to its accurate and fast performance characteristics so we deployed it in a cloud-based Google Colab system with GPU acceleration capabilities. Real-world data collection succeeded in quantitative analysis through assessment of inference duration together with frame speed and detection precision along with confidence values while qualitative methods measured box precision and label validity. The system produced exceptional results by processing images and video data within 28 to 35 milliseconds and webcam frames between 1.8 to 2.3 seconds while generating confidence scores between 0.70 and 0.93. Real-time applications benefit from this system because it presents stable detection while being environmentally flexible and practically applicable. YOLOv5 proves robust based on the discovered test results which indicate future potential deployments of intelligent visual monitoring systems across all dynamic environments.

التنزيلات

تنزيل البيانات ليس متاحًا بعد.

المراجع

[1] A. Bochkovskiy, C. Y. Wang, and H. Y. M. Liao, “YOLOv4: Optimal speed and accuracy of object detection,” arXiv preprint arXiv:2004.10934, 2020, https://doi.org/10.48550/arXiv.2004.10934

[2] J. Redmon and A. Farhadi, “YOLOv3: An incremental improvement,” arXiv preprint arXiv:1804.02767, 2018, https://doi.org/10.48550/arXiv.1804.02767

[3] J. Redmon, S. Divvala, R. Girshick, and A. Farhadi, “You only look once: Unified, real-time object detection,” in Proc. IEEE CVPR, 2016, pp. 779–788, https://doi.org/10.1109/CVPR.2016.91

[4] N. Carion et al., “End-to-end object detection with transformers,” in Proc. ECCV, 2020, pp. 213–229, https://doi.org/10.1007/978-3-030-58452-8_13

[5] G. Jocher, “YOLOv5 by Ultralytics,” GitHub, 2020. [Online]. Available: https://github.com/ultralytics/yolov5

[6] C. Y. Wang, A. Bochkovskiy, and H. Y. M. Liao, “YOLOv7: Trainable bag-of-freebies sets new state-of-the-art for real-time object detectors,” in Proc. IEEE/CVF CVPR, 2023, pp. 7464–7475, https://doi.org/10.1109/CVPR52729.2023.00721

[7] Z. Ge, S. Liu, F. Wang, Z. Li, and J. Sun, “YOLOX: Exceeding YOLO series in 2021,” arXiv preprint arXiv:2107.08430, 2021, https://doi.org/10.48550/arXiv.2107.08430

[8] X. Long et al., “PP-YOLO: An effective and efficient implementation of object detector,” arXiv preprint arXiv:2007.12099, 2020, https://doi.org/0.48550/arXiv.2007.12099

[9] S. Ren, K. He, R. Girshick, and J. Sun, “Faster R-CNN: Towards real-time object detection with region proposal networks,” in Adv. Neural Inf. Process. Syst., vol. 28, 2015, https://doi.org/10.48550/arXiv.1506.01497

[10] A. Dosovitskiy et al., “An image is worth 16×16 words: Transformers for image recognition at scale,” arXiv preprint arXiv:2010.11929, 2020, https://doi.org/10.48550/arXiv.2010.11929

[11] K. He, X. Zhang, S. Ren, and J. Sun, “Deep residual learning for image recognition,” in Proc. IEEE CVPR, 2016, pp. 770–778, https://doi.org/10.1109/CVPR.2016.90

[12] M. Tan, R. Pang, and Q. V. Le, “EfficientDet: Scalable and efficient object detection,” in Proc. IEEE/CVF CVPR, 2020, pp. 10781–10790, https://doi.org/10.1109/CVPR42600.2020.01079

[13] T. Azfar et al., “Deep learning-based computer vision methods for complex traffic environments perception: A review,” Data Science for Transportation, vol. 6, no. 1, 2024, https://doi.org/10.1007/s42421-023-00086-7

[14] A. Devasani, P. Vineeth, and S. Haripal, “Real-time object detection using deep learning for video streams,” 2024, https://doi.org/10.48047/IJIEMR/V13/ISSUE05/07

[15] D. K. Pramudito, “Enhancing real-time object detection in autonomous systems using deep learning and computer vision techniques,” Journal of Academic Science, vol. 1, no. 6, pp. 788–804, 2024, https://doi.org/10.59613/v3015d10

[16] M. T. Hosain et al., “Synchronizing object detection: Applications, advancements and existing challenges,” IEEE Access, 2024, https://doi.org/10.1109/ACCESS.2024.3388889

[17] S. Tuli, N. Basumatary, and R. Buyya, “EdgeLens: Deep learning based object detection in integrated IoT, fog and cloud computing environments,” in Proc. ISCON, 2019, pp. 496–502, https://doi.org/10.1109/ISCON47742.2019.9036216

[18] M. Andronie et al., “Big data management algorithms and deep learning-based object detection technologies,” ISPRS Int. J. Geo-Inf., vol. 12, p. 35, 2023, https://doi.org/10.3390/ijgi12020035

[19] A. Kaur et al., “A survey on deep learning approaches to medical images,” Archives of Computational Methods in Engineering, 2022, https://doi.org/10.1007/s11831-021-09649-9

[20] N. Manakitsa et al., “A review of machine learning and deep learning for object detection, semantic segmentation, and human action recognition,” Technologies, vol. 12, no. 2, p. 15, 2024, https://doi.org/10.3390/technologies12020015

[21] Y. Ghasemi et al., “Deep learning-based object detection in augmented reality: A systematic review,” Computers in Industry, vol. 139, p. 103661, 2022, https://doi.org/10.1016/j.compind.2022.103661

[22] M. J. Shafiee et al., “Fast YOLO: A fast you only look once system for real-time embedded object detection in video,” arXiv preprint arXiv:1709.05943, 2017, https://doi.org/10.48550/arXiv.1709.05943

[23] M. Ahmed et al., “Survey and performance analysis of deep learning-based object detection in challenging environments,” Sensors, vol. 21, no. 15, p. 5116, 2021, https://doi.org/10.3390/s21155116

[24] Z. Cao et al., “Real-time object detection based on UAV remote sensing: A systematic literature review,” Drones, vol. 7, no. 10, p. 620, 2023, https://doi.org/10.3390/drones7100620

[25] M. Simon et al., “Complexer-YOLO: Real-time 3D object detection and tracking on semantic point clouds,” in Proc. IEEE/CVF Workshops, 2019, https://doi.org/10.48550/arXiv.1904.07537

[26] Z.-Q. Zhao, P. Zheng, S.-T. Xu, and X. Wu, “Object detection with deep learning: A review,” IEEE Transactions on Neural Networks and Learning Systems, vol. 30, no. 11, pp. 3212–3232, Nov. 2019, https://doi.org/10.1109/TNNLS.2018.2876865

[27] S. S. A. Zaidi et al., “A survey of modern deep learning based object detection models,” Digital Signal Processing, vol. 126, p. 103514, 2022, https://doi.org/10.1016/j.dsp.2022.103514

[28] D. T. Hristopulos et al., “Open challenges in environmental data analysis and ecological complex systems,” Euro physics Letters, vol. 132, no. 6, p. 68001, 2021, https://doi.org/10.1209/0295-5075/132/68001

[29] N. H. Abdulghafoor and H. N. Abdullah, “A novel real-time multiple objects detection and tracking framework for different challenges,” Alexandria Engineering Journal, vol. 61, no. 12, pp. 9637–9647, 2022, https://doi.org/10.1016/j.aej.2022.02.068