TY - JOUR
T1 - An IoT-based human detection system for complex industrial environment with deep learning architectures and transfer learning
AU - Ahmed, Imran
AU - Anisetti, Marco
AU - Jeon, Gwanggil
N1 - Publisher Copyright:
© 2021 Wiley Periodicals LLC.
PY - 2022/12
Y1 - 2022/12
N2 - Artificial intelligence (AI), combined with the Internet of Things (IoT), plays a beneficial role in various fields, including intelligent surveillance applications. With IoT and 5G advancement, intelligent sensors, and devices in the surveillance environment collect large amounts of data in the form of videos and images. These collected data require intelligent information processing solutions, help analyze the recorded videos and images to detect and identify various objects in the scene, particularly humans. In this study, an automated human detection system is presented for a complex industrial environment, in which people are monitored/detected from a top view perspective. A top view is usually preferred because it can provide sufficient coverage and enough visibility of a scene. This study demonstrates the applications, efficiency, and effectiveness of deep learning architectures, that is, Faster Region Convolutional Neural Network (Faster R-CNN), Single Shot MultiBox Detector (SSD), and You Only Look Once (YOLOv3), with transfer learning. Experimental results reveal that with additional training and transfer learning, the performance of all detection architectures is significantly improved. The detection results are also compared using the same data set. The deep learning architectures achieve promising results with maximum true-positive rate of 93%, 94%, and 94% for Faster-RCNN, SSD, and YOLOv3, respectively. Furthermore, a detailed study is performed on output results that highlight challenges and probable future trends.
AB - Artificial intelligence (AI), combined with the Internet of Things (IoT), plays a beneficial role in various fields, including intelligent surveillance applications. With IoT and 5G advancement, intelligent sensors, and devices in the surveillance environment collect large amounts of data in the form of videos and images. These collected data require intelligent information processing solutions, help analyze the recorded videos and images to detect and identify various objects in the scene, particularly humans. In this study, an automated human detection system is presented for a complex industrial environment, in which people are monitored/detected from a top view perspective. A top view is usually preferred because it can provide sufficient coverage and enough visibility of a scene. This study demonstrates the applications, efficiency, and effectiveness of deep learning architectures, that is, Faster Region Convolutional Neural Network (Faster R-CNN), Single Shot MultiBox Detector (SSD), and You Only Look Once (YOLOv3), with transfer learning. Experimental results reveal that with additional training and transfer learning, the performance of all detection architectures is significantly improved. The detection results are also compared using the same data set. The deep learning architectures achieve promising results with maximum true-positive rate of 93%, 94%, and 94% for Faster-RCNN, SSD, and YOLOv3, respectively. Furthermore, a detailed study is performed on output results that highlight challenges and probable future trends.
KW - artificial intelligence
KW - complex industrial environment
KW - deep learning
KW - internet of things
KW - person detection
KW - top view
UR - http://www.scopus.com/inward/record.url?scp=85106307782&partnerID=8YFLogxK
U2 - 10.1002/int.22472
DO - 10.1002/int.22472
M3 - Article
AN - SCOPUS:85106307782
SN - 0884-8173
VL - 37
SP - 10249
EP - 10267
JO - International Journal of Intelligent Systems
JF - International Journal of Intelligent Systems
IS - 12
ER -