
YOLOv9-Based Human Face Detection and Counting Under Human-Animal Faces, Complex Imaging Environments, and Image Qualities
Author(s) -
Sivaranjini Perikamarayanan,
M. Sabarimalai Manikandan,
Linga Reddy Cenkeramaddi
Publication year - 2025
Publication title -
ieee access
Language(s) - English
Resource type - Magazines
SCImago Journal Rank - 0.587
H-Index - 127
eISSN - 2169-3536
DOI - 10.1109/access.2025.3591247
Subject(s) - aerospace , bioengineering , communication, networking and broadcast technologies , components, circuits, devices and systems , computing and processing , engineered materials, dielectrics and plasmas , engineering profession , fields, waves and electromagnetics , general topics for engineers , geoscience , nuclear engineering , photonics and electrooptics , power, energy and industry applications , robotics and control systems , signal processing and analysis , transportation
Automatic human face detection and counting can play a vital role in the recognition and tracking of infant and adult faces in both outdoor and indoor human surveillance applications and facial-vital sign measurement. Despite the advancements in deep learning networks, accurate and reliable detection is still a challenging task in the presence of different kinds of objects, animal faces, and image characteristics. In this paper, we study the effectiveness of the YOLOv9-based face detection and counting method under skin color variations, different face sizes and number of faces, mixture of human and animal faces, image properties (brightness, contrast, illumination, glare), image qualities (image blurring, low-light facial images, lens flared images, grainy images taken at night, different types of noises), and pareidolia effects. The proposed method is trained and validated using the Wider Face database. In addition, we created image databases with different kinds of image qualities and image characteristics with the above-mentioned challenges. The YOLOv9-based face detection model achieves a precision of 86%, a recall of 62.8%, and a mean average precision of 70.8% at an inference time of 15.2 ms on the Wider Face database. Evaluation results demonstrate that the YOLOv9-based face counting outperforms most of the state-of-the-art face detection and people counting methods with a mean absolute error (MAE) of 3.36 and root mean square error (RMSE) of 22.38. The model was also deployed on the Raspberry Pi edge computing platform to study the real-time performance. The YOLOv9-based method achieves an MAE of 0.53-5.76 on the untrained infant database with Gaussian, salt and pepper, and speckle noises and an MAE of 0.43-2.87 on faces inside vehicles. The study further highlights the effectiveness of the YOLOv9 model in achieving promising face detection and counting results under a range of illumination and skin color variations. Evaluation results on a wide variety of animal faces and pareidolia-induced faces demonstrate more false positives due to the lack of contextual intelligence in the generation of deep-face models. Further, results show that the deep-face model detects artificial faces (statues, art, paintings, posters) if the model is deployed in uncontrolled face-based application environments. The performance of the model is degraded under different kinds of noise and blurred images. The results of this study highlight that performance can be improved by using noise-specific filtering techniques with optimal filtering parameters, but this requires the automatic identification of noise types and their corresponding parameters.
Accelerating Research
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom
Address
John Eccles HouseRobert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom