Abstract
Neonatal face detection is the prerequisite for face-based intelligent medical applications. Nevertheless, it has been found that this area has received minimal attention in existing research. The paucity of open-source, large-scale datasets significantly constrains current studies, which are further compounded by issues such as large-scale occlusions, class imbalance, and precise localization requirements. This work aims to address these challenges from both data and methodological perspectives. We constructed the first open-source face detection dataset for neonates, involving images from 1,000 neonates in the neonatal wards. Utilizing this dataset and adopting NICUface-RF as the baseline, we introduce two novel modules. The hierarchical contextual classification aims to improve the positive/negative anchor ratios and alleviate large-scale occlusions. Concurrently, the DIoU-aware NMS is designed to preserve bounding boxes of superior localization quality by employing predicted DIoUs as the ranking criterion in NMS procedures. Experimental results illustrate the superiority of our method. The dataset and code is available at https://github.com/neonatal-pain.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
References
Awais, M., Long, X., Yin, B., Abbasi, S.F., Akbarzadeh, S., Lu, C., Wang, X., Wang, L., Zhang, J., Dudink, J., Chen, W.: A hybrid dcnn-svm model for classifying neonatal sleep and wake states based on facial expressions in video. IEEE Journal of Biomedical and Health Informatics 25(5), 1441–1449 (2021)
Brahnam, S., Chuang, C.F., Shih, F.Y., Slack, M.R.: Svm classification of neonatal facial images of pain. In: Fuzzy Logic and Applications: 6th International Workshop, WILF 2005, Crema, Italy, September 15-17, 2005, Revised Selected Papers 6. pp. 121–128. Springer (2006)
Brahnam, S., Nanni, L., Sexton, R.: Introduction to neonatal facial pain detection using common and advanced face classification techniques. In: Advanced Computational Intelligence Paradigms in Healthcare–1, pp. 225–253. Springer (2007)
Dai, J., Li, Y., He, K., Sun, J.: R-FCN: object detection via region-based fully convolutional networks. In: Proceedings of the 30th International Conference on Neural Information Processing Systems. pp. 379–387 (2016)
Deng, J., Guo, J., Zhou, Y., Yu, J., Kotsia, I., Zafeiriou, S.: Retinaface: Single-stage dense face localisation in the wild. arXiv preprint arXiv:1905.00641 (2019)
Dollar, P., Wojek, C., Schiele, B., Perona, P.: Pedestrian detection: An evaluation of the state of the art. IEEE Transactions on Pattern Analysis and Machine Intelligence 34(4), 743–761 (2011)
Dosso, Y.S., Kyrollos, D., Greenwood, K.J., Harrold, J., Green, J.R.: Nicuface: Robust neonatal face detection in complex nicu scenes. IEEE Access 10, 62893–62909 (2022)
Glorot, X., Bengio, Y.: Understanding the difficulty of training deep feedforward neural networks. In: Proceedings of the thirteenth international conference on artificial intelligence and statistics. pp. 249–256. JMLR Workshop and Conference Proceedings (2010)
He, K., Zhang, X., Ren, S., Sun, J.: Deep residual learning for image recognition. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. pp. 770–778 (2016)
Huang, B., Chen, W., Lin, C.L., Juang, C.F., Xing, Y., Wang, Y., Wang, J.: A neonatal dataset and benchmark for non-contact neonatal heart rate monitoring based on spatio-temporal neural networks. Engineering Applications of Artificial Intelligence 106, 104447 (2021)
Khanam, F.T.Z., Perera, A.G., Al-Naji, A., Gibson, K., Chahl, J.: Non-contact automatic vital signs monitoring of infants in a neonatal intensive care unit based on neural networks. Journal of Imaging 7(8), 122 (2021)
Li, J., Wang, Y., Wang, C., Tai, Y., Qian, J., Yang, J., Wang, C., Li, J., Huang, F.: DSFD: dual shot face detector. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. pp. 5060–5069 (2019)
Lin, T.Y., Goyal, P., Girshick, R., He, K., Dollár, P.: Focal loss for dense object detection. In: Proceedings of the IEEE International Conference on Computer Vision. pp. 2980–2988 (2017)
Liu, W., Anguelov, D., Erhan, D., Szegedy, C., Reed, S., Fu, C.Y., Berg, A.C.: SSD: Single shot multibox detector. In: Computer Vision–ECCV 2016: 14th European Conference, Amsterdam, The Netherlands, October 11–14, 2016, Proceedings, Part I 14. pp. 21–37. Springer (2016)
Najibi, M., Singh, B., Davis, L.S.: FA-RPN: Floating region proposals for face detection. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. pp. 7723–7732 (2019)
Nihila, S., Rajalakshmi, T., Panda, S.S., Lhazay, N., Giri, G.D.: Detection of jaundice in neonates using artificial intelligence. In: Soft Computing: Theories and Applications: Proceedings of SoCTA 2020, Volume 2. pp. 431–443. Springer (2021)
Olmi, B., Manfredi, C., Frassineti, L., Dani, C., Lori, S., Bertini, G., Gabbanini, S., Lanatà, A.: Aggregate channel features for newborn face detection in neonatal intensive care units. In: 2022 44th Annual International Conference of the IEEE Engineering in Medicine & Biology Society (EMBC). pp. 455–458. IEEE (2022)
Qi, D., Tan, W., Yao, Q., Liu, J.: YOLO5Face: why reinventing a face detector. In: European Conference on Computer Vision. pp. 228–244. Springer (2022)
Ren, S., He, K., Girshick, R., Sun, J.: Faster R-CNN: Towards real-time object detection with region proposal networks. In: Advances in Neural Information Processing Systems. vol. 28 (2015)
Yang, S., Luo, P., Loy, C.C., Tang, X.: Wider face: A face detection benchmark. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. pp. 5525–5533 (2016)
Zhang, C., Xu, X., Tu, D.: Face detection using improved faster rcnn. arXiv preprint arXiv:1802.02142 (2018)
Zhang, J., Wu, X., Hoi, S.C., Zhu, J.: Feature agglomeration networks for single stage face detection. Neurocomputing 380, 180–189 (2020)
Zhang, S., Zhu, R., Wang, X., Shi, H., Fu, T., Wang, S., Mei, T., Li, S.Z.: Improved selective refinement network for face detection. arXiv preprint arXiv:1901.06651 (2019)
Zhang, Y., Xu, X., Liu, X.: Robust and high performance face detector. arXiv preprint arXiv:1901.02350 (2019)
Zhao, Y., Zhu, H., Chen, X., Luo, F., Li, M., Zhou, J., Chen, S., Pan, Y.: Pose-invariant and occlusion-robust neonatal facial pain assessment. Computers in Biology and Medicine 165, 107462 (2023)
Zhu, C., Tao, R., Luu, K., Savvides, M.: Seeing small faces from robust anchor’s perspective. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. pp. 5127–5136 (2018)
Zhu, H., Zhao, Y., Chen, X., Luo, F., Mei, L., Chen, S., Pan, Y.: Video-based neonatal pain assessment in uncontrolled conditions. IEEE Journal of Biomedical and Health Informatics 28(1), 239–250 (2024)
Zhu, Y., Cai, H., Zhang, S., Wang, C., Xiong, Y.: Tinaface: Strong but simple baseline for face detection. arXiv preprint arXiv:2011.13183 (2020)
Acknowledgements
This work was supported by the National Natural Science Foundation of China (62306272 and 62276237), the National Key Research and Development Program of China (2023YFC3603102), the Pioneer and “Leading Goose” Research and Development Program of Zhejiang (2024C03027).
Author information
Authors and Affiliations
Corresponding authors
Editor information
Editors and Affiliations
Ethics declarations
Disclosure of Interests
The authors have no competing interests to declare.
Rights and permissions
Copyright information
© 2024 The Author(s), under exclusive license to Springer Nature Switzerland AG
About this paper
Cite this paper
Zhao, Y., Zhu, H., Shu, Q., Huan, R., Chen, S., Pan, Y. (2024). Towards a Deeper Insight Into Face Detection in Neonatal Wards. In: Linguraru, M.G., et al. Medical Image Computing and Computer Assisted Intervention – MICCAI 2024. MICCAI 2024. Lecture Notes in Computer Science, vol 15005. Springer, Cham. https://doi.org/10.1007/978-3-031-72086-4_66
Download citation
DOI: https://doi.org/10.1007/978-3-031-72086-4_66
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-72085-7
Online ISBN: 978-3-031-72086-4
eBook Packages: Computer ScienceComputer Science (R0)