Abstract
This modern world, driven by technological advancement and inventions, is moving toward automation of processes, technology, manufacturing, etc. We have self-driving cars, and robots are used in production and so on. Computer vision has a huge role to play in it. With the advancement in computational intelligence, it has been possible to create complex algorithms and tools which have brought a revolution in the field of computer vision. In this chapter, we are going to look at the most recent developments in the field of computer vision. Computer vision itself is a vast field, so we will be sticking to object detection, which is one of the main tasks of computer vision. It also forms the base for further developments in this field. The paper looks back at the inspiration that led to the idea of computer vision, its history, and how it has developed over the years; these are covered in brief in the chapter. Various algorithms form the majority of the portions of the section. Starting from the basics, the paper slowly and gradually builds the base and move on to understand more complex recent algorithms that are being used in real-life applications. These help build the understanding, which can be applied for research as shown with a case study. Finally, the paper looks at some of the latest innovations in the field of computer vision, those that have been developed based on these concepts.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
References
P. Viola, M. Jones, Rapid object detection using a boosted cascade of simple features, in IEEE Conf Comput Vis Pattern Recognit, vol. 1, (2001), pp. I–511. https://doi.org/10.1109/CVPR.2001.990517
K. O’Shea, R. Nash, An Introduction to Convolutional Neural Networks. (2015). ArXiv e-prints.
S. Albawi, T.A. Mohammed, S. Al-Zawi, Understanding of a convolutional neural network, in 2017 International Conference on Engineering and Technology (ICET), (Antalya, 2017), pp. 1–6. https://doi.org/10.1109/ICEngTechnol.2017.8308186
N. Srivastava, Improving neural networks with dropout. Ph.D. thesis, University of Toronto (2013)
M.D. Zeiler, R. Fergus, Stochastic Pooling for Regularization of Deep Convolutional Neural Networks. arXiv preprint arXiv:1301.3557 (2013)
Z. Sun, M. Ozay, T. Okatani, Design of Kernels in Convolutional Neural Networks for Image Classification. (2015)
G. Zoumpourlis, A. Doumanoglou, N. Vretos, P. Daras, Non-linear Convolution Filters for CNN-Based Learning (2017). https://doi.org/10.1109/ICCV.2017.510
A. Khan, A. Sohail, U. Zahoora, A. Saeed, A Survey of the Recent Architectures of Deep Convolutional Neural Networks. Artif. Intell. Rev. (2019). https://doi.org/10.1007/s10462-020-09825-6
J. Long, E. Shelhamer, T. Darrell, Fully convolutional networks for semantic segmentation, in 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), (Boston, MA, 2015), pp. 3431–3440. https://doi.org/10.1109/CVPR.2015.7298965
A. Voulodimos, N. Doulamis, A. Doulamis, E. Protopapadakis, Deep learning for computer vision: A brief review. Comput. Intell. Neurosci., 1–13 (2018, 2018). https://doi.org/10.1155/2018/7068349
R. Girshick, J. Donahue, T. Darrell, J. Malik, Rich feature hierarchies for accurate object detection and semantic segmentation, in Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, (2013). https://doi.org/10.1109/CVPR.2014.81
R. Girshick, Fast R-CNN, in 2015 IEEE International Conference on Computer Vision (ICCV), (Santiago, 2015), pp. 1440–1448. https://doi.org/10.1109/ICCV.2015.169
S. Ren, K. He, R. Girshick, J. Sun, Faster R-CNN: towards real-time object detection with region proposal networks. IEEE Trans. Pattern Anal. Mach. Intell. 39 (2015). https://doi.org/10.1109/TPAMI.2016.2577031
J. Redmon, S. Divvala, R. Girshick, A. Farhadi, You only look once: Unified, real-time object detection, in 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), (Las Vegas, NV, 2016), pp. 779–788. https://doi.org/10.1109/CVPR.2016.91
J. Redmon, A. Farhadi, YOLO9000: Better, Faster, Stronger. (2016)
J. Redmon, A. Farhadi, YOLOv3: An Incremental Improvement. (2018)
A. Bochkovskiy, C.-Y. Wang, H.-Y. Liao, YOLOv4: Optimal Speed and Accuracy of Object Detection. (2020)
W. Liu, D. Anguelov, D. Erhan, C. Szegedy, S. Reed, C.-Y. Fu, A. Berg, SSD: Single Shot MultiBox Detector, vol 9905 (2016), pp. 21–37. https://doi.org/10.1007/978-3-319-46448-0_2
T. Lin, P. Dollár, R. Girshick, K. He, B. Hariharan, S. Belongie, Feature pyramid networks for object detection, in 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), (Honolulu, HI, 2017), pp. 936–944. https://doi.org/10.1109/CVPR.2017.106
T. Lin, P. Goyal, R. Girshick, K. He, P. Dollár, Focal loss for dense object detection, in IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 42, (2020), pp. 318–327. https://doi.org/10.1109/TPAMI.2018.2858826
C. Kuo, Understanding Convolutional Neural Networks with A Mathematical Model. J. Vis. Commun. Image Repr. 41 (2016). https://doi.org/10.1016/j.jvcir.2016.11.003
K. Bhatia, S. Arora, R. Tomar, Diagnosis of diabetic retinopathy using machine learning classification algorithm, in 2016 2nd International Conference on Next Generation Computing Technologies (NGCT), (Dehradun, India, 2016), pp. 347–351. https://doi.org/10.1109/NGCT.2016.7877439
R. Tomar, A. Khanna, A. Bansal, V. Fore, An architectural view towards autonomic cloud computing, in Data Engineering and Intelligent Computing. Advances in Intelligent Systems and Computing, ed. by S. Satapathy, V. Bhateja, K. Raju, B. Janakiramaiah, vol. 542, (Springer, Singapore, 2018). https://doi.org/10.1007/978-981-10-3223-3_55
Acknowledgment
The research work mentioned in the case study was carried out at IIIT Allahabad, by Sourabh Prakash under the supervision of Dr. Satish Kumar Singh and mentorship of Albert Mundu.
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2022 Springer Nature Switzerland AG
About this chapter
Cite this chapter
Prakash, S., Shukla, A.N., Yadav, A.K. (2022). Insights to Computational Intelligence Techniques for Computer Vision. In: Tomar, R., Hina, M.D., Zitouni, R., Ramdane-Cherif, A. (eds) Innovative Trends in Computational Intelligence. EAI/Springer Innovations in Communication and Computing. Springer, Cham. https://doi.org/10.1007/978-3-030-78284-9_3
Download citation
DOI: https://doi.org/10.1007/978-3-030-78284-9_3
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-78283-2
Online ISBN: 978-3-030-78284-9
eBook Packages: Intelligent Technologies and RoboticsIntelligent Technologies and Robotics (R0)