Abstract
Plant leaf recognition is a computer vision task used to identify plant species. To address the problem that current plant leaf recognition algorithms have difficulty in recognizing fine-grained leaf classification between classes, this paper proposes a DMSNet (Deep Multi-Scale Network) model, a plant leaf classification algorithm based on multi-scale feature extraction. In order to improve the extraction ability of different fine-grained features of the model, the model is improved on the basis of Multi-scale Backbone Architecture model. In order to achieve better plant leaf classification, a visual attention mechanism module to DMSNet is added and ADMSNet (Attention-based Deep Multi-Scale Network), which makes the model focus more on the plant leaf itself, is proposed, essential features are enhanced, and useless features are suppressed. Experiments on real datasets show that the classification accuracy of the DMSNet model reaches 96.43%. In comparison, the accuracy of ADMSNet with the addition of the attention module reaches 97.39%, and the comparison experiments with ResNet-50, ResNext, Res2Net-50 and Res2Net-101 models on the same dataset show that DMSNet improved the accuracy by 4.6%, 18.57%, 3.72% and 3.84%, respectively. The experimental results confirm that the DMSNet and ADMSNet plant leaf recognition models constructed in this paper can accurately recognize plant leaves and have better performance than the traditional models.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
References
Zoran, D., Chrzanowski, M., Huang, P., et al.: Towards robust image classification using sequential attention models. arXiv: Computer Vision and Pattern Recognition (2019)
Xie, Q., Hovy, E., Luong, M., et al.: Self-training with noisy student improves ImageNet classification. arXiv: Learning (2019)
Bertinetto, L., Muller, R., Tertikas, K., et al.: Making better mistakes: leveraging class hierarchies with deep networks. arXiv: Computer Vision and Pattern Recognition (2019)
Wu, Y., Zhang, K., Wu, D., et al.: Person re-identification by multi-scale feature representation learning with random batch feature mask. IEEE Tran. Cogn. Dev. Syst. (2020)
Chen, Z., Wei, X., Wang, P., et al.: Multi-label image recognition with graph convolutional networks. In: Computer Vision and Pattern Recognition, pp. 5177–5186 (2019)
Wang, G., Wang, K., Lin, L., et al.: Adaptively connected neural networks. In: Computer Vision and Pattern Recognition, pp. 1781–1790 (2019)
Yuan, C., Wu, Y., Qin, X., et al.: An effective image classification method for shallow densely connected convolution networks through squeezing and splitting techniques. Appl. Intell. 49(10), 3570–3586 (2019)
Uzal, L.C., Larese, M.G., et al.: Deep learning for plant identification using vein morphological patterns. Comput. Electron. Agric. 127, 418–424 (2016)
Sun, Y., Liu, Y., et al.: Deep learning for plant identification in natural environment. Comput. Intell. Neurosci. 2017(4), 7361042–7361042 (2017)
Hu, J., Chen, Z., Yang, M., et al.: A multi-scale fusion convolutional neural network for plant leaf recognition. IEEE Sig. Process. Lett. 25, 853–857 (2018)
Bodhwani, V., Acharjya, D.P., Bodhwani, U.: Deep residual networks for plant identification. Procedia Comput. Sci. 152, 186–194 (2019)
He, K., Zhang, X., Ren, S., et al.: Deep residual learning for image recognition. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 770–778(2016)
Gao, S., Cheng, M.M., Zhao, K., et al.: Res2Net: a new multi-scale backbone architecture. IEEE Trans. Pattern Anal. Mach. Intell. 43, 652–662 (2019)
Wang, Q., Wu, B., Zhu, P., et al.: ECA-Net: efficient channel attention for deep convolutional neural networks. In: 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR). IEEE (2020)
Jie, H., Li, S., Gang, S.: Squeeze-and-excitation networks. In: 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR). IEEE (2018(
Xie, S., Girshick, R., Dollár, P., et al.: Aggregated residual transformations for deep neural networks. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (2016)
Acknowledgements
This paper was supported by the National Natural Science Foundation of China (Grant No. 61962006, 61802035, 61772091); the Project of Science Research and Technology Development in Guangxi (Grant No. AA18118047, AD18126015, AB16380272); the BAGUI Scholar Program of Guangxi Zhuang Autonomous Region of China (Grant No. 2016 [21], 2019 [79]); the Natural Science Foundation of Guangxi (Grant No. 2018GXNSFAA138005) and the Sichuan Science and Technology Program (Grant No. 2018JY0448, 2019YFG0106, 2020YJ0481, 2019YFS0067, 2020YFS0466, 2020JDR0164, 2020YJ0430).
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2021 Springer Nature Switzerland AG
About this paper
Cite this paper
Qin, X. et al. (2021). Attention-Based Deep Multi-scale Network for Plant Leaf Recognition. In: Huang, DS., Jo, KH., Li, J., Gribova, V., Bevilacqua, V. (eds) Intelligent Computing Theories and Application. ICIC 2021. Lecture Notes in Computer Science(), vol 12836. Springer, Cham. https://doi.org/10.1007/978-3-030-84522-3_24
Download citation
DOI: https://doi.org/10.1007/978-3-030-84522-3_24
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-84521-6
Online ISBN: 978-3-030-84522-3
eBook Packages: Computer ScienceComputer Science (R0)