YOLOv8-Coal: a coal-rock image recognition method based on improved YOLOv8

View article
PeerJ Computer Science

Main article text

 

Introduction

  • (1) Deformable Convolutional Networks Version 3 (DCNv3) has been introduced to enhance its ability to adjust to the specific features of an image by incorporating dynamically deformable convolution kernels. This allows for better representation of irregular shapes and intricate textures in coal-rock images, ultimately leading to improved model accuracy.

  • (2) Polarized Self-Attention (PSA) is incorporated to pinpoint and emphasize crucial aspects of the features while diminishing attention towards less significant information. This allows the model to concentrate on the essential elements in the image, thereby enhancing its performance.

  • (3) The GhostNet concept is integrated into the C2f module to create the C2fGhost module, which enhances the model’s performance, reduces its need for computer resources, and optimizes its computational efficiency and response time.

Materials and Methods

Overview of the YOLOv8 algorithm

YOLOv8-Coal algorithm

DCNv3 module

PSA module

C2fGhost module

Experiments, results and discussion

Experiment

Coal-rock dataset

Experimental environment and parameter configuration

Evaluation metrics

Results and discussion

Ablation experiments

Comparison experiments

Visualization of experimental results

Conclusions

Supplemental Information

Literature table for coal-rock image recognition.

DOI: 10.7717/peerj-cs.2313/supp-1

Additional Information and Declarations

Competing Interests

The authors declare that they have no competing interests.

Author Contributions

Wenyu Wang conceived and designed the experiments, performed the experiments, analyzed the data, performed the computation work, prepared figures and/or tables, authored or reviewed drafts of the article, and approved the final draft.

Yanqin Zhao conceived and designed the experiments, analyzed the data, authored or reviewed drafts of the article, and approved the final draft.

Zhi Xue performed the experiments, prepared figures and/or tables, and approved the final draft.

Data Availability

The following information was supplied regarding data availability:

The literature table for coal-rock image recognition is available in the Supplemental File.

The code of YOLOv8-Coal is available at Zenodo: Wang, W. (2024). Jason-2k/YOLOv8-Coal: V 1.1 (release1.1). Zenodo. https://doi.org/10.5281/zenodo.12582996.

The model weights are available at Zenodo: Wang, W. (2024). The model weights file of YOLOv8-Coal. Zenodo. https://doi.org/10.5281/zenodo.10728156.

The code of YOLOv5-Coal and YOLOv10-Coal are available at Zenodo:

Wang, W., & Xue, Z. (2024). Jason-2k/YOLOv8-Coal-Comparison-experiments: V 1.0 (release). Zenodo. https://doi.org/10.5281/zenodo.12704209

The original dataset is available at Zenodo: Wang, W. (2024). Coal and Rock [Data set]. Zenodo. https://doi.org/10.5281/zenodo.10702704

The processed dataset is available at Zenodo: Wang, W. (2024). Coal and Rock after processing in YOLO format [Data set]. Zenodo. https://doi.org/10.5281/zenodo.10702879.

Funding

This work was supported by the Basic Research Operating Costs of Undergraduate Colleges and Universities in Heilongjiang Province Project (No.2022-KYYWF-0565) and the Heilongjiang University of Science and Technology 2024 College Students’ Innovation and Entrepreneurship Training Program Project. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.

686 Visitors 559 Views 41 Downloads