Data Augmentation for Image Classification using Generative AI

Rahat, Fazle; Hossain, M Shifat; Ahmed, Md Rubel; Jha, Sumit Kumar; Ewetz, Rickard

Computer Science > Computer Vision and Pattern Recognition

arXiv:2409.00547 (cs)

[Submitted on 31 Aug 2024]

Title:Data Augmentation for Image Classification using Generative AI

Authors:Fazle Rahat, M Shifat Hossain, Md Rubel Ahmed, Sumit Kumar Jha, Rickard Ewetz

View PDF HTML (experimental)

Abstract:Scaling laws dictate that the performance of AI models is proportional to the amount of available data. Data augmentation is a promising solution to expanding the dataset size. Traditional approaches focused on augmentation using rotation, translation, and resizing. Recent approaches use generative AI models to improve dataset diversity. However, the generative methods struggle with issues such as subject corruption and the introduction of irrelevant artifacts. In this paper, we propose the Automated Generative Data Augmentation (AGA). The framework combines the utility of large language models (LLMs), diffusion models, and segmentation models to augment data. AGA preserves foreground authenticity while ensuring background diversity. Specific contributions include: i) segment and superclass based object extraction, ii) prompt diversity with combinatorial complexity using prompt decomposition, and iii) affine subject manipulation. We evaluate AGA against state-of-the-art (SOTA) techniques on three representative datasets, ImageNet, CUB, and iWildCam. The experimental evaluation demonstrates an accuracy improvement of 15.6% and 23.5% for in and out-of-distribution data compared to baseline models, respectively. There is also a 64.3% improvement in SIC score compared to the baselines.

Comments:	19 pages, 15 figures, 4 tables
Subjects:	Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
ACM classes:	I.2.10; I.5.1
Cite as:	arXiv:2409.00547 [cs.CV]
	(or arXiv:2409.00547v1 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2409.00547

Submission history

From: Sumit Kumar Jha [view email]
[v1] Sat, 31 Aug 2024 21:16:43 UTC (6,607 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:Data Augmentation for Image Classification using Generative AI

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:Data Augmentation for Image Classification using Generative AI

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators