Skip to main navigation Skip to search Skip to main content

ReSmooth: Detecting and Utilizing OOD Samples When Training With Data Augmentation

  • School of Computer Science and Technology, Harbin Institute of Technology
  • Peng Cheng Laboratory

Research output: Contribution to journalArticlepeer-review

Abstract

Data augmentation (DA) is a widely used technique for enhancing the training of deep neural networks. Recent DA techniques which achieve state-of-the-art performance always meet the need for diversity in augmented training samples. However, an augmentation strategy that has a high diversity usually introduces out-of-distribution (OOD) augmented samples and these samples consequently impair the performance. To alleviate this issue, we propose ReSmooth, a framework that first detects OOD samples in augmented samples and then leverages them. To be specific, we first use a Gaussian mixture model (GMM) to fit the loss distribution of both the original and augmented samples and accordingly split these samples into in-distribution (ID) samples and OOD samples. Then we start a new training where ID and OOD samples are incorporated with different smooth labels. By treating ID samples and OOD samples unequally, we can make better use of the diverse augmented data. Furthermore, we incorporate our ReSmooth framework with negative DA (NDA) strategies. By properly handling their intentionally created OOD samples, the classification performance of NDAs is largely ameliorated. Experiments on several classification benchmarks show that ReSmooth can be easily extended to the existing augmentation strategies [such as RandAugment (RA), rotate, and jigsaw] and improve on them. Our code is available at https://github.com/Chenyang4/ReSmooth.

Original languageEnglish
Pages (from-to)7899-7910
Number of pages12
JournalIEEE Transactions on Neural Networks and Learning Systems
Volume35
Issue number6
DOIs
StatePublished - 1 Jun 2024
Externally publishedYes

Keywords

  • Data augmentation (DA)
  • out-of-distribution (OOD) detection
  • sample selection
  • visual recognition

Fingerprint

Dive into the research topics of 'ReSmooth: Detecting and Utilizing OOD Samples When Training With Data Augmentation'. Together they form a unique fingerprint.

Cite this