AUTHOR=Roda Hezi , Geva Amir B. 

TITLE=Semi-supervised active learning using convolutional auto- encoder and contrastive learning

JOURNAL=Frontiers in Artificial Intelligence

VOLUME=Volume 7 - 2024

YEAR=2024

URL=https://www.frontiersin.org/journals/artificial-intelligence/articles/10.3389/frai.2024.1398844

DOI=10.3389/frai.2024.1398844

ISSN=2624-8212

ABSTRACT=Active learning is a field of machine learning that seeks to find the most efficient labels to annotate with a given budget, particularly in cases where obtaining labeled data is expensive or infeasible. This is becoming increasingly important with the growing success of learning-based methods, which often require large amounts of labeled data. Computer vision is one area where active learning has shown promise in tasks such as image classification, semantic segmentation, and object detection. In this research, we propose a pool-based semi-supervised active learning method for image classification that takes advantage of both labeled and unlabeled data. Many active learning approaches do not utilize unlabeled data, but we believe that incorporating these data can improve performance. To address this issue, our method involves several steps. First, we cluster the latent space of a pre-trained convolutional autoencoder. Then, we use a proposed clustering contrastive loss to strengthen the latent space's clustering while using a small amount of labeled data. Finally, we query the samples with the highest uncertainty to annotate with an oracle. We repeat this process until the end of the given budget. Our method is effective when the number of annotated samples is small, and we have validated its effectiveness through experiments on benchmark datasets. Our empirical results demonstrate the power of our method for image classification tasks in accuracy terms.