Abstract:Diseases cause crop yield reduction and quality decline, which has a great impact on agricultural production. Plant disease recognition based on computer vision can help farmers quickly and accurately recognize diseases. However, the occurrence of diseases is random and the collection cost is very high. In many cases, the number of disease samples that can be used to train the disease classifier is small. To address this problem, we propose a few-shot disease recognition algorithm that uses supervised contrastive learning. Our algorithm is divided into two phases: supervised contrastive learning and meta-learning. In the first phase, we use a supervised contrastive learning algorithm to train an encoder with strong generalization capabilities using a large number of samples. In the second phase, we treat this encoder as an extractor of plant disease features and adopt the meta-learning training mechanism to accomplish the few-shot disease recognition tasks by training a nearest-centroid classifier based on distance metrics. The experimental results indicate that the proposed method outperforms the other nine popular few-shot learning algorithms as a comparison in the disease recognition accuracy over the public plant disease dataset PlantVillage. In few-shot potato leaf disease recognition tasks in natural scenarios, the accuracy of the model reaches the accuracy of 79.51% with only 30 training images. The experiment also revealed that, in the contrastive learning phase, the combination of different image augmentation operations has a greater impact on model. Furthermore, the introduction of label information in supervised contrastive learning enables our algorithm to still obtain high accuracy in few-shot disease recognition tasks with smaller batch size, thus allowing us to complete the training with less GPU resource compared to traditional contrastive learning.

Cucumber disease recognition with small samples using image-text-label-based multi-modal language model

Crop Disease Identification by Fusing Multiscale Convolution and Vision Transformer.

Few-Shot Image Classification of Crop Diseases Based on Vision–Language Models

Few-shot disease recognition algorithm based on supervised contrastive learning

Multi-label Cluster Discrimination for Visual Representation Learning

Cucumber Disease Image Classification with A Model Combining LBP and VGG-16 Features

Multi-modal Contrastive-Generative Pre-training for Fine-grained Skin Disease Diagnosis.

Data-free Multi-label Image Recognition via LLM-powered Prompt Tuning

Transfer learning for versatile plant disease recognition with limited data

A Vegetable Leaf Disease Identification Model Based on Image-Text Cross-Modal Feature Fusion

XLIP: Cross-modal Attention Masked Modelling for Medical Language-Image Pre-Training

Enhancing Biomedical Multi-modal Representation Learning with Multi-scale Pre-training and Perturbed Report Discrimination

Multimodal One-Shot Learning of Speech and Images

Unified Generative and Discriminative Training for Multi-modal Large Language Models

Vision-Language Pre-Training with Triple Contrastive Learning

EyeCLIP: A visual-language foundation model for multi-modal ophthalmic image analysis

Pre-Trained Vision-Language Models as Partial Annotators

MMCL-CPI: A multi-modal compound-protein interaction prediction model incorporating contrastive learning pre-training

Multimodal Multilabel Classification by CLIP

MLIP: Medical Language-Image Pre-training with Masked Local Representation Learning

A Plant Disease Recognition Method Based on Fusion of Images and Graph Structure Text