Abstract:Learning from data with long-tailed and open-ended distributions is highly challenging. In this work, we propose OLPR , which is a new dual-stream O pen-set L ong-tailed recognition framework based on orthogonal P rototype learning and false R ejection correction. It consists of a Probabilistic Prediction Learning (PPL) branch and a Distance Metric Learning (DML) branch. The former is used to generate prediction probability for image classification. The latter learns orthogonal prototypes for each class by computing three distance losses, which are the orthogonal prototype loss among all the prototypes, the balanced Softmin distance based cross-entropy loss between each prototype and its corresponding input sample, and the adversarial loss for making the open-set space more compact. Furthermore, for open-set learning, instead of merely relying on binary decisions, we propose an Iterative Clustering Module (ICM) to categorize similar open-set samples and correct the false rejected closed-set samples simultaneously. If a sample is detected as a false rejection, i.e., a sample of the known classes is incorrectly identified as belonging to the unknown classes, we will re-classify the sample to the closest known/closed-set class. We conduct extensive experiments on ImageNet-LT, Places-LT, CIFAR-10/100-LT benchmark datasets, as well as a new long-tailed open-ended dataset that we build. Experimental results demonstrate that OLPR improves over the best competitors by up to 2.2% in terms of overall classification accuracy in closed-set settings, and up to 4% in terms of F-measure in open-set settings, which are very remarkable.

Modernizing Open-Set Speech Language Identification

Robust Open-Set Spoken Language Identification and the CU MultiLang Dataset

Open Set Audio Classification Using Autoencoders Trained on Few Data

Enhancing Open-Set Speaker Identification through Rapid Tuning with Speaker Reciprocal Points and Negative Sample

An Open-Set Recognition Approach for SAR Targets Using Only Classification Scores

Open-Set Recognition in the Age of Vision-Language Models

Open-Set Biometrics: Beyond Good Closed-Set Models

OpenSR: Open-Modality Speech Recognition Via Maintaining Multi-Modality Alignment.

Contrastive Open Set Recognition

Toward Open-Set Face Recognition

Open-Set Interference Signal Recognition Using Boundary Samples: A Hybrid Approach

OpenSD: Unified Open-Vocabulary Segmentation and Detection

Open set classification of sound event

Towards open-set text recognition via label-to-prototype learning

Open-set long-tailed recognition via orthogonal prototype learning and false rejection correction

An open-source voice type classifier for child-centered daylong recordings

Is Attention always needed? A Case Study on Language Identification from Speech

An Investigation into Using Parallel Data for Far-Field Speech Recognition.

Deep Open Set Identification for RF Devices

CNN-Based End-To-End Language Identification

Look, Listen and Learn - A Multimodal LSTM for Speaker Identification