Abstract:Multi-label text classification (MLTC) is a technique to categorize texts into more than a single category and used extensively in various real-life problems. Such classifications problems are challenging and dependent on many factors and changes according to the problem. Movie genre classification is a popular multi-label text classification problem as movies may belong to multiple genres at the same time. The major factors used for movie genre classification are based on parameters like movie plot, title, summary, and subtitles. In recent years, some neural networks based approaches are proposed for solving such problems, which turns the solution into resource intensive and time consuming activities. In this paper, we propose a novel method of movie genre classification using a combination of problem transformation techniques, namely binary relevance (BR) and label powerset (LP), text vectorizers and machine learning classifier models. We perform binary relevance task (BR) that converts multi-label classification tasks into independent binary classification tasks whereas label powerset transforms a multi-label problem into a multiclass problem with one multiclass classifier trained on all unique label combinations found in the training data. Further, we apply text vectorizers namely, CV (Count Vectorizer) and TF-IDF (Term Frequency - Inverse Document Frequency) to tokenize the textual data to build a word vocabulary followed by employing various classifiers i.e., Logistic Regression (LR), Multinomial Naive Bayes (MNB), K-Nearest Neighbor (KNN), Support Vector Classifier (SVC) with the combination of different vectorizers and problem transformation methods. To test the effectiveness of these combinations, we use the k-fold cross-validation technique. We construct different combination using problem transformation approaches, text vectorizers and classifier models leading to overall 16 different combinations for classifying movies into appropriate genres. Finally, we evaluate the performance of each combination on publicly available IMDb datasets with target on 27 major parent genres using different performance measures and reveal that the best result is obtained using the combination comprising of label powerset (LP) as Problem transformation approach, TF-IDF as the text vectorizer and support vector classifier (SVC) as the machine learning classifier model with a commendable accuracy of 0.95 and F1-score of 0.86.

A multimodal approach for multi-label movie genre classification

Exploring Textual Features for Multi-label Classification of Portuguese Film Synopses

Movie genre classification using binary relevance, label powerset, and machine learning classifiers

Rethinking movie genre classification with fine-grained semantic clustering

Multilevel profiling of situation and dialogue-based deep networks for movie genre classification using movie trailers

Movie Trailer Genre Classification Using Multimodal Pretrained Features

Moviescope: Large-scale Analysis of Movies using Multiple Modalities

Movie Genre Classification by Language Augmentation and Shot Sampling

Multi-label Movie Genre Detection from a Movie Poster Using Knowledge Transfer Learning

Demystifying Visual Features of Movie Posters for Multi-Label Genre Identification

Exploration of Speech and Music Information for Movie Genre Classification

Look and Listen: A Multi-modality Late Fusion Approach to Scene Classification for Autonomous Machines

Statistical and Visual Analysis of Audio, Text, and Image Features for Multi-Modal Music Genre Recognition

Unraveling Movie Genres through Cross-Attention Fusion of Bi-Modal Synergy of Poster

A unified framework of deep networks for genre classification using movie trailer

Deep multi-modal networks for book genre classification based on its cover

A movie genre prediction based on Multivariate Bernoulli model and genre correlations

Cross-Modality Clustering-based Self-Labeling for Multimodal Data Classification

Improving Music Genre Classification from Multi-Modal Properties of Music and Genre Correlations Perspective

A multinomial probabilistic model for movie genre predictions

A Case Study of Deep Learning Based Multi-Modal Methods for Predicting the Age-Suitability Rating of Movie Trailers