Abstract: As disciplines have interacted and cooperated more frequently in recent years, the number of research papers associated with multiple subjects has increased. Consequently, some of the existing literature belongs to a single discipline, while other work may simultaneously involve more than two subjects. In this setting, traditional single-label text classification makes it harder for readers to obtain comprehensive, cutting-edge research papers. It is therefore important to classify research papers with multiple labels effectively. This paper tests the performance of multi-label learning on text data obtained from the Kaggle website. First, lemmatization and Term Frequency-Inverse Document Frequency (TF-IDF) are used for feature extraction in the pre-processing stage: the critical information in the text is statistically analysed, and the text is converted into a numerical, high-dimensional vector space. Because traditional single-label classification algorithms are not suited to this problem, the paper adopts the Multi-Label K-Nearest Neighbour (ML-KNN) algorithm for classification. The experimental results show that ML-KNN achieves better results on multi-label text classification than a traditional multi-label algorithm, which demonstrates the effectiveness of ML-KNN for predicting the subjects of text data with multiple labels. Finally, the work in this paper is analysed and summarised.
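
The pipeline described in the abstract (TF-IDF features followed by ML-KNN) can be illustrated with a minimal, standard-library-only sketch. It is not the authors' implementation. The toy documents, the two labels, and the settings `k=3` and `s=1.0` are invented for illustration; lemmatization is replaced by plain lowercase whitespace tokenisation; and neighbours are ranked by cosine similarity on L2-normalised vectors, which gives the same ordering as Euclidean distance on those vectors. The ML-KNN part follows the usual formulation: smoothed label priors, neighbour-count likelihoods estimated on the training set, and a maximum a posteriori decision per label.

```python
import math
from collections import Counter

def tfidf_vector(tokens, idf):
    # Term counts weighted by IDF, then L2-normalised (sparse dict).
    counts = Counter(t for t in tokens if t in idf)
    vec = {t: c * idf[t] for t, c in counts.items()}
    norm = math.sqrt(sum(w * w for w in vec.values())) or 1.0
    return {t: w / norm for t, w in vec.items()}

def tfidf_fit(docs):
    # Smoothed IDF: log((1 + N) / (1 + df)) + 1. No lemmatization here (assumption).
    tokenised = [d.lower().split() for d in docs]
    df = Counter(t for toks in tokenised for t in set(toks))
    n = len(docs)
    idf = {t: math.log((1 + n) / (1 + d)) + 1.0 for t, d in df.items()}
    return idf, [tfidf_vector(toks, idf) for toks in tokenised]

def cosine(a, b):
    # Dot product of two unit-length sparse vectors.
    return sum(w * b.get(t, 0.0) for t, w in a.items())

def neighbours(vec, train_vecs, k, skip=None):
    # Indices of the k most similar training vectors (ties broken by index).
    scored = [(cosine(vec, v), i) for i, v in enumerate(train_vecs) if i != skip]
    scored.sort(key=lambda p: (-p[0], p[1]))
    return [i for _, i in scored[:k]]

def mlknn_fit(train_vecs, Y, k=3, s=1.0):
    m, n_labels = len(Y), len(Y[0])
    # Smoothed prior P(label l is present).
    prior = [(s + sum(y[l] for y in Y)) / (2 * s + m) for l in range(n_labels)]
    # c1[l][j]: training items WITH label l that have j neighbours carrying l.
    # c0[l][j]: the same count for training items WITHOUT label l.
    c1 = [[0] * (k + 1) for _ in range(n_labels)]
    c0 = [[0] * (k + 1) for _ in range(n_labels)]
    for i, vec in enumerate(train_vecs):
        nbrs = neighbours(vec, train_vecs, k, skip=i)
        for l in range(n_labels):
            count = sum(Y[j][l] for j in nbrs)
            (c1 if Y[i][l] else c0)[l][count] += 1
    return {"k": k, "s": s, "prior": prior, "c1": c1, "c0": c0,
            "X": train_vecs, "Y": Y}

def mlknn_predict(model, vec):
    k, s = model["k"], model["s"]
    nbrs = neighbours(vec, model["X"], k)
    out = []
    for l, p1 in enumerate(model["prior"]):
        count = sum(model["Y"][j][l] for j in nbrs)
        like1 = (s + model["c1"][l][count]) / (s * (k + 1) + sum(model["c1"][l]))
        like0 = (s + model["c0"][l][count]) / (s * (k + 1) + sum(model["c0"][l]))
        # MAP rule: assign the label if its posterior beats the alternative.
        out.append(1 if p1 * like1 > (1 - p1) * like0 else 0)
    return out

# Hypothetical toy corpus; label columns are [computer science, mathematics].
docs = [
    "neural network training for image recognition",
    "deep neural network optimisation",
    "proof of a theorem in algebraic topology",
    "theorem about prime numbers proof",
    "convergence proof for neural network training",
    "statistical learning theorem for neural network",
]
Y = [[1, 0], [1, 0], [0, 1], [0, 1], [1, 1], [1, 1]]

idf, vecs = tfidf_fit(docs)
model = mlknn_fit(vecs, Y, k=3, s=1.0)
pred = mlknn_predict(model, tfidf_vector("neural network proof".split(), idf))
print(pred)
```

On a real corpus, a lemmatizer and a tuned `k` would replace the simplifications above, and the brute-force neighbour search would need an index to scale.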

Multi-label text categorization using k-nearest neighbor approach with m-similarity

A Bayesian Network nearest k-labels method for Multi-label classification

An kNN Algorithm Based on Vector Angle for Multi-label Text Categorization

RW.KNN: a proposed random walk KNN algorithm for multi-label classification.

Label Distribution Learning-Enhanced Dual-KNN for Text Classification

An improved ML-kNN approach for multi-label text categorization

Learning Semantic Similarity For Multi-Label Text Categorization

ML-KNN: A lazy learning approach to multi-label learning

Contrastive Learning-Enhanced Nearest Neighbor Mechanism for Multi-Label Text Classification.

Multi-label Text Categorization with Joint Learning Predictions-as-Features Method

A K-Nearest Neighbor Based Algorithm for Multi-Label Classification.

Multi-Label Classification of Research Papers Using Multi-Label K-Nearest Neighbour Algorithm

Fast text categorization based on collaborative work in the semantic and class spaces

A modular k-nearest neighbor classification method for massively parallel text categorization

Multi-label Text Categorization with Hidden Components.

A Debiased Nearest Neighbors Framework for Multi-Label Text Classification

A Label Information Aware Model for Multi-label Text Classification

A multi‐label social short text classification method based on contrastive learning and improved ml‐KNN

A Comparative Study on Two Large-Scale Hierarchical Text Classification Tasks' Solutions

A New multi-instance multi-label learning approach for image and text classification

An Empirical Comparison of Min–max-Modular K-NN with Different Voting Methods to Large-Scale Text Categorization