Abstract:With breakthroughs in pretrained language models, a large number of finetuned models specialized in distinct domains have surfaced online. Yet, when faced with a fresh dataset covering multiple (sub)domains, their performance might degrade. Reusing these available finetuned models to train a new model is a more feasible solution than the finetuning method that demands extensive manual labeling. Knowledge Amalgamation (KA) is such a model reusing technique, which derives a new model (termed student model) by amalgamating those trained models (termed teacher models) tailored for distinct domains, bypassing the need for manual labeling. However, when the domains of text samples are unknown, selecting a number of appropriate teacher models (simply called a combination) for reuse becomes complicated. To learn an accurate student model, the classical KA method resorts to manual selections, a process both tedious and inefficient. Our study pioneers the automation of this combination selection process for KA in the fundamental text classification task, an area previously unexplored. In this paper, we introduce BoKA : an automatic knowledge amalgamation framework for identifying a combination that can learn a superior student model without human labor. Through the lens of Bayesian optimization, BoKA iteratively samples a subset of possible combinations for amalgamation instead of manual selections. Furthermore, we introduce a novel KA method tailored for text classification, which guides the student model using both soft and pseudo-hard labels from the teacher models when their predictions are closely aligned; in cases of significant disagreement, it uses randomly generated labels. Experiments on two public multi-domain datasets show that BoKA achieves remarkable efficiency by sampling only up to 5.5% of all potential combinations. Moreover, BoKA is capable of matching or even surpassing leading zero-shot large language models, despite having dozens of times fewer parameters.

Improving Text Classification Using Knowledge in Labels

Incorporating Knowledge into Neural Network for Text Representation.

Knowledge-based Document Embedding for Cross-Domain Text Classification

BoKA: Bayesian Optimization Based Knowledge Amalgamation for Multi-unknown-domain Text Classification

Improving semi-supervised text classification by using wikipedia knowledge

Label Distribution Learning-Enhanced Dual-KNN for Text Classification

Text Classification Based on Knowledge Graphs and Improved Attention Mechanism

Improve Text Classification Accuracy with Intent Information

Enhancing Hierarchical Text Classification through Knowledge Graph Integration

Robustly Leveraging Prior Knowledge in Text Classification

Automatic tagging of knowledge points for K12 math problems

Adaptive micro- and macro-knowledge incorporation for hierarchical text classification

Knowledgeable Salient Span Mask for Enhancing Language Models As Knowledge Base

KeNet:Knowledge-enhanced Doc-Label Attention Network for Multi-label text classification

KMTLabeler: an Interactive Knowledge-Assisted Labeling Tool for Medical Text Classification

Deep learning-based text knowledge classification for whole-process engineering consulting standards

On the Value of Head Labels in Multi-Label Text Classification

A Knowledge Point Labeling Method by Introduce Knowledge Points Labels Information

Classifying Math KCs via Task-Adaptive Pre-Trained BERT

Research of Weibo Text Classification Based on Knowledge Distillation and Joint Model