BERTogether: Multitask Ensembling with Hyperparameter Optimization

Stanford CS224N Default Project,•. Mentor,Timothy Dai,Ivan Liongson,Erik Luna
Abstract:This paper presents BERTogether, a novel approach to multitask learning that leverages the strengths of minBERT, an efficient BERT-based model, optimized for high performance across three NLP tasks: Sentiment Classification, Paraphrase Detection, and Semantic Similarity Evaluation. We introduce a weighted-average 3-BERT ensemble method for each task coupled with hyperparameter tuning to enhance performance. Our methodology outperforms baseline models by using SMART regularization techniques and adding an extra dataset for Similarity Evaluation. We demonstrate significant improvements in accuracy and correlation metrics via weighted ensembling.
Computer Science
What problem does this paper attempt to address?