Few-Shot and Prompt Training for Text Classification in German Doctor's Letters

Phillip Richter-Pechanski,Philipp Wiesenbach,Dominic M Schwab,Christina Kiriakou,Mingyang He,Nicolas A Geis,Anette Frank,Christoph Dieterich
DOI: https://doi.org/10.3233/SHTI230275
2023-05-18
Abstract:To classify sentences in cardiovascular German doctor's letters into eleven section categories, we used pattern-exploiting training, a prompt-based method for text classification in few-shot learning scenarios (20, 50 and 100 instances per class) using language models with various pre-training approaches evaluated on CARDIO:DE, a freely available German clinical routine corpus. Prompting improves results by 5-28% accuracy compared to traditional methods, reducing manual annotation efforts and computational costs in a clinical setting.
What problem does this paper attempt to address?