Self-QA: Unsupervised Knowledge Guided Language Model Alignment

Xuanyu Zhang,Qing Yang
2023-05-20
Abstract:Large-scale language models like ChatGPT and GPT-4 have gained attention for their impressive conversational and generative capabilities. However, the creation of supervised paired question-answering data for instruction tuning presents formidable challenges. This endeavor necessitates substantial human effort for data annotation and wrestles with issues concerning data quality, diversity, accuracy, and other related factors. To overcome these obstacles, we introduce an innovative framework named Self-QA, which replaces the traditional practice of human-written instruction seeds with a vast amount of unsupervised knowledge, enabling the model to generate a larger quantity of correct and domain-specific instruction data. The effectiveness of our proposed method is demonstrated through experiments conducted on unsupervised corpora from various domains.
Computation and Language
What problem does this paper attempt to address?
The problem that this paper attempts to solve is the challenges faced in generating data for instruction tuning. Specifically, constructing Supervised Fine - Tuning (SFT) data requires a great deal of human effort to label the data, and there are many problems in terms of the quality, diversity, and accuracy of these data. These problems limit the development of the technology. To solve these problems, the author proposes a new framework named SELF - QA. This framework can generate more correct and domain - specific instruction data from unsupervised knowledge, thereby reducing the dependence on manual annotation and improving the diversity and accuracy of the data. Through experimental verification, SELF - QA has demonstrated its effectiveness on unsupervised corpora in multiple domains and can generate diverse, correct, and domain - specific instruction data, which is of great significance for enhancing the understanding and generation abilities of language models.