FastLearn: A Rapid Learning Agent for Chat Models to Acquire Latest Knowledge
Chenhan Fu,Guoming Wang,Rongxing Lu,Siliang Tang
DOI: https://doi.org/10.1109/mipr62202.2024.00035
2024-01-01
Abstract:Most large language models (LLMs), especially chat models, do not undergo updates post-deployment, which results in a lack of awareness of constantly changing new real-time information. External knowledge retrieval methods exist to enhance these models, but for high-traffic chat models, the extensive use of search engine API resources can be costly. In our work, we study the performance of LLMs in answering tasks involving real-time factual knowledge. Due to the lack of datasets for Q&A involving the latest real-time information, we first carefully constructed NewlyQA, a new dynamic QA benchmark including questions about world knowledge whose answers evolve over time. Inspired by human learning methods, we propose FastLearn, a simple rapid learning method that enhances model responses by merging relevant, up-to-date information retrieved from search engine APIs (filtered through our Time-Prompt mechanism) into prompts, followed by Lora fine-tuning to “inject” new knowledge into LLMs via external parameters, significantly improving performance on FreshQA, NewlyQA and OpenDataEval. Our experiments show that FastLearn outperforms most other search engine augmented prompt methods, like FreshPrompt, Self-Ask, and the commercial PERPLEXITY.AI. Additionally, we further analyze the two main mechanisms of FastLearn, Time-Prompt and Self-Correction, finding that both play key roles in enhancing model answer performance. To facilitate future research, we will open-source NewlyQA and commit to regularly updating its answers.