Multilevel Large Language Models for Everyone

Yuanhao Gong
2023-07-25
Abstract:Large language models have made significant progress in the past few years. However, they are either generic {\it or} field specific, splitting the community into different groups. In this paper, we unify these large language models into a larger map, where the generic {\it and} specific models are linked together and can improve each other, based on the user personal input and information from the internet. The idea of linking several large language models together is inspired by the functionality of human brain. The specific regions on the brain cortex are specific for certain low level functionality. And these regions can jointly work together to achieve more complex high level functionality. Such behavior on human brain cortex sheds the light to design the multilevel large language models that contain global level, field level and user level models. The user level models run on local machines to achieve efficient response and protect the user's privacy. Such multilevel models reduce some redundancy and perform better than the single level models. The proposed multilevel idea can be applied in various applications, such as natural language processing, computer vision tasks, professional assistant, business and healthcare.
Computer Vision and Pattern Recognition,Artificial Intelligence,Computational Engineering, Finance, and Science,Distributed, Parallel, and Cluster Computing,General Economics
What problem does this paper attempt to address?
The main problems that this paper attempts to solve are the limitations of existing large - language models (LLMs), specifically including: 1. **High consumption of computing resources**: Traditional large - language models require a large amount of computing resources (such as thousands of GPUs) for training and deployment, which is a huge obstacle for many researchers and organizations. 2. **Model staticness**: Existing large - language models are static and cannot be dynamically adjusted and evolved according to user input, thus limiting their performance improvement in specific fields. 3. **Privacy issues**: Since these models require a large amount of data for training, they may capture sensitive information, leading to the risk of user privacy leakage. 4. **Lack of domain - specificity**: Although general - purpose large - language models are applicable to a wide range of fields, their performance in specific fields is not excellent. To solve these problems, the paper proposes a multilevel large - scale language model framework, aiming to improve existing models in the following ways: - **Global Level**: It contains models with a very large number of parameters. These models are trained based on all available data sets, have extensive knowledge but are not focused on any specific domain. They are mainly used to update and improve domain - specific models. - **Field Level**: Models optimized for specific domains (such as poetry, medical diagnosis, programming, etc.). These models obtain knowledge from the global model and are further fine - tuned to improve performance in specific domains. - **User Level**: Small - scale models running on local machines can be customized according to users' personal preferences, protecting user privacy while providing efficient responses. In addition, the paper also proposes an economic ecosystem. Through blockchain technology, distributed computing is realized, the threshold for developing and using large - language models is lowered, and positive interactions between users and developers are encouraged, forming a sustainable development ecosystem. In conclusion, the goal of this paper is to solve the deficiencies of existing large - language models in terms of computing cost, dynamic adaptability, privacy protection, and domain - specificity by constructing a multilevel language model framework, thereby promoting the development of natural - language processing technology.