ChatGPT Alternative Solutions: Large Language Models Survey

Hanieh Alipour,Nick Pendar,Kohinoor Roy
DOI: https://doi.org/10.5121/csit.2024.140514
2024-03-21
Abstract:In recent times, the grandeur of Large Language Models (LLMs) has not only shone in the realm of natural language processing but has also cast its brilliance across a vast array of applications. This remarkable display of LLM capabilities has ignited a surge in research contributions within this domain, spanning a diverse spectrum of topics. These contributions encompass advancements in neural network architecture, context length enhancements, model alignment, training datasets, benchmarking, efficiency improvements, and more. Recent years have witnessed a dynamic synergy between academia and industry, propelling the field of LLM research to new heights. A notable milestone in this journey is the introduction of ChatGPT, a powerful AI chatbot grounded in LLMs, which has garnered widespread societal attention. The evolving technology of LLMs has begun to reshape the landscape of the entire AI community, promising a revolutionary shift in the way we create and employ AI algorithms. Given this swift-paced technical evolution, our survey embarks on a journey to encapsulate the recent strides made in the world of LLMs. Through an exploration of the background, key discoveries, and prevailing methodologies, we offer an up-to-the-minute review of the literature. By examining multiple LLM models, our paper not only presents a comprehensive overview but also charts a course that identifies existing challenges and points toward potential future research trajectories. This survey furnishes a well-rounded perspective on the current state of generative AI, shedding light on opportunities for further exploration, enhancement, and innovation.
Computation and Language,Artificial Intelligence
What problem does this paper attempt to address?
This paper aims to provide a comprehensive review of recent advances in Large Language Models (LLMs), particularly alternative solutions to ChatGPT. It explores the role of LLMs in natural language processing, training methods, existing challenges, and future research directions. The paper analyzes the characteristics of various LLMs, compares their performance, and identifies limitations and areas for improvement in existing models. Additionally, the paper discusses applications in fields such as education, information retrieval, and others, as well as engineering approaches like zero-shot learning and chain thinking. Finally, the paper introduces OpenAI's ChatGPT and its features, and evaluates other alternative solutions such as OpenAssistance and LLaMA.