Towards Optimizing the Costs of LLM Usage

Shivanshu Shekhar, Tanishq Dubey, Koyel Mukherjee, Apoorv Saxena, Atharv Tyagi, Nishanth Kotla
2024-01-30
Abstract: Generative AI, and LLMs in particular, are now heavily used for document processing tasks such as question answering and summarization. However, different LLMs offer different capabilities for different tasks, and they also differ in cost, tokenization, and latency. Enterprises are already incurring huge costs from operating or using LLMs for their respective use cases.
Computation and Language, Artificial Intelligence, Machine Learning