MLCopilot: Unleashing the Power of Large Language Models in Solving Machine Learning Tasks

Lei Zhang,Yuge Zhang,Kan Ren,Dongsheng Li,Yuqing Yang
2024-01-01
Abstract:The field of machine learning (ML) has gained widespread adoption, leading tosignificant demand for adapting ML to specific scenarios, which is yetexpensive and non-trivial. The predominant approaches towards the automation ofsolving ML tasks (e.g., AutoML) are often time-consuming and hard to understandfor human developers. In contrast, though human engineers have the incredibleability to understand tasks and reason about solutions, their experience andknowledge are often sparse and difficult to utilize by quantitative approaches.In this paper, we aim to bridge the gap between machine intelligence and humanknowledge by introducing a novel framework, which leverages thestate-of-the-art large language models to develop ML solutions for novel tasks.We showcase the possibility of extending the capability of LLMs to comprehendstructured inputs and perform thorough reasoning for solving novel ML tasks.And we find that, after some dedicated design, the LLM can (i) observe from theexisting experiences of ML tasks and (ii) reason effectively to deliverpromising results for new tasks. The solution generated can be used directly toachieve high levels of competitiveness. Examples and code available athttps://github.com/microsoft/CoML.
What problem does this paper attempt to address?