Abstract:Artificial Intelligence (AI) has made incredible progress recently. On the one hand, advanced foundation models like ChatGPT can offer powerful conversation, in-context learning and code generation abilities on a broad range of open-domain tasks. They can also generate high-level solution outlines for domain-specific tasks based on the common sense knowledge they have acquired. However, they still face difficulties with some specialized tasks because they lack enough domain-specific data during pre-training or they often have errors in their neural network computations on those tasks that need accurate executions. On the other hand, there are also many existing models and systems (symbolic-based or neural-based) that can do some domain-specific tasks very well. However, due to the different implementation or working mechanisms, they are not easily accessible or compatible with foundation models. Therefore, there is a clear and pressing need for a mechanism that can leverage foundation models to propose task solution outlines and then automatically match some of the sub-tasks in the outlines to the off-the-shelf models and systems with special functionalities to complete them. Inspired by this, we introduce <a class="link-external link-http" href="http://TaskMatrix.AI" rel="external noopener nofollow">this http URL</a> as a new AI ecosystem that connects foundation models with millions of APIs for task completion. Unlike most previous work that aimed to improve a single AI model, <a class="link-external link-http" href="http://TaskMatrix.AI" rel="external noopener nofollow">this http URL</a> focuses more on using existing foundation models (as a brain-like central system) and APIs of other AI models and systems (as sub-task solvers) to achieve diversified tasks in both digital and physical domains. As a position paper, we will present our vision of how to build such an ecosystem, explain each key component, and use study cases to illustrate both the feasibility of this vision and the main challenges we need to address next.

Mobile Foundation Model As Firmware the Way Towards a Unified Mobile AI Landscape

Mobile Foundation Model as Firmware

Close the Gap Between Deep Learning and Mobile Intelligence by Incorporating Training in the Loop

Explore Training of Deep Convolutional Neural Networks on Battery-powered Mobile Devices: Design and Application

Rethinking Mobile AI Ecosystem in the LLM Era

MobileNetV4 -- Universal Models for the Mobile Ecosystem

Automating Cloud Deployment for Real-Time Online Foundation Model Inference

Foundation Model Based Native AI Framework in 6G with Cloud-Edge-End Collaboration

Training Large-scale Foundation Models on Emerging AI Chips

A Novel Heterogeneous Computing Middleware for Mobile AI Services

Training and Serving System of Foundation Models: A Comprehensive Survey

MNN: A Universal and Efficient Inference Engine

Comparison and Benchmarking of AI Models and Frameworks on Mobile Devices

Hardware-middleware system co-design for flexible training of foundation models in the cloud

Foundation models in brief: A historical, socio-technical focus

MobileAIBench: Benchmarking LLMs and LMMs for On-Device Use Cases

Configurable Foundation Models: Building LLMs from a Modular Perspective

Forging Vision Foundation Models for Autonomous Driving: Challenges, Methodologies, and Opportunities

TaskMatrix.AI: Completing Tasks by Connecting Foundation Models with Millions of APIs

Foundation Models in Robotics: Applications, Challenges, and the Future

A Survey of Resource-efficient LLM and Multimodal Foundation Models