MLOps: Automatic, Zero-Touch and Reusable Machine Learning Training and Serving Pipelines

D. Lu,I. Baran,Deven Panchal,D. Musgrove
DOI: https://doi.org/10.1109/IoTaIS60147.2023.10346079
2023-11-28
Abstract:Due to increased interest in Machine Learning, Deep Learning and AI and easy availability of ML toolk-its/libraries/ frameworks as well as talent that has ML skillsets and know-how to use these ML tools, Machine learning model building has become easier and is now being employed to solve a variety of problems across different fields. But when the time comes to be able to take these ML/DL models to production i.e., integrate them with business applications, it is often a challenge. This is due to the many skillsets in software development, cloud, DevOps, system design, data engineering, that this requires. Furthermore, we see that ML engineers or data scientists and software developers often work in silos to make locally optimum decisions without thinking about how the model is going to be used in production, how it is going to be trained on fresh new incoming data and how and when it is going to be replaced with a new better model. In short, most of the time, there is no plan for how models will behave when in production nor any automatic pipelines to manage them. This leads to lots of difficulty in productionalizing the ML models and leads to wasted time and effort. We we will demonstrate how we can take an ML model to production very easily using components from the Acumos AI project and do much more by creating zero-touch i.e., automatic and reusable ML model training and ML model serving pipelines and reusable model infrastructures using Acumos and Nifi. We will create a Cloud Forensics ML model, productionalize it and show how it can be consumed in a business application.
Computer Science
What problem does this paper attempt to address?