Applications of Machine Learning in Biopharmaceutical Process Development and Manufacturing: Current Trends, Challenges, and Opportunities

Thanh Tung Khuat,Robert Bassett,Ellen Otte,Alistair Grevis-James,Bogdan Gabrys
2023-10-16
Abstract:While machine learning (ML) has made significant contributions to the biopharmaceutical field, its applications are still in the early stages in terms of providing direct support for quality-by-design based development and manufacturing of biopharmaceuticals, hindering the enormous potential for bioprocesses automation from their development to manufacturing. However, the adoption of ML-based models instead of conventional multivariate data analysis methods is significantly increasing due to the accumulation of large-scale production data. This trend is primarily driven by the real-time monitoring of process variables and quality attributes of biopharmaceutical products through the implementation of advanced process analytical technologies. Given the complexity and multidimensionality of a bioproduct design, bioprocess development, and product manufacturing data, ML-based approaches are increasingly being employed to achieve accurate, flexible, and high-performing predictive models to address the problems of analytics, monitoring, and control within the biopharma field. This paper aims to provide a comprehensive review of the current applications of ML solutions in a bioproduct design, monitoring, control, and optimisation of upstream, downstream, and product formulation processes. Finally, this paper thoroughly discusses the main challenges related to the bioprocesses themselves, process data, and the use of machine learning models in biopharmaceutical process development and manufacturing. Moreover, it offers further insights into the adoption of innovative machine learning methods and novel trends in the development of new digital biopharma solutions.
Machine Learning
What problem does this paper attempt to address?
The problems that this paper attempts to solve mainly focus on the following aspects: 1. **Improving the Quality and Efficiency of Biopharmaceutical Processes**: The paper discusses how to use machine learning (ML) techniques to support Quality - by - Design (QbD) - based biopharmaceutical development and manufacturing. Traditional biopharmaceutical production relies on costly and time - consuming quality testing methods, which often fail to gain in - depth understanding of the fundamental problems in biological processes. By introducing ML techniques, more effective process control strategies can be implemented, thereby improving product quality and production efficiency. 2. **Addressing the Challenges in Biopharmaceutical Processes**: The production process of biopharmaceuticals is complex and involves multiple inter - related steps, such as cell culture, purification, product formulation, etc. The paper analyzes the main challenges existing in the current biopharmaceutical processes, including product heterogeneity, limitations in process monitoring, and the lack of online process control strategies. These problems limit the development potential of biopharmaceutical automation. 3. **Promoting Data - Driven Biopharmaceutical Manufacturing**: With the accumulation of large - scale production data, the trend of using ML models to replace traditional multivariate data analysis methods is becoming increasingly evident. The paper explores how to use advanced Process Analytical Technologies (PAT) to monitor process variables and product quality attributes in real - time, and solve the analysis, monitoring, and control problems in the biopharmaceutical field by constructing accurate, flexible, and high - performance prediction models. 4. **Driving the Digital Transformation of the Biopharmaceutical Industry**: The paper also discusses the trend of the biopharmaceutical industry towards digital transformation, including using soft sensors and PAT methods to develop advanced control strategies, developing hybrid models that combine data - driven and knowledge - driven approaches, and establishing a central data warehouse to support data integration and access across different scales and unit operations. These measures are helpful for realizing fully automated data - driven biopharmaceutical manufacturing facilities. In summary, this paper aims to provide a comprehensive review, covering the current application status of ML in the design, monitoring, control, and optimization of upstream, downstream, and product formulation processes of biopharmaceutical products. At the same time, it deeply discusses the challenges related to the biological process itself, process data, and the use of machine learning models, and proposes directions for future research.