Big data collection in pharmaceutical manufacturing and its use for product quality predictions

Janja Žagar,Jurij Mihelič
DOI: https://doi.org/10.1038/s41597-022-01203-x
2022-03-23
Scientific Data
Abstract:Abstract Advances in data science and digitalization are transforming the world, and the pharmaceutical industry is no exception. Multiple sensor-equipped manufacturing processes and laboratory analysis are the main sources of primary data, which have been utilized for the presented dataset of 1005 actual production batches of selected medicine. This dataset includes incoming raw material quality results, compression process time series and final product quality results for the selected product. The data is highly valuable for it provides an insight into every 10 seconds of the process trajectory for 1005 actual production batches along with product quality collected over several years. It therefore offers an opportunity to develop advanced analysis models and procedures which would lead to the omission of current conventional and time consuming laboratory testing. Benefits for both the industry and patient are obvious: reducing product lead times and costs of manufacture.
multidisciplinary sciences
What problem does this paper attempt to address?