Abstract:Data science has revolutionized chemical research and continues to break down barriers with new interdisciplinary studies. The introduction of computational models and machine learning (ML) algorithms in combination with automation and traditional experimental techniques has enabled scientific advancement across nearly every discipline of chemistry, from materials discovery, to process optimization, to synthesis planning. However, predictive tools powered by data science are only as good as their data sets and, currently, many of the data sets used to train models suffer from several limitations, including being sparse, limited in scope and requiring human curation. Likewise, computational data faces limitations in terms of accurate modeling of nonideal systems and can suffer from low translation fidelity from simulation to real conditions. The lack of diverse data and the need to be able to test it experimentally reduces both the accuracy and scope of the predictive models derived from data science. This Account contextualizes the need for more complex and diverse experimental data and highlights how the seamless integration of robotics, machine learning, and data-rich monitoring techniques can be used to access it with minimal human labor. We propose three broad categories of data in chemistry: data on fundamental properties, data on reaction outcomes, and data on reaction mechanics. We highlight flexible, automated platforms that can be deployed to acquire and leverage these data. The first platform combines solid- and liquid-dosing modules with computer vision to automate solubility screening, thereby gathering fundamental data that are necessary for almost every experimental design. Using computer vision offers the additional benefit of creating a visual record, which can be referenced and used to further interrogate and gain insight on the data collected. The second platform iteratively tests reaction variables proposed by a ML algorithm in a closed-loop fashion. Experimental data related to reaction outcomes are fed back into the algorithm to drive the discovery and optimization of new materials and chemical processes. The third platform uses automated process analytical technology to gather real-time data related to reaction kinetics. This system allows the researcher to directly interrogate the reaction mechanisms in granular detail to determine exactly how and why a reaction proceeds, thereby enabling reaction optimization and deployment.

Controlled Experimentation in Continuous Experimentation: Knowledge and Challenges

Evidence-Based Guidelines for Advancing Continuous Experimentation

Characteristics of an Online Controlled Experiment: Preliminary Results of a Literature Review

Continuous Experimentation and Human Factors An Exploratory Study

The Viability of Continuous Experimentation in Early-Stage Software Startups: A Descriptive Multiple-Case Study

Towards Continuous Compounding Effects and Agile Practices in Educational Experimentation

A theory of factors affecting continuous experimentation (FACE)

A/B testing: A systematic literature review

The Automotive Take on Continuous Experimentation: A Multiple Case Study

Conducting A/B Experiments with a Scalable Architecture

Time to Stop and Think: What kind of research do we want to do?

Statistical Challenges in Online Controlled Experiments: A Review of A/B Testing Methodology

Meta-experiments: Improving experimentation through experimentation

Automated Experimentation Powers Data Science in Chemistry.

Continuous Software Engineering in the Wild

Controlled Experiments with Student Participants in Software Engineering: Preliminary Results from a Systematic Mapping Study

Improving Software Engineering Research through Experimentation Workbenches

Continuous Integration, Delivery and Deployment: A Systematic Review on Approaches, Tools, Challenges and Practices

An architecture for enabling A/B experiments in automotive embedded software

Online Controlled Experiments for Personalised e-Commerce Strategies: Design, Challenges, and Pitfalls

Experimenting with Experimentation: Rethinking The Role of Experimentation in Educational Design