Data Publishing in Mechanics and Dynamics: Challenges, Guidelines, and Examples from Engineering Design

Henrik Ebel,Jan van Delden,Timo Lüddecke,Aditya Borse,Rutwik Gulakala,Marcus Stoffel,Manish Yadav,Merten Stender,Leon Schindler,Kristin Miriam de Payrebrune,Maximilian Raff,C. David Remy,Benedict Röder,Peter Eberhard
2024-10-08
Abstract:Data-based methods have gained increasing importance in engineering, especially but not only driven by successes with deep artificial neural networks. Success stories are prevalent, e.g., in areas such as data-driven modeling, control and automation, as well as surrogate modeling for accelerated simulation. Beyond engineering, generative and large-language models are increasingly performing and helping with tasks that, previously, were solely associated with creative human processes. Thus, it seems timely to seek artificial-intelligence-support for engineering design tasks to automate, help with, or accelerate purpose-built designs of engineering systems, e.g., in mechanics and dynamics, where design so far requires a lot of specialized knowledge. However, research-wise, compared to established, predominantly first-principles-based methods, the datasets used for training, validation, and test become an almost inherent part of the overall methodology. Thus, data publishing becomes just as important in (data-driven) engineering science as appropriate descriptions of conventional methodology in publications in the past. This article analyzes the value and challenges of data publishing in mechanics and dynamics, in particular regarding engineering design tasks, showing that the latter raise also challenges and considerations not typical in fields where data-driven methods have been booming originally. Possible ways to deal with these challenges are discussed and a set of examples from across different design problems shows how data publishing can be put into practice. The analysis, discussions, and examples are based on the research experience made in a priority program of the German research foundation focusing on research on artificially intelligent design assistants in mechanics and dynamics.
Computers and Society,Artificial Intelligence,Computational Engineering, Finance, and Science,Emerging Technologies,Systems and Control
What problem does this paper attempt to address?
### What problems does this paper attempt to solve? This paper aims to explore the challenges, guidelines, and practical cases of data publishing in the fields of mechanics and dynamics, especially in engineering design tasks. Specifically, it focuses on the following aspects: 1. **Importance of data publishing**: - With the increasingly wide application of data - based methods (especially the success of deep neural networks) in the engineering field, the publication of data sets has become as important as the description of traditional methods. Data sets are not only the basis for training, validating, and testing models, but also the key to ensuring the reproducibility and transparency of research. 2. **Challenges of data publishing**: - **Interpretability and domain knowledge**: Raw data is difficult to understand for those who are not familiar with the data generation process. In addition, data representation may depend on the underlying system, analysis method, and observation method. - **Problem complexity**: In mechanics and dynamics, complex engineering problems are common, but overly complex problems will limit the interest and potential impact of machine - learning experts. - **Generalization ability**: Compared with other fields, problems in mechanics and engineering design may be too specific, resulting in under - utilization of data sets. - **Evaluation challenges**: In real - world engineering design problems, there is usually no "correct" reference solution, so evaluating the quality of candidate designs requires a complex simulation and computing environment. 3. **Strategies for dealing with challenges**: - **Evaluation mechanism**: Provide lightweight and easy - to - install - and - use evaluation tools, or upload results through a web interface for remote evaluation. - **Interpretability and domain knowledge**: Follow the FAIR principles (Findable, Accessible, Interoperable, Reusable), and attach descriptive metadata and visualization code. - **Problem complexity**: Adjust the problem complexity according to the needs of the target audience, and clearly state the advantages of the new method and the possibility of future expansion. - **Generalization ability**: Frame the problem appropriately, explain its general importance and data characteristics, and attract researchers from other fields. 4. **Practical cases**: - The paper presents six practical cases, covering the data set and code publication of different design problems such as vibrating plates and collision boxes. These cases show how to overcome the above challenges and provide references for other researchers. ### Summary By analyzing the value and challenges of data publishing in mechanics and dynamics, this paper proposes specific coping strategies and shows how to apply data publishing to engineering design tasks through practical cases. This not only helps to improve the reproducibility and transparency of research, but also promotes interdisciplinary cooperation and innovation.