SLNET: A Redistributable Corpus of 3rd-party Simulink Models

Sohil Lal Shrestha, Shafiul Azam Chowdhury, Christoph Csallner
DOI: https://doi.org/10.1145/3524842.3528001
2022-03-31
Abstract: MATLAB/Simulink is widely used for model-based design. Engineers create Simulink models and compile them to embedded code, often to control safety-critical cyber-physical systems in automotive, aerospace, and healthcare applications. Despite Simulink's importance, there are few large-scale empirical Simulink studies, perhaps because there is no large readily available corpus of third-party open-source Simulink models. To enable empirical Simulink studies, this paper introduces SLNET, the largest corpus of freely available third-party Simulink models. SLNET has several advantages over earlier collections. Specifically, SLNET is 8 times larger than the largest previous corpus of Simulink models, includes fine-grained metadata, is constructed automatically, is self-contained, and allows redistribution. SLNET is available under permissive open-source licenses and contains all of its collection and analysis tools.
Software Engineering
What problem does this paper attempt to address?
The paper addresses the lack of a large, freely redistributable corpus of third-party Simulink models, a gap that limits large-scale empirical research on Simulink models and modeling practices. Specifically:

1. **Lack of a large-scale Simulink model collection**: No large, readily available collection of third-party Simulink models exists, which makes large-scale empirical research difficult.
2. **Problems with existing collections**:
   - **Incomplete metadata**: Existing model collections lack detailed metadata, which limits the depth and breadth of research.
   - **Not self-contained**: Existing model collections are usually not self-contained and require additional data or tools to be used.
   - **Not redistributable**: Due to license restrictions, existing model collections often cannot be freely redistributed, which further limits their use in research.

To solve these problems, the paper introduces SLNET, the largest corpus of freely available third-party Simulink models. SLNET has the following advantages:

- **Larger in scale**: SLNET is 8 times larger than the largest previous corpus of Simulink models.
- **Contains detailed metadata**: SLNET provides fine-grained project-level and model-level metadata.
- **Automated construction**: SLNET is constructed automatically, including data acquisition, cleaning, metric calculation, and packaging.
- **Self-contained and redistributable**: SLNET is self-contained and can be freely redistributed under its permissive open-source licenses.

With these improvements, SLNET aims to make it easier for engineers and researchers to generate, reproduce, and verify empirical results about Simulink models, modeling practices, and related tools.