Synthetic images datasets of clean and dirty string insulators used in high-voltage power lines
Hericles Ferraz,Rogério Gonçalves,Breno Moura,Daniel Sudbrack,Paulo Trautmann,Bruno Clasen,Rafael Homma,Reinaldo A. C. Bianchi
DOI: https://doi.org/10.1007/s40430-024-05204-2
IF: 2.361
2024-10-10
Journal of the Brazilian Society of Mechanical Sciences and Engineering
Abstract:String insulators are interconnected disks that insulate the electrical conductors of high-voltage transmission lines from the supporting towers, withstanding both mechanical stress and electrical current. Regular cleaning is crucial for string insulators, as different types of pollutants can build up on the insulators, impairing their ability to prevent electrical arcing and short circuits. However, visual observation, the primary method for verifying cleaning needs, can be misleading. Developing robust algorithms for autonomous system-based inspection leveraging artificial intelligence presents a promising solution to address the limitations of visual inspection in preventive insulator maintenance. However, machine learning algorithms require considerable training data to learn from patterns, avoid overfitting, and account for outliers and data variation within the training dataset, especially considering the limited availability of images depicting dirty string insulators, as reported in the literature. In response to the limitations of existing training data for machine learning algorithms in insulator inspection, this work proposes developing a synthetic dataset for clean and dirty string insulators using software tools, such as SolidWorks, Inventor, and Unity 3D to model the towers, insulators, backgrounds, and various pollutant types. The clean string insulator dataset contains 47,286 synthetic images featuring diverse string insulator materials (glass, polymer, and porcelain) situated within various realistic backgrounds (mountain, forest, desert, city, river, and plantation). Similarly, the dirty insulator dataset comprises 14,424 distinct images, including clean insulators and simulated contaminants, such as soot, salt, and bird droppings. To test the proposed dataset, first, a classical neural semantic segmentation network was used to segment the clean string insulators dataset images, and an average dice coefficient of 0.95 was achieved by using only synthetic images. The same network, trained only with the synthetic dataset, was also tested using real-world images of string insulators, resulting in an average dice coefficient of 0.92. The synthetic dataset of dirty string insulators was used in a classical classification deep neural network, obtaining an average accuracy of 0.97 for real images of dirty insulators. This work demonstrates the potential of fully synthetic datasets for training machine learning models. Our results indicate that such models can achieve high accuracy in real-world applications, including accurately identifying insulator chains and classifying contaminants within real-world images.
engineering, mechanical