Inferring Software Behavioral Models with MapReduce

Chen Luo,Fei He,Carlo Ghezzi
DOI: https://doi.org/10.1007/978-3-319-25942-0_9
IF: 1.039
2017-01-01
Science of Computer Programming
Abstract:Software systems are often built without developing any explicit model and therefore research has been focusing on automatic inference of models by applying machine learning to execution logs. However, the logs generated by a real software system may be very large and the inference algorithm can exceed the capacity of a single computer. This paper focuses on inference of behavioral models and explores to use of MapReduce to deal with large logs. The approach consists of two distributed algorithms that perform trace slicing and model synthesis. For each job, a distributed algorithm using MapReduce is developed. With the parallel data processing capacity of MapReduce, the problem of inferring behavioral models from large logs can be efficiently solved. The technique is implemented on top of Hadoop. Experiments on Amazon clusters show efficiency and scalability of our approach.
What problem does this paper attempt to address?