Service Recommendations for Mashup Based on Generation Model

Guodong Fan,Shizhan Chen,Qiang He,Hongyue Wu,Jing Li,Xiao Xue,Zhiyong Feng
DOI: https://doi.org/10.1109/tsc.2023.3329511
IF: 11.019
2024-01-01
IEEE Transactions on Services Computing
Abstract:Service recommendations are crucial for developers to create mashups such as mobile applications, workflows, e-business solutions, etc. Existing methods based on collaborative filtering or content analysis are manual and cannot automatically acquire services that align with the requirements of mashup creation. A possible solution to automatically acquiring necessary services for mashups is the seq2seq (sequence to sequence) generation model, which has demonstrated promising performance in automatic text and program code generation. However, two main challenges must be tackled in service acquisition based on the seq2seq model. First, the seq2seq model can only acquire a set of services without inter-service dependencies, but such dependencies are crucial in the generation of sequences for services. Second, external knowledge must be leveraged to recommend services more accurately that fulfill developers' requirements, such as similar historical user requirements and combining mashup category information, due to the incomplete description of user requirements. To tackle these challenges, this paper proposes GSR ( G eneration of S ervice R ecommendations), an approach that can automatically acquire services based on user requirements. Specifically, GSR employs reinforcement learning to learn the inter-dependencies among services and integrate dependencies into service recommendations. To further improve the quality of the acquired services, GSR retrieves relevant user requirements based on BERT (Bidirectional Encoder Representation from Transformers) to help identify potential services. Experiment results conducted on real-world datasets show the superior performance of GSR. Compared with the existing recommendation approaches, the precision metric is increased by up to 1.99x, and the recall metric is increased by up to 12%.
What problem does this paper attempt to address?