Abstract:Large language models (LLMs) have seen rapid improvement in the recent years, and have been used in a wider range of applications. After being trained on large text corpus, LLMs obtain the capability of extracting rich features from textual data. Such capability is potentially useful for the web service recommendation task, where the web users and services have intrinsic attributes that can be described using natural language sentences and are useful for recommendation. In this paper, we explore the possibility and practicality of using LLMs for web service recommendation. We propose the large language model aided QoS prediction (llmQoS) model, which use LLMs to extract useful information from attributes of web users and services via descriptive sentences. This information is then used in combination with the QoS values of historical interactions of users and services, to predict QoS values for any given user-service pair. On the WSDream dataset, llmQoS is shown to overcome the data sparsity issue inherent to the QoS prediction problem, and outperforms comparable baseline models consistently.
What problem does this paper attempt to address?
The problem that this paper attempts to solve is how to effectively predict Quality of Service (QoS) in cloud service recommendation, especially in the case of sparse data. Specifically, the paper focuses on how to accurately predict the QoS values of unknown user - service pairs based on the historical interaction data of users and services among a large number of cloud services, so as to provide more appropriate service recommendations for users.
### Background and Challenges
1. **Growth of Cloud Services**: With the rapid development of cloud computing, the number of cloud services has increased dramatically, resulting in many services having the same or similar functions.
2. **Importance of QoS**: QoS (such as throughput, response time, and cost) is an important indicator for measuring service performance, which is crucial for differentiating similar services and helping users choose the most suitable service.
3. **Data Sparsity**: In actual scenarios, since users usually only interact with a small number of services, the historical QoS matrix is often highly sparse, which brings difficulties to QoS prediction.
### Existing Methods and Their Limitations
- **Collaborative Filtering (CF)**: It is one of the most commonly used methods, which predicts unknown QoS values by learning historical user - service interaction data. However, the data sparsity problem limits the effectiveness of this method.
- **Deep Learning Methods**: In recent years, deep - learning - based methods have also made significant progress in QoS prediction, but most methods mainly rely on the ID information of users and services and fail to fully utilize the rich information in natural - language descriptions.
### Innovations of the Paper
- **Introduction of Large - Language Models (LLMs)**: The paper proposes a new method, that is, using large - language models (such as RoBERTa and Phi3mini) to extract useful features from the attribute descriptions of users and services, and combining historical QoS values for QoS prediction.
- **Alleviation of Data Sparsity**: By using the text features extracted by LLMs, the information in historical interaction data can be supplemented, effectively alleviating the data sparsity problem and improving the accuracy and reliability of prediction.
### Method Overview
1. **Constructing Descriptive Sentences**: For each user and service, use their attributes (such as country, autonomous system, etc.) to construct descriptive sentences.
2. **Feature Extraction**: Use pre - trained LLMs (such as RoBERTa and Phi3mini) to extract text features from the descriptive sentences.
3. **Feature Fusion**: Combine the extracted LLM features with the ID features of users and services and input them into a multi - layer perceptron (MLP) network for QoS prediction.
### Experimental Results
- **Performance Improvement**: On the WSDream dataset, the proposed llmQoS model is significantly superior to the existing baseline models under different data sparsity levels, especially in the prediction of throughput and response time, with significant improvements in both MAE and RMSE.
- **Robustness**: The llmQoS model is not only applicable to different LLMs but can also be generalized to different collaborative filtering network architectures.
### Conclusion
The paper successfully solves the data sparsity problem in QoS prediction by introducing large - language models, improving the accuracy and reliability of service recommendation. This method provides new ideas and directions for future QoS prediction research.