Abstract: In recent years, digital platform companies have faced increasing challenges in managing customer complaints, driven by widespread consumer adoption. This paper introduces an end-to-end pipeline, named RE-GrievanceAssist, designed specifically for real estate customer complaint management. The pipeline consists of three key components: i) a response/no-response ML model using TF-IDF vectorization and an XGBoost classifier; ii) a user type classifier using a fastText classifier; iii) an issue/sub-issue classifier using TF-IDF vectorization and an XGBoost classifier. Finally, it has been deployed as a batch job in Databricks, resulting in a remarkable 40% reduction in overall manual effort with a monthly cost reduction of Rs 1,50,000 since August 2023.
What problem does this paper attempt to address?
This paper addresses the difficulty online real-estate platforms face in managing customer complaints. As consumers have adopted these platforms widely, complaint volume has grown sharply, putting heavy pressure on the platforms' support operations. Specifically, the paper targets the following problems:
1. **High volume and complexity of complaints**: Online real-estate platforms receive more than 1,000 customer complaints every day, spanning multiple user types (such as property owners, brokers, and developers) and service categories (such as property listings, interior home design, and lease agreements). Manual processing cannot handle this volume efficiently.
2. **Subjectivity and inefficiency**: The current complaint-handling process relies on human agents. It is time-consuming, and because each agent applies their own judgment, classification and responses are inconsistent. Many complaints that need no human intervention are still assigned to agents, which wastes agent time.
3. **High cost**: Handling complaints this way requires a large number of human agents, which keeps operating costs high.
To solve these problems, the paper proposes an end-to-end machine-learning pipeline named **RE-GrievanceAssist**, built specifically for real-estate customer complaint management. The pipeline automates the complaint-handling process through three key components:
1. **Response/no-response model**: Uses TF-IDF vectorization and an XGBoost classifier to determine whether a complaint requires human intervention.
2. **User-type classifier**: Uses a fastText classifier to identify the type of user who filed the complaint.
3. **Issue/sub-issue classifier**: Uses TF-IDF vectorization and an XGBoost classifier to assign each complaint an issue and a sub-issue.
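The flow through the three components can be sketched as follows. This is only an illustrative sketch, not the paper's implementation: it assumes the stages run in the order listed and that a complaint flagged as needing no response skips the later stages. The `Triage` record, the `triage` function, the keyword-based stub classifiers, and all label names are hypothetical stand-ins for the trained TF-IDF/XGBoost and fastText models.

```python
from dataclasses import dataclass
from typing import Callable, Optional, Tuple


@dataclass
class Triage:
    """Hypothetical record holding the output of the three stages."""
    needs_response: bool
    user_type: Optional[str] = None
    issue: Optional[str] = None
    sub_issue: Optional[str] = None


def triage(
    text: str,
    needs_response: Callable[[str], bool],
    classify_user: Callable[[str], str],
    classify_issue: Callable[[str], Tuple[str, str]],
) -> Triage:
    """Run the three stages in order (assumed ordering)."""
    # Stage 1: decide whether a human agent has to respond at all.
    if not needs_response(text):
        return Triage(needs_response=False)
    # Stage 2: identify who is complaining.
    user_type = classify_user(text)
    # Stage 3: assign an issue and a sub-issue.
    issue, sub_issue = classify_issue(text)
    return Triage(True, user_type, issue, sub_issue)


# Keyword stubs standing in for trained models (hypothetical labels).
def stub_needs_response(text: str) -> bool:
    return "thank" not in text.lower()


def stub_user_type(text: str) -> str:
    return "broker" if "my client" in text.lower() else "owner"


def stub_issue(text: str) -> Tuple[str, str]:
    if "refund" in text.lower():
        return ("payment", "refund")
    return ("listing", "other")


result = triage(
    "Refund not received for my plan",
    stub_needs_response,
    stub_user_type,
    stub_issue,
)
print(result)
```

Passing the classifiers in as callables keeps the orchestration independent of the model libraries, so each stub could be swapped for a real model without changing `triage`.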
Together, these components reduce the manual workload, lower operating costs, and make complaint classification more consistent. Since its deployment in August 2023, the system has reduced overall manual effort by 40% and saved 150,000 rupees per month.
### Key formulas and methods
- **TF-IDF vectorization**: Used to convert text data into numerical feature vectors. The formula is as follows:
\[
\text{TF-IDF}(t, d) = \text{TF}(t, d) \times \text{IDF}(t)
\]
where
\[
\text{TF}(t, d) = \frac{\text{number of times term } t \text{ appears in document } d}{\text{total number of terms in document } d}
\]
\[
\text{IDF}(t) = \log\left(\frac{\text{total number of documents}}{(\text{number of documents containing term } t) + 1}\right)
\]
- **XGBoost classifier**: A machine-learning algorithm based on gradient-boosted decision trees, which can effectively handle classification tasks.
- **FastText classifier**: An efficient text-classification model, especially suitable for multi-class classification tasks.
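As a worked example of the TF-IDF formula above, the following minimal sketch computes the score for one term in a toy corpus. The corpus and the function names are hypothetical, and the code follows the formula exactly as written here (with the +1 in the IDF denominator); library implementations often use a different smoothing, so their numbers may not match.

```python
import math
from collections import Counter


def tf(term, doc_tokens):
    """Term frequency: occurrences of the term divided by document length."""
    if not doc_tokens:
        return 0.0
    return Counter(doc_tokens)[term] / len(doc_tokens)


def idf(term, corpus):
    """Inverse document frequency with +1 added to the document count."""
    n_containing = sum(1 for doc in corpus if term in doc)
    return math.log(len(corpus) / (n_containing + 1))


def tf_idf(term, doc_tokens, corpus):
    """TF-IDF score of a term in one document of the corpus."""
    return tf(term, doc_tokens) * idf(term, corpus)


# Hypothetical toy complaints, tokenized by whitespace.
corpus = [
    "listing not visible on site".split(),
    "refund not received for plan".split(),
    "broker keeps calling me".split(),
    "listing photo upload failed".split(),
]

# "refund" occurs once in a 5-token document and in 1 of 4 documents:
# TF = 1/5, IDF = log(4 / (1 + 1)) = log(2).
score = tf_idf("refund", corpus[1], corpus)
print(round(score, 4))
```

With this variant, a term that appears in every document gets a negative IDF, since the denominator then exceeds the number of documents.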
By combining these techniques, RE-GrievanceAssist improves the efficiency and accuracy of customer complaint management.