Multi-Objective Genetic Algorithm for Materialized View Optimization in Data Warehouses

Mahdi Manavi
2024-03-29
Abstract:Materialized views can significantly improve database query performance but identifying the optimal set of views to materialize is challenging. Prior work on automating and optimizing materialized view selection has limitations in execution time and total cost. In this paper, we present a novel genetic algorithm based approach to materialized view selection that aims to minimize execution time and total cost. Our technique encodes materialized view configurations as chromosomes and evolves the population over generations to discover high quality solutions. We employ an adaptive mutation rate, multi-objective fitness function, and lexicase selection to enhance genetic search. Comprehensive experiments on the TPC-H benchmark demonstrate the effectiveness of our algorithm. Compared to stateof-the-art methods, our approach improves average execution time by 11% and reduces total materialized view costs by an average of 16 million. These gains highlight the benefits of a datadriven evolutionary approach. Our genetic algorithm framework significantly outperforms current materialized view selection techniques in both efficiency and total cost reduction. This work represents an important advance in enabling performant and cost-effective utilization of materialized views in enterprise systems.
Databases
What problem does this paper attempt to address?
The paper proposes a new approach to solving the materialized view optimization problem in data warehousing. Materialized views can significantly improve database query performance, but selecting the optimal set of views is a challenge. Existing automated and optimized selection methods have limitations in terms of execution time and overall cost. The paper introduces a new approach based on genetic algorithms, aimed at minimizing execution time and overall cost. By encoding materialized view configurations as chromosomes and evolving populations over multiple generations to discover high-quality solutions, the method employs adaptive mutation rate, multi-objective fitness function, and lexical selection to enhance genetic search. Experimental results show that the algorithm can reduce execution time by an average of 11% and materialized view cost by 16 million compared to existing best methods. This work represents significant progress in achieving high performance and cost-effectiveness of materialized view usage in enterprise systems.