OPTIMIZATION OF DIMENSION TABLE SCHEMA IN DATA WAREHOUSE

Wang Yigui,Chen Hanwu
DOI: https://doi.org/10.3969/j.issn.1000-386X.2009.06.017
2009-01-01
Abstract:Using snowflake schema to group dimension table in a data warehouse will bring too much connection cost.To solve this problem,a cost estimation model is built up with the measurement standard of the time cost of queries and the storage cost of dimension tables,and the optimization algorithms of dimension tables schema is designed by using genetic algorithms.The purpose of the design is to realise automatic adjustment of the dimension tables schema so that the system has smallest storage cost of dimension tables and time cost of queries in dimension tables schema.From the experiment a conclusion can be drawn that the querying speed can be expedited remarkably at a smaller space cost.
What problem does this paper attempt to address?