Generating Market Basket Data with Temporal Information

Yingjiu Li,Peng Ning,Sean Wang,Sushil Jajodia
2001-01-01
Abstract:This paper presents a synthetic data generator that outputs timestamped transactional data with embedded tempo- ral patterns controlled by a set of input parameters. In particular, calendarschema, which is determined by a hierarchy of input time granularities, is used as a framework of possible temporal patterns. An example of calendar schema is (year, month, day), which provides a framework for calendar-based temporal patterns of the form , where each is either an integer or the symbol . For example, is such a pattern, which corresponds to the time intervals consisting of all the 16th days of all months in year 2000. This paper also evaluates the data generator through a series of experiments. The synthetic data generator is intended to provide support for data mining commu- nity in evaluating various aspects (especially the temporal aspects and the scalability) of data mining algorithms.
What problem does this paper attempt to address?