Expressive generalized itemset mining
SOURCE CODE
The source code of the MAX-EGI Miner algorithm is available here.
The source code of the Level-sharing itemset mining algorithm is available here.
EVALUATED BENCHMARK DATASETS AND TAXONOMIES
The collection of evaluated UCI datasets [Blake et al., 2012] and the corresponding taxonomies is available here.
[Blake et al., 2012] C. Blake and C. Merz. UCI repository of machine learning databases. Available at http://archive.ics.uci.edu/ml. Last accessed: 16/12/2013.
SYNTHETIC DATA AND TAXONOMY GENERATOR
This is an extended version of the IBM Quest synthetic dataset generator, first used in [Mehta et al., 1996] in the context of data classification. The generator automatically produces structured datasets with a user-specified number of records and attributes, together with a taxonomy built over the data items.
The synthetic data and taxonomy generator is available here.
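To illustrate the kind of output described above, the following is a minimal Python sketch that produces a structured dataset with a given number of records and attributes, plus a simple taxonomy over its items. It is not the extended Quest generator itself: the function names (generate_dataset, build_taxonomy, generalize), the parameters n_values and fanout, the uniform value distribution, and the fixed three-level taxonomy shape are all assumptions made for illustration.

```python
import random


def generate_dataset(n_records, n_attributes, n_values=4, seed=0):
    """Return n_records records, each assigning one value to every attribute.

    Values are drawn uniformly at random (an assumption of this sketch;
    the actual generator may use other distributions).
    """
    rng = random.Random(seed)
    return [
        {f"A{a}": f"v{rng.randrange(n_values)}" for a in range(n_attributes)}
        for _ in range(n_records)
    ]


def build_taxonomy(n_attributes, n_values=4, fanout=2):
    """Build a child -> parent map over items (attribute, value).

    Hypothetical shape: every `fanout` consecutive values share one group,
    and every group of an attribute generalizes to that attribute's root.
    """
    parent = {}
    for a in range(n_attributes):
        attr = f"A{a}"
        root = (attr, "ALL")
        for v in range(n_values):
            group = (attr, f"g{v // fanout}")
            parent[(attr, f"v{v}")] = group
            parent[group] = root
    return parent


def generalize(item, parent):
    """Return the item followed by all of its ancestors, leaf to root."""
    chain = [item]
    while chain[-1] in parent:
        chain.append(parent[chain[-1]])
    return chain


records = generate_dataset(n_records=5, n_attributes=3)
taxonomy = build_taxonomy(n_attributes=3)
path = generalize(("A0", "v3"), taxonomy)
```

Here path lists the item ("A0", "v3") followed by its group and its attribute root, which is the generalization chain a generalized itemset miner would traverse when aggregating items at higher taxonomy levels.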
[Mehta et al., 1996] M. Mehta, R. Agrawal, and J. Rissanen. SLIQ: A fast scalable classifier for data mining. In EDBT, pages 18–32, 1996.