Expressive generalized itemset mining


This page has hierarchy - Parent page: Research

SOURCE CODE

The source code of the MAX-EGI Miner algorithm is available here.

The source code of the Level-sharing itemset mining algorithm is available here.

EVALUATED BENCHMARK DATASETS AND TAXONOMIES

The collection of evaluated UCI datasets [Blake et al., 2012] and the corresponding taxonomies is available here.

[Blake et al., 2012] C. Blake and C. Merz. UCI repository of machine learning databases. Available at http://archive.ics.uci.edu/ml. Last accesed: 16/12/2013.

SYNTHETIC DATA AND TAXONOMY GENERATOR

This is an extended version of the Quest IBM synthetic dataset generator, first exploited in [Mehta et al., 1996] in the context of data classification. The data generator automatically produces structured datasets composed of a user-specified number of records and attributes supplied with a taxonomy built over the data items.

The synthetic data and taxonomy generator is available here.

[Mehta et al., 1996] M. Mehta, R. Agrawal, and J. Rissanen. Sliq: A fast scalable classifier for data mining. In EDBT, pages 18–32, 1996.