SYNTHETIC DATA AND TAXONOMY GENERATOR
This is an extended version of the IBM Quest synthetic data generator, first used in [Mehta et al., 1996] for data classification. The generator automatically produces structured datasets with a user-specified number of records and attributes, together with a taxonomy built over the data items.
The synthetic data and taxonomy generator is available here.
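To make the shape of the output concrete, the following is a minimal illustrative sketch in Python of a structured dataset paired with a taxonomy over its items. It is not the generator's actual algorithm or interface: the function names (generate_dataset, build_taxonomy, ancestors), the parameters values_per_attribute and fanout, the uniform random sampling, and the item naming scheme are all assumptions made for illustration.

```python
import random


def generate_dataset(num_records, num_attributes, values_per_attribute=4, seed=0):
    """Hypothetical helper: draw records uniformly at random.

    Each attribute A<k> gets its own small categorical domain; the real
    generator's distributions are not described here and may differ.
    """
    rng = random.Random(seed)
    domains = {
        f"A{a}": [f"A{a}_v{v}" for v in range(values_per_attribute)]
        for a in range(num_attributes)
    }
    records = [
        {attr: rng.choice(values) for attr, values in domains.items()}
        for _ in range(num_records)
    ]
    return domains, records


def build_taxonomy(domains, fanout=2):
    """Hypothetical helper: build one tree per attribute, bottom-up.

    Returns a child -> parent map. Leaves are the data items; each level
    groups up to `fanout` nodes under a new generalized node.
    """
    if fanout < 2:
        raise ValueError("fanout must be at least 2")
    parent = {}
    for attr, values in domains.items():
        level = list(values)
        depth = 0
        while len(level) > 1:
            depth += 1
            next_level = []
            for i in range(0, len(level), fanout):
                node = f"{attr}_L{depth}_g{i // fanout}"
                for child in level[i:i + fanout]:
                    parent[child] = node
                next_level.append(node)
            level = next_level
    return parent


def ancestors(item, parent):
    """List the generalizations of an item, from nearest to root."""
    chain = []
    while item in parent:
        item = parent[item]
        chain.append(item)
    return chain


if __name__ == "__main__":
    domains, records = generate_dataset(num_records=5, num_attributes=3)
    taxonomy = build_taxonomy(domains)
    first_value = records[0]["A0"]
    print(records[0])
    print(first_value, "->", ancestors(first_value, taxonomy))
```

In this sketch the taxonomy is stored as a child-to-parent map, so generalizing any item in a record to a higher level is a simple walk toward the root.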
[Mehta et al., 1996] M. Mehta, R. Agrawal, and J. Rissanen. SLIQ: A fast scalable classifier for data mining. In EDBT, pages 18–32, 1996.