Abstract
This paper presents new measures, based on the induced decision tree, for characterising datasets in meta-learning so that appropriate learning algorithms can be selected. The main idea is to capture the characteristics of a dataset from the structural shape and size of the decision tree induced from it. In total, 15 measures are proposed to describe the structure of a decision tree. Their effectiveness is illustrated through extensive experiments that compare them with results obtained by existing data characterisation techniques, including the data characteristics tool (DCT), which is the most widely used technique in meta-learning, and Landmarking, which is the most recently developed method.
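As a rough illustration of the idea of describing a dataset through the shape and size of its induced tree, the sketch below computes a few structural quantities for an already-built tree. The abstract does not list the 15 proposed measures, so the quantities here (node count, leaf count, height, maximum level width, number of distinct split attributes) are assumed examples only, and the `Node` class and `tree_measures` function are hypothetical names introduced for this sketch.

```python
from collections import Counter
from dataclasses import dataclass, field
from typing import Dict, List, Optional


@dataclass
class Node:
    """Hypothetical tree node: a leaf has no attribute and no children."""
    attribute: Optional[str] = None
    children: List["Node"] = field(default_factory=list)


def tree_measures(root: Node) -> Dict[str, int]:
    """Compute illustrative shape/size measures of a decision tree.

    These are assumed examples, not the measures defined in the paper.
    """
    level_widths: Counter = Counter()   # number of nodes at each depth
    split_attributes: Counter = Counter()  # how often each attribute is tested
    leaves = 0
    nodes = 0

    stack = [(root, 0)]  # iterative depth-first traversal; root is depth 0
    while stack:
        node, depth = stack.pop()
        nodes += 1
        level_widths[depth] += 1
        if node.children:
            split_attributes[node.attribute] += 1
        else:
            leaves += 1
        stack.extend((child, depth + 1) for child in node.children)

    return {
        "nodes": nodes,
        "leaves": leaves,
        "height": max(level_widths),            # depth of the deepest level
        "max_width": max(level_widths.values()),
        "distinct_attributes": len(split_attributes),
    }


# A small hand-built tree standing in for one induced from a dataset.
example = Node("outlook", [
    Node("humidity", [Node(), Node()]),
    Node(),
    Node("wind", [Node(), Node()]),
])
measures = tree_measures(example)
print(measures)
```

A vector of such values, computed from the tree induced on each dataset, could then serve as the dataset's description for a meta-learner.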
Translated title of the contribution | Improved data set characterisation for meta-learning |
---|---|
Original language | English |
Title of host publication | Unknown |
Publisher | Springer |
Pages | 141 - 152 |
Number of pages | 11 |
ISBN (Print) | 3540001883 |
Publication status | Published - Jan 2002 |