CORRELATION WEIGHTED HETEROGENEOUS EUCLIDEAN-OVERLAP METRIC

Chaoqun Li and Hongwei Li

Keywords

Distance function, Heterogeneous Euclidean-Overlap Metric, correlation weighting

Abstract

Many data mining algorithms depend on a good distance function to be successful. Among large numbers of distance functions, Hetero- geneous Euclidean-Overlap Metric (simply HEOM) is the simplest but effective distance function to handle the applications with both continuous and nominal attributes. In order to scale up its general- ization performance, we present an improved HEOM by correlation weighting. We call our improved HEOM correlation weighted Het- erogeneous Euclidean-Overlap Metric (simply CWHEOM) in this paper. In CWHEOM, to discrete and continuous class problems, we apply different correlation functions to estimate the correlation between attribute variables and class variable. Experiments running on 36 discrete class data sets and 36 continuous class data sets validate its effectiveness.

Important Links:



Go Back