HIGH ACCURACY BACK-RETREAT DIFFUSION-FUZZY CLUSTERING OF BREAST CANCER DATA FOR THE DETECTION OF MALIGNANCY

Ashutosh Patri, Abhijit Nayak

View Full Paper

Keywords

DifFUZZY; Wisconsin Breast Cancer Data; Fuzzy C-Means; Breast Cancer; Fuzzy Clustering.

Abstract

A novel fuzzy clustering method has been proposed here for separating the breast cancer data, which operates with reasonable accuracy, allows flexibility in dataset and is modestly time consuming. This method can be applied to any type of cancer data set with some initial labels to ob- tain high accuracy result in the classification of unlabeled samples. Further, the curse of dimensionality is not an is- sue for the proposed scheme as it can be applied to data having any number of dimensions or attributes. The Dif- FUZZY unsupervised clustering algorithm is applied at the initial stage, giving an accuracy of 96.28% over Wisconsin Breast Cancer Dataset (WBCD); the result is further im- proved to 98.14% by using the proposed Back-Retreat al- gorithm. The formed clusters are estimated using three in- ternal cluster validation indices and the performance of the method is evaluated using receiver operating characteristic (ROC) curves. The clustering algorithm is compared with Fuzzy C-Means (FCM) algorithm and the results are com- pared with different classifiers and clustering techniques.

Important Links:



Go Back