A novel approach to cutting decision trees
CENTRAL EUROPEAN JOURNAL OF OPERATIONS RESEARCH, cilt.22, sa.3, ss.553-565, 2014 (SCI-Expanded, Scopus)
- Yayın Türü: Makale / Tam Makale
- Cilt numarası: 22 Sayı: 3
- Basım Tarihi: 2014
- Doi Numarası: 10.1007/s10100-013-0312-9
- Dergi Adı: CENTRAL EUROPEAN JOURNAL OF OPERATIONS RESEARCH
- Derginin Tarandığı İndeksler: Science Citation Index Expanded (SCI-EXPANDED), Scopus
- Sayfa Sayıları: ss.553-565
- İstanbul Kültür Üniversitesi Adresli: Evet
Özet
In data mining, binary classification has a wide range of applications. Cutting Decision Tree (CDT) induction is an efficient mathematical programming based method that tries to discretize the data set on hand by using multiple separating hyperplanes. A new improvement to CDT model is proposed in this study by incorporating the second goal of maximizing the distance of the correctly classified instances to the misclassification region. Computational results show that developed model achieves better classification accuracy for Wisconsin Breast Cancer database and Japanese Banks data set when compared to existing piecewise-linear models in literature. Furthermore, remarkable results are obtained for the well-known benchmarking data sets (Buba Liver Disorders, Blood Tranfusion and Pima Indian Diabetes) when compared to the original CDT model.