Abstract
In this paper we develop a new post-pruning algorithm. This new pruning algorithm uses two or more postpruning algorithms to prune a decision tree that has been built on training set by different orders, and the "best" tree is selected based either on separate test set accuracy or cross-validations from trees coming from result of the above step. The algorithm is theoretically based on occam's razor that is a simpler model is chosen if two models have the same performance on the training set. An experiment is implemented on three databases in UCI machine learning repository and the new algorithm is employed to compares with two well-known post-pruning algorithms. The results show that the hybrid pruning algorithm effectively reduces the complexity of decision trees without sacrificing accuracy.
Original language | English |
---|---|
Title of host publication | Proceedings : 2009 International Forum on Computer Science-Technology and Applications |
Publisher | IEEE |
Pages | 30-33 |
Number of pages | 4 |
Volume | 3 |
ISBN (Electronic) | 9781424454235 |
ISBN (Print) | 9780769539300, 9781424454228 |
DOIs | |
Publication status | Published - 2009 |
Externally published | Yes |
Event | 2009 International Forum on Computer Science-Technology and Applications, IFCSTA 2009 - Chongqing, China Duration: 25 Dec 2009 → 27 Dec 2009 |
Publication series
Name | International Forum on Computer Science-Technology and Applications (IFCSTA) |
---|---|
Publisher | IEEE |
Conference
Conference | 2009 International Forum on Computer Science-Technology and Applications, IFCSTA 2009 |
---|---|
Country/Territory | China |
City | Chongqing |
Period | 25/12/09 → 27/12/09 |
Bibliographical note
This research is supported by the Natural Science Foundation of Hebei Province (F2008000635), by the key project foundation of applied fundamental research of Hebei Province (08963522D), by the plan of 100 excellent innovative scientists of the first group in Education Department of Hebei Province, and by the ScientificResearch Foundation of Hebei Province (06213548).
Keywords
- Decision tree simplification
- Decision trees
- Hybrid pruning algorithm
- Occam's razor
- Overfitting