Abstract
Due to its simplicity and good performance, Random Forest attains much interest from the research community. The splitting attribute at each node of a decision tree for Random Forest is determined from a predefined number of randomly selected attributes (a subset of the entire attribute set). The size of an attribute subset (subspace) is one of the most important factors that stems multitude of influences over Random Forest. In this paper, we propose a new technique that dynamically determines the size of subspaces based on the relative size of the current data segment to the entire data set. In order to assess the effects of the proposed technique, we conduct experiments involving five widely used data set from the UCI Machine Learning Repository. The experimental results indicate the capability of the proposed technique on improving the ensemble accuracy of Random Forest.
Original language | English |
---|---|
Title of host publication | Proceedings of the 13th International Conference of Advanced Data Mining and Applications (ADMA 2017) |
Editors | Gao Cong, Wen-Chih Peng, Wei Emma Zhang, Chengliang Li, Aixin Sun |
Publisher | Springer |
Pages | 303-312 |
Number of pages | 10 |
ISBN (Electronic) | 9783319691794 |
ISBN (Print) | 9783319691787 |
DOIs | |
Publication status | Published - 2017 |
Event | The 13th International Conference on Advanced Data Mining and Applications (ADMA): ADMA 2017 - Nanyang Technological University Alumni House, Singapore, Singapore Duration: 05 Nov 2017 → 06 Nov 2017 http://www.adma2017.net/#0 (Conference website) |
Publication series
Name | Edit Lecture Notes in Artificial Intelligence |
---|---|
Publisher | Springer |
Volume | 10604 |
ISSN (Print) | 0302-9743 |
ISSN (Electronic) | 1611-3349 |
Conference
Conference | The 13th International Conference on Advanced Data Mining and Applications (ADMA) |
---|---|
Country/Territory | Singapore |
City | Singapore |
Period | 05/11/17 → 06/11/17 |
Other | The year 2017 marks the 13th anniversary of the International Conference on Advanced Data Mining and Applications (ADMA 2017), which will be held in Singapore, 5—6 Nov 2016, co-located with ACM CIKM2017. It is our great pleasure to invite you to contribute papers and participate in this premier annual event on research and applications of data mining. A growing attention has been paid to the study, development, and application of data mining. As a result, there is an urgent need for sophisticated techniques and tools that can handle new subfields of data mining, e.g., smartphone and social network data mining, spatial data mining, streaming data mining, green computing data mining, biomedical data mining, the Internet of Things, and data mining for healthcare. Our expertise in data mining also has to be expanded to new applications. |
Internet address |
|