Abstract

Due to its simplicity and good performance, Random Forest attains much interest from the research community. The splitting attribute at each node of a decision tree for Random Forest is determined from a predefined number of randomly selected attributes (a subset of the entire attribute set). The size of an attribute subset (subspace) is one of the most important factors that stems multitude of influences over Random Forest. In this paper, we propose a new technique that dynamically determines the size of subspaces based on the relative size of the current data segment to the entire data set. In order to assess the effects of the proposed technique, we conduct experiments involving five widely used data set from the UCI Machine Learning Repository. The experimental results indicate the capability of the proposed technique on improving the ensemble accuracy of Random Forest.
Original languageEnglish
Title of host publicationProceedings of the 13th International Conference of Advanced Data Mining and Applications (ADMA 2017)
EditorsGao Cong, Wen-Chih Peng, Wei Emma Zhang, Chengliang Li, Aixin Sun
PublisherSpringer
Pages303-312
Number of pages10
ISBN (Electronic) 9783319691794
ISBN (Print)9783319691787
DOIs
Publication statusPublished - 2017
EventThe 13th International Conference on Advanced Data Mining and Applications (ADMA): ADMA 2017 - Nanyang Technological University Alumni House, Singapore, Singapore
Duration: 05 Nov 201706 Nov 2017
http://www.adma2017.net/#0 (Conference website)

Publication series

NameEdit Lecture Notes in Artificial Intelligence
PublisherSpringer
Volume10604
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Conference

ConferenceThe 13th International Conference on Advanced Data Mining and Applications (ADMA)
Country/TerritorySingapore
CitySingapore
Period05/11/1706/11/17
OtherThe year 2017 marks the 13th anniversary of the International Conference on Advanced Data Mining and Applications (ADMA 2017), which will be held in Singapore, 5—6 Nov 2016, co-located with ACM CIKM2017. It is our great pleasure to invite you to contribute papers and participate in this premier annual event on research and applications of data mining.

A growing attention has been paid to the study, development, and application of data mining. As a result, there is an urgent need for sophisticated techniques and tools that can handle new subfields of data mining, e.g., smartphone and social network data mining, spatial data mining, streaming data mining, green computing data mining, biomedical data mining, the Internet of Things, and data mining for healthcare. Our expertise in data mining also has to be expanded to new applications.
Internet address

Fingerprint

Dive into the research topics of 'Effects of Dynamic Subspacing in Random Forest'. Together they form a unique fingerprint.

Cite this