What should be the optimal parameters for Random Forest classifier?

Currently i am using RF toolbox on MATLAB for a binary classification Problem

Data Set: 50000 samples and more than 250 features

So what should be the number of trees and randomly selected feature on each split to grow the trees?
can any other parameter greatly affect the results?


Pick a large number of trees, say 100. From what I have read on the Internet, pick 250 randomly selected features. However, in the original paper, Breiman used about the closest integer to logMlog2.

I would say cross-validation is usually the key to finding optimal parameters, but I do not know enough about random forests.

Source : Link , Question Author : Rizwan , Answer Author : Wok

Leave a Comment