Impurity

Task

Formula

Description

Gini impurity

Classification

i = 1 C f i ( 1 f i )

fi is the frequency of label i at node and C is the number of unique labels

Entropy

Classification

i = 1 C f i log ( f i )

fi is the frequency of label i at node and C is the number of unique labels

Variance/Mean Square Error(MSE)

Regression

1 N i = 1 N ( y i μ ) 2

yi is the label for an instance, N is the number of instance and μ is the mean given by 1 N i = 1 N y i

Variance/Mean Absoluter Error (MAE) (Scikit-learn only)

Regression

1 N i = 1 N | y i μ |

yi is the label for an instance, N is the number of instance and μ is the mean given by 1 N i = 1 N y i