Experiment 3.1.3—unbalanced training data with feature selection, sex performance disparities
 | Random forest classifier | | Logistic regression classifier | | Support vector machine | | Gaussian Naïve Bayes | |
Metric | Sex performance disparities (%) | t-test p value | Sex performance disparities (%) | t-test p value | Sex performance disparities (%) | t-test p value | Sex performance disparities (%) | t-test p value |
Accuracy | 3.42 | 0.00 | −2.90 | 0.01 | −2.75 | 0.01 | −3.31 | 0.00 |
FScore | 15.36 | 0.00 | 15.79 | 0.00 | 16.50 | 0.00 | 15.29 | 0.00 |
ROC_AUC | 6.61 | 0.00 | 3.60 | 0.00 | 4.90 | 0.00 | 4.99 | 0.00 |
Precision | 9.85 | 0.00 | 0.24 | 0.44 | −0.87 | 0.90 | −3.41 | 0.03 |
Recall | 18.21 | 0.00 | 21.24 | 0.00 | 20.30 | 0.00 | 18.54 | 0.00 |
False negative rate | −18.21 | 0.00 | −21.24 | 0.00 | −20.30 | 0.00 | −18.54 | 0.00 |
True negative rate | −4.99 | 0.00 | −14.04 | 0.00 | −10.50 | 0.00 | −8.57 | 0.00 |
False positive rate | 4.99 | 0.00 | 14.04 | 0.00 | 10.50 | 0.00 | 8.57 | 0.00 |
True positive rate | 18.21 | 0.00 | 21.24 | 0.00 | 20.30 | 0.00 | 18.54 | 0.00 |
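
Several rows in this table mirror one another: recall and true positive rate are identical, and the false negative rate and false positive rate disparities are the exact negatives of the true positive rate and true negative rate disparities. This follows from FNR = 100 − TPR and FPR = 100 − TNR within each group. The sketch below illustrates that relationship only. It assumes the disparity is a plain difference of per-group rates (which group is subtracted from which is not stated in this table), and the `Confusion` class, function names, and counts are all hypothetical, not values from the study.

```python
from dataclasses import dataclass


@dataclass
class Confusion:
    """Confusion-matrix counts for one group (hypothetical container)."""
    tp: int
    fp: int
    tn: int
    fn: int


def rates(c: Confusion) -> dict:
    """Return TPR, FNR, TNR and FPR for one group, as percentages."""
    tpr = 100 * c.tp / (c.tp + c.fn)   # recall / true positive rate
    tnr = 100 * c.tn / (c.tn + c.fp)   # true negative rate
    return {"TPR": tpr, "FNR": 100 - tpr, "TNR": tnr, "FPR": 100 - tnr}


def disparities(group_a: Confusion, group_b: Confusion) -> dict:
    """Assumed definition: rate for group A minus rate for group B."""
    ra, rb = rates(group_a), rates(group_b)
    return {name: ra[name] - rb[name] for name in ra}


# Hypothetical counts, chosen only to make the arithmetic easy to follow.
group_a = Confusion(tp=80, fp=30, tn=70, fn=20)
group_b = Confusion(tp=60, fp=20, tn=80, fn=40)

d = disparities(group_a, group_b)
for name, value in d.items():
    print(f"{name} disparity: {value:+.2f}")
```

Under this definition the TPR and FNR disparities always sum to zero, as do the TNR and FPR disparities, which is the pattern seen in the last four rows of the table.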