Facing Imbalanced Data: Recommendations for the Use of Performance Metrics

László A Jeni; Jeffrey F Cohn; Fernando De La Torre

doi:10.1109/ACII.2013.47

Int Conf Affect Comput Intell Interact Workshops. 2013:2013:245-251. doi: 10.1109/ACII.2013.47.

Authors

László A Jeni¹, Jeffrey F Cohn², Fernando De La Torre¹

Affiliations

¹ Carnegie Mellon University, Pittsburgh, PA.
² Carnegie Mellon University, Pittsburgh, PA; University of Pittsburgh, Pittsburgh, PA, jeffcohn@cs.cmu.edu.

Abstract

Recognizing facial action units (AUs) is important for situation analysis and automated video annotation. Previous work has emphasized face tracking and registration and the choice of features and classifiers. Relatively neglected is the effect of imbalanced data on action unit detection. While the machine learning community has become aware of the problem of skewed data for training classifiers, little attention has been paid to how skew may bias performance metrics. To address this question, we conducted experiments using both simulated classifiers and three major databases that differ in size, type of FACS coding, and degree of skew. We evaluated the influence of skew on both threshold metrics (Accuracy, F-score, Cohen's kappa, and Krippendorff's alpha) and rank metrics (area under the receiver operating characteristic (ROC) curve and precision-recall curve). With the exception of area under the ROC curve, all were attenuated by skewed distributions, in many cases dramatically so. While ROC was unaffected by skew, precision-recall curves suggest that ROC may mask poor performance. Our findings suggest that skew is a critical factor in evaluating performance metrics. To avoid or minimize skew-biased estimates of performance, we recommend reporting skew-normalized scores along with the obtained ones.
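
To make the effect of skew on threshold metrics concrete, the following is a minimal sketch, not the authors' code. It assumes a simulated classifier with fixed true-positive and true-negative rates, defines skew as the ratio of negative to positive examples, and assumes that a skew-normalized score can be obtained by down-weighting the negative-class counts by that ratio before computing the metric. The function names and the normalization step are illustrative assumptions; the paper itself should be consulted for the exact procedure.

```python
# Illustrative sketch (assumptions, not the paper's implementation):
# a classifier with fixed per-class rates is evaluated at several skews.

def confusion(tpr, tnr, n_pos, n_neg):
    """Expected confusion-matrix counts for fixed per-class rates."""
    tp = tpr * n_pos
    fn = n_pos - tp
    tn = tnr * n_neg
    fp = n_neg - tn
    return tp, fp, fn, tn

def f1(tp, fp, fn, tn):
    """F1 score of the positive class."""
    return 2 * tp / (2 * tp + fp + fn)

def kappa(tp, fp, fn, tn):
    """Cohen's kappa for a 2x2 confusion matrix."""
    n = tp + fp + fn + tn
    p_obs = (tp + tn) / n
    p_exp = ((tp + fp) * (tp + fn) + (fn + tn) * (fp + tn)) / n ** 2
    return (p_obs - p_exp) / (1 - p_exp)

def skew_normalize(tp, fp, fn, tn):
    """Assumed normalization: rescale negative-class counts by 1/skew,
    where skew = (number of negatives) / (number of positives)."""
    skew = (fp + tn) / (tp + fn)
    return tp, fp / skew, fn, tn / skew

if __name__ == "__main__":
    # Hypothetical classifier: 80% of each class is labeled correctly.
    for skew in (1, 5, 10, 50):
        counts = confusion(0.8, 0.8, n_pos=100, n_neg=100 * skew)
        normed = skew_normalize(*counts)
        print(f"skew={skew:3d}  F1={f1(*counts):.3f}  kappa={kappa(*counts):.3f}  "
              f"F1_norm={f1(*normed):.3f}  kappa_norm={kappa(*normed):.3f}")
```

Under these assumptions the classifier itself never changes, yet the obtained F1 and kappa fall as the negative class grows, while the normalized values stay at their balanced-data level. This is the kind of gap that reporting both obtained and skew-normalized scores is meant to expose.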

Grants and funding

R01 MH096951/MH/NIMH NIH HHS/United States