Table 3

F1 scores for fivefold cross-validation performance on the OIAM training set with different sets of stopwords

ModelNo removalEnglishMedicalCustomMedical+customEnglish+customEnglish+medical +custom
Naïve Bayes (multilabel)0.1570.1590.1540.1430.1660.1700.175
Naïve Bayes (multiclass)0.2250.2660.2430.2280.2450.2720.300
SVM (multilabel)0.1840.1840.1840.1840.1840.1840.184
SVM (multiclass)0.1410.1510.1410.1420.1420.1500.154
Nearest centroid0.2340.2560.2390.2340.2470.2520.278
  • OIAM, One in a Million; SVM, support vector machine.