Active learning: an empirical study of common baselines |
| |
Authors: | Maria E. Ramirez-Loaiza, Manali Sharma, Geet Kumar, Mustafa Bilgic |
| |
Affiliation: | 1. Illinois Institute of Technology, Chicago, USA |
| |
Abstract: | Most of the empirical evaluations of active learning approaches in the literature have focused on a single classifier and a single performance measure. We present an extensive empirical evaluation of common active learning baselines using two probabilistic classifiers and several performance measures on a number of large datasets. In addition to providing important practical advice, our findings highlight the importance of overlooked choices in active learning experiments in the literature. For example, one of our findings shows that model selection is as important as devising an active learning approach, and choosing one classifier and one performance measure can often lead to unexpected and unwarranted conclusions. Active learning should generally improve the model’s capability to distinguish between instances of different classes, but our findings show that the improvements provided by active learning for one performance measure often came at the expense of another measure. We present several such results, raise questions, guide users and researchers to better alternatives, caution against unforeseen side effects of active learning, and suggest future research directions. |
| |
Keywords: | |
This article is indexed in SpringerLink and other databases. |
|