Gold standard datasets for evaluating word sense disambiguation programs期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

Gold standard datasets for evaluating word sense disambiguation programs

Authors:	Adam Kilgarriff

Affiliation:	ITRI, University of Brighton, Lewes Rd, Brighton, BN22 4GJ, U.K.

Abstract:	There are now many computer programs for automatically determining the sense in which a word is being used. One would like to be able to say which are better, which worse, and also which words, or varieties of language, present particular problems to which algorithms. An evaluation exercise is required, and such an exercise requires a “gold standard” dataset of correct answers. Producing this proves to be a difficult and challenging task. In this paper I discuss the background, challenges and strategies, and present a detailed methodology for ensuring that the gold standard is not fool's gold.

Keywords:
本文献已被 ScienceDirect 等数据库收录！